8000 vMaroon (Maroon Ayoub) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View vMaroon's full-sized avatar
  • IBM

Organizations

@IBM @RedHat-Israel @stolostron @neuralmagic @kubestellar @TekClinic @llm-d

Block or report vMaroon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Incubating P/D sidecar for llm-d

Go 8 2 Updated May 21, 2025

llm-d benchmark scripts and tooling

Shell 11 2 Updated May 22, 2025

A light weight vLLM simulator, for mocking out replicas.

Go 15 5 Updated May 22, 2025

Helm charts for llm-d

Shell 27 14 Updated May 22, 2025

Inference scheduler for llm-d

Go 39 15 Updated May 22, 2025

⛩️ Pure Golang engine for Jinja templates

Go 89 13 Updated Feb 27, 2025

llm-d is a Kubernetes-native high-performance distributed LLM inference framework

Makefile 743 38 Updated May 22, 2025

Distributed KV cache coordinator

Go 28 3 Updated May 21, 2025
Python 58 10 Updated Apr 3, 2025

Gateway API Inference Extension

Jupyter Notebook 298 87 Updated May 22, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,864 7,554 Updated May 22, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,911 886 Updated May 21, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,259 188 Updated May 21, 2025
Go 1 Updated Jan 28, 2025

LangChain for Go, the easiest way to write LLM-based programs in Go

Go 6,621 834 Updated May 22, 2025

GUI tool for visualizing the result data of deBruijn sequence complexity distribution study

C++ 2 Updated Feb 20, 2024

KubeStellar - a flexible solution for multi-cluster configuration management for edge, multi-cloud, and hybrid cloud

Go 397 119 Updated May 22, 2025

the main repository for the multicluster global hub

Go 21 33 Updated May 22, 2025
0