bazel-central-registry Public
Forked from bazelbuild/bazel-central-registry
The central registry of Bazel modules for the Bzlmod external dependency system.
Starlark, Apache License 2.0, Updated Apr 2, 2025
gemma3-int4 Public
Forked from gau-nernst/gemma3-int4
PyTorch inference library for Gemma 3 INT4
Python, Updated Mar 31, 2025
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python, Apache License 2.0, Updated Mar 10, 2025
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python, Apache License 2.0, Updated Mar 3, 2025
flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Cuda, Apache License 2.0, Updated Jan 3, 2025
torchtune Public
Forked from pytorch/torchtune
A Native-PyTorch Library for LLM Fine-tuning
Python, BSD 3-Clause "New" or "Revised" License, Updated Aug 27, 2024
rules_cuda Public
Forked from bazel-contrib/rules_cuda
Starlark implementation of bazel rules for CUDA.
Starlark, MIT License, Updated Aug 15, 2024
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python, BSD 3-Clause "New" or "Revised" License, Updated Aug 13, 2024
mlc-llm Public
Forked from mlc-ai/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Python, Apache License 2.0, Updated Jun 4, 2024
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Python, MIT License, Updated May 29, 2024