-
mi.com
- Beijing
-
16:55
(UTC +08:00) - https://www.zhihu.com/people/csioza
- https://mp.weixin.qq.com/s/XhoaZYNBepX8VhRU1nlrag
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedMay 16, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedMay 16, 2025 -
AimRT Public
Forked from AimRT/AimRTA high-performance runtime framework for modern robotics.
C++ Other UpdatedMay 16, 2025 -
LLMs-from-scratch Public
Forked from rasbt/LLMs-from-scratchImplement a ChatGPT-like LLM in PyTorch from scratch, step by step
Jupyter Notebook Other UpdatedApr 20, 2025 -
nixl Public
Forked from ai-dynamo/nixlNVIDIA Inference Xfer Library (NIXL)
C++ Apache License 2.0 UpdatedApr 2, 2025 -
dynamo Public
Forked from ai-dynamo/dynamoA Datacenter Scale Distributed Inference Serving Framework
Rust Apache License 2.0 UpdatedMar 21, 2025 -
Mooncake Public
Forked from kvcache-ai/MooncakeMooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ Apache License 2.0 UpdatedMar 7, 2025 -
lmdeploy Public
Forked from InternLM/lmdeployLMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Python Apache License 2.0 UpdatedDec 13, 2024 -
flashinfer Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 UpdatedNov 12, 2024 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedOct 29, 2024 -
manim Public
Forked from 3b1b/manimAnimation engine for explanatory math videos
Python MIT License UpdatedOct 21, 2024 -
llama.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedOct 10, 2024 -
LLaVA Public
Forked from haotian-liu/LLaVA[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Python Apache License 2.0 UpdatedAug 12, 2024