zhuohan123

Zhuohan Li zhuohan123

🎓 cs phd @ 🌁 uc berkeley | building @vllm-project | machine learning system | the real agi is the friends we made along the way

1.1k followers · 130 following

UC Berkeley
San Francisco Bay Area
22:16 (UTC -07:00)
https://zhuohan.li
@zhuohan123
in/zhuohan-li

Achievements

x4 x3 x2

Achievements

x4 x3 x2

Organizations

Stars

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 3,822 375 Updated May 21, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 1,185 92 Updated May 21, 2025

ndjc / controlfreak

A program to read, merge, and write programs for the Breville Control °Freak®

Java 23 1 Updated Dec 31, 2024

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 90,186 24,223 Updated May 22, 2025

ZachGoldberg / Startup-CTO-Handbook

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

13,638 754 Updated Mar 19, 2025

openai / chz

Python 100 4 Updated Mar 20, 2025

huggingface / kernels

Load compute kernels from the Hub

Python 130 7 Updated May 21, 2025

deepseek-ai / 3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,905 884 Updated May 21, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,367 598 Updated May 20, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,778 277 Updated May 15, 2025

dmlc / dlpack

common in-memory tensor structure

C++ 992 147 Updated May 12, 2025

genmoai / mochi

The best OSS video generation models

Python 3,168 353 Updated Jan 8, 2025

google / pyglove

Manipulating Python Programs

Python 661 29 Updated May 16, 2025

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,371 131 Updated May 22, 2025

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Cuda 810 37 Updated May 10, 2025

microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C 376 30 Updated Apr 18, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 8,983 2,403 Updated May 22, 2025

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 939 60 Updated Apr 15, 2025

EricLBuehler / mistral.rs

Blazingly fast LLM inference.

Rust 5,622 403 Updated May 22, 2025

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 54,772 6,184 Updated May 22, 2025

HabanaAI / vllm-fork

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 72 97 Updated May 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zhuohan Li zhuohan123

Achievements

Achievements

Organizations

Block or report zhuohan123

Stars

pytorch / torchtitan

tile-ai / tilelang

ndjc / controlfreak

pytorch / pytorch

ZachGoldberg / Startup-CTO-Handbook

openai / chz

huggingface / kernels

deepseek-ai / 3FS

deepseek-ai / DeepGEMM

deepseek-ai / open-infra-index

dmlc / dlpack

genmoai / mochi

google / pyglove

vllm-project / llm-compressor

efeslab / Nanoflow

microsoft / vattention

EleutherAI / lm-evaluation-harness

bytedance / flux

EricLBuehler / mistral.rs

All-Hands-AI / OpenHands

HabanaAI / vllm-fork

HazyResearch / ThunderKittens

NaiboWang / EasySpider

HPMLL / BurstGPT

lmarena / arena-hard-auto

openai / simple-evals

zeux / calm

stanfordnlp / dspy

axonn-ai / axonn

hao-ai-lab / Consistency_LLM