Stars
Quantized Attention achieves speedups of 2-3x and 3-5x compared to FlashAttention and xformers, without losing end-to-end metrics across language, image, and video models.
wolfecameron / nanoMoE
Forked from karpathy/nanoGPT. An extension of the nanoGPT repository for training small MoE models.
Machine Learning Engineering Open Book
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
ByteCheckpoint: A Unified Checkpointing Library for LFMs
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
A Datacenter Scale Distributed Inference Serving Framework
Minimal reproduction of DeepSeek R1-Zero
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Minimalistic large language model 3D-parallelism training
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization (an illustrative sketch of the BPE merge loop appears after this list).
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
PyTorch native quantization and sparsity for training and inference
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)
DeepEP: an efficient expert-parallel communication library
FlashMLA: Efficient MLA decoding kernels
Official Repo for Open-Reasoner-Zero
Helpful tools and examples for working with flex-attention (a brief usage sketch appears after this list).
Textbook on reinforcement learning from human feedback
verl: Volcano Engine Reinforcement Learning for LLMs
A high-throughput and memory-efficient inference and serving engine for LLMs
Fully open reproduction of DeepSeek-R1
Minimalistic 4D-parallelism distributed training framework for educational purposes
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Transform datasets at scale. Optimize datasets for fast AI model training.
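
The BPE entry above refers to the algorithm's core merge loop. The sketch below is a minimal, generic illustration of BPE training over UTF-8 bytes; it is not taken from that repository, and the function names (get_pair_counts, merge, train_bpe) are hypothetical.

```python
from collections import Counter

def get_pair_counts(ids):
    """Count occurrences of each adjacent token-id pair."""
    return Counter(zip(ids, ids[1:]))

def merge(ids, pair, new_id):
    """Replace every occurrence of `pair` in `ids` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

def train_bpe(text, num_merges):
    """Learn `num_merges` merge rules, starting from raw bytes (ids 0..255)."""
    ids = list(text.encode("utf-8"))
    merges = {}  # (id, id) pair -> new token id
    for n in range(num_merges):
        counts = get_pair_counts(ids)
        if not counts:
            break
        pair = max(counts, key=counts.get)  # most frequent adjacent pair
        new_id = 256 + n                    # new ids start after the byte range
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
    return merges

merges = train_bpe("low lower lowest", num_merges=5)
```

Each iteration greedily merges the most frequent adjacent pair into a new token id, which is the essence of BPE tokenizer training.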
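For the flex-attention entry, the sketch below shows one way to express a causal mask with PyTorch's flex_attention score_mod hook (assumes PyTorch >= 2.5); it is illustrative only and not code from that repository.

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

def causal(score, b, h, q_idx, kv_idx):
    # Send scores for future positions to -inf before the softmax.
    return torch.where(q_idx >= kv_idx, score, -float("inf"))

# Shapes: (batch, heads, sequence, head_dim)
q = torch.randn(1, 8, 128, 64)
k = torch.randn(1, 8, 128, 64)
v = torch.randn(1, 8, 128, 64)

# In practice flex_attention is typically wrapped in torch.compile on GPU;
# the plain eager call here is the simplest form.
out = flex_attention(q, k, v, score_mod=causal)
```

The score_mod callback receives the raw attention score plus batch, head, query, and key indices, which is what lets attention variants be written as small Python functions.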