8000 zijian-hu (Zijian Hu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zijian-hu's full-sized avatar

Highlights

  • Pro

Block or report zijian-hu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Quantized Attention achieves speedup of 2-3x and 3-5x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.

Cuda 1,499 106 Updated May 2, 2025

An extension of the nanoGPT repository for training small MOE models.

Python 143 16 Updated Mar 9, 2025

Machine Learning Engineering Open Book

Python 13,737 828 Updated May 8, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 49,427 7,125 Updated Apr 20, 2025

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 210 7 Updated Apr 2, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,371 170 Updated May 15, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 4,035 364 Updated May 19, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,765 1,486 Updated Apr 24, 2025
HTML 14 Updated May 10, 2025

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 8,586 1,880 Updated Apr 25, 2025

Minimalistic large language model 3D-parallelism training

Python 1,871 191 Updated May 17, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,642 914 Updated Jul 1, 2024

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,771 276 Updated May 15, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,041 260 Updated May 17, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6,724 656 Updated May 19, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,664 769 Updated May 12, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,551 834 Updated Apr 29, 2025
Python 1,355 194 Updated Apr 29, 2025

Official Repo for Open-Reasoner-Zero

Python 1,919 98 Updated Apr 8, 2025

Helpful tools and examples for working with flex-attention

Python 786 45 Updated May 5, 2025

Textbook on reinforcement learning from human feedback

TeX 899 79 Updated May 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,126 963 Updated May 18, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 47,539 7,456 Updated May 18, 2025

Fully open reproduction of DeepSeek-R1

Python 24,452 2,250 Updated May 18, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,487 101 Updated Mar 7, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,358 129 Updated May 16, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,770 134 Updated Jan 17, 2025

AlphaFold 3 inference pipeline.

Python 6,475 815 Updated May 15, 2025

Transform datasets at scale. Optimize datasets for fast AI model training.

Python 478 64 Updated May 17, 2025
Next
0