Stars
slime is a LLM post-training framework aiming at scaling RL.
r2e: turn any github repository into a programming agent environment
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
verl: Volcano Engine Reinforcement Learning for LLMs
My learning notes/codes for ML SYS.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Fully open reproduction of DeepSeek-R1
Efficient Triton Kernels for LLM Training
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)
A generative world for general-purpose robotics & embodied AI learning.
Xiaomi Home Integration for Home Assistant
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Janus-Series: Unified Multimodal Understanding and Generation Models
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
We write your reusable computer vision tools. 💜
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
how to optimize some algorithm in cuda.
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
SGLang is a fast serving framework for large language models and vision language models.