Starred repositories
verl: Volcano Engine Reinforcement Learning for LLMs
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
moojink / openvla-oft
Forked from openvla/openvla
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
Machine learning metrics for distributed, scalable PyTorch applications.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Seamless operability between C++11 and Python
Open-Sora: Democratizing Efficient Video Production for All
The devkit of the nuScenes dataset.
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
Aligning pretrained language models with instruction data generated by themselves.
Ongoing research training transformer models at scale
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
High-Resolution Image Synthesis with Latent Diffusion Models
This may be the simplest implementation of DDPM. You can run Main.py directly to train the UNet on the CIFAR-10 dataset and see the amazing process of denoising.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Denoising Diffusion Probabilistic Models
A latent text-to-image diffusion model