Stars
Scalable toolkit for efficient model reinforcement
Repo for Objaverse++, Curated 3D Object Dataset with Quality Annotations
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
BioNeMo Framework: For building and adapting AI models in drug discovery at scale
A tool to configure, launch and manage your machine learning experiments.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Segment Anything Model for large-scale, vectorized road network extraction from aerial imagery. CVPRW 2024
Scalable data pre processing and curation toolkit for LLMs
A collection of design patterns/idioms in Python
✨✨Latest Advances on Multimodal Large Language Models
Robust recipes to align language models with human and AI preferences
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Minimalistic large language model 3D-parallelism training
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
Scalable toolkit for efficient model alignment
Ongoing research training transformer models at scale
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Machine Learning Engineering Open Book
Resources from the EleutherAI Math Reading Group
neuralsim: 3D surface reconstruction and simulation based on 3D neural rendering.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
chatbot does what you ask, like open Google search, post a Tweet, etc.
LAVIS - A One-stop Library for Language-Vision Intelligence