Stars
TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of untrimmed videos.
[ACL 2025] Graph-guided agentic framework for code localization
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
A generative world for general-purpose robotics & embodied AI learning.
A suite of image and video neural tokenizers
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
DSPy: The framework for programming—not prompting—language models
A framework for few-shot evaluation of language models.
Data and tools for generating and inspecting OLMo pre-training data.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
✨✨Latest Advances on Multimodal Large Language Models
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Gogh is a collection of color schemes for various terminal emulators, including Gnome Terminal, Pantheon Terminal, Tilix, and XFCE4 Terminal also compatible with iTerm on macOS.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)