Lists (4)
Sort Name ascending (A-Z)
Starred repositories
Official code for the CVPR 2025 paper "Navigation World Models".
Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models. CVPR Oral 2025.
Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/TokenBridge
Solve Visual Understanding with Reinforced VLMs
MAGI-1: Autoregressive Video Generation at Scale
Implementing DeepSeek R1's GRPO algorithm from scratch
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt,Language of GPT, 结构化提示词,结构化Prompt, Created by 「云中江树」
🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀
[CVPR 2025] EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
[CVPR 2025 Best Paper Award Candidate] VGGT: Visual Geometry Grounded Transformer
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Repo of "GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving"
Explore the Multimodal “Aha Moment” on 2B Model
Wan: Open and Advanced Large-Scale Video Generative Models
Latest Advances on System-2 Reasoning
Official Code for Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
Fully open reproduction of DeepSeek-R1
Call Arxiv API and automatically update paper list