Stars
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
Vision-Language Navigation Benchmark in Isaac Lab
Low-level locomotion policy training in Isaac Lab
[RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".
Official implementation of the Nature Machine Intelligence paper "Preserving and Combining Knowledge in Robotic Lifelong Reinforcement Learning"
Official implementation for the paper "Model-based Diffusion for Trajectory Optimization". Model-based diffusion (MBD) is a novel diffusion-based trajectory optimization framework that employs a dy…
Official implementation of "ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills"
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
[CVPR2025] Don’t Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving
Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged …
[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training
1 million FPS multi-agent driving simulator
Integrated Planning and Control for Quadrotor Navigation in Presence of Sudden Crossing Objects and Disturbances
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
Graph-based Topology Reasoning for Driving Scenes
Code for Reinforcement Learning from Vision Language Foundation Model Feedback
Evaluate TUM-format pose files on the KITTI dataset, with pose interpolation added.