Starred repositories
[CVPR'25] UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image
[CVPR 2024] Official repository for "Tactile-Augmented Radiance Fields".
Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
Code for SGP 2025 Graduate School tutorial "Deep Learning on Meshes and Point Clouds"
The Pytorch implementation of Grounding 3D Object Affordance from 2D Interactios in Images.
[ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision
NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
Towards a Generative 3D World Engine for Embodied Intelligence
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Generative Models by Stability AI
A comprehensive list of Implicit Representations, NeRF and 3D Gaussian Splatting papers relating to SLAM/Robotics domain, including papers, videos, codes, and related websites
Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment
Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024
[ICLR'24] GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion
Mesh Silksong: Auto-Regressive Mesh Generation as Weaving Silk
[CVPR 2025] RollingDepth: Video Depth without Video Models
Official Repository for ICCV 2025 paper DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion Models
[ICLR 2025] Latent Radiance Fields with 3D-aware 2D Representations
Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Official release for SplArt: Articulation Estimation and Part-level Reconstruction with 3D Gaussian Splatting.