Starred repositories
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
[CVPR 2025] The official implementation of "Universal Actions for Enhanced Embodied Foundation Models"
[ICRA 2025] Official implementation of Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
✨✨ Latest Papers and Benchmarks in Reasoning with Foundation Models
A curated list of foundation models for vision and language tasks
Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset
[RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation
A simple training-free approach adapting DUSt3R for dynamic scenes.
Simple Viser Viewer for 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
Gaussian Splatting from VGGSfM and Mast3r, and their comparison
Code for "OnePose: One-Shot Object Pose Estimation without CAD Models", CVPR 2022
[IROS 2020] se(3)-TrackNet: Data-driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains
GsplatLoc: Ultra-Precise Pose Optimization via 3D Gaussian Reprojection
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
[ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
A curated list of Object SLAM papers and resources
Code for "Scalable Real2Sim: Physics-Aware Asset Generation Via Robotic Pick-and-Place Setups"
[3DV 2024] Color-NeuS: Reconstructing Neural Implicit Surfaces with Color
[CVPR 2023] BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Segment-Anything + 3D. Let's lift anything to 3D.
[CVPR 2025] Official PyTorch implementation of MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.