Stars
Official implementation of Continuous 3D Perception Model with Persistent State
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)
[SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
PyTorch implementation of FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models (CVPR-2024)
A generative world for general-purpose robotics & embodied AI learning.
A suite of image and video neural tokenizers
Universal Monocular Metric Depth Estimation
[3DV'25] 3D Reconstruction with Spatial Memory
Depth Any Video with Scalable Synthetic Data (ICLR 2025)
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
Empowering Unified MLLM with Multi-granular Visual Generation
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
A collaboration friendly studio for NeRFs
[CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
CUDA accelerated rasterization of gaussian splatting
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
projectaria_tools is an C++/Python open-source toolkit to interact with Project Aria data
A feature-rich command-line audio/video downloader
Easily create large video dataset from video urls