Lists (1)
Sort Name ascending (A-Z)
Stars
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.
CityGaussian Series for High-quality Large-Scale Scene Reconstruction with Gaussians
[WACV2025] Official repository for "LumiGauss: Relightable Gaussian Splatting in the Wild"
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
This repository contains a command-line interface(CLI) that can detect and blur out faces and license plates(PII) from images and videos. The CLI takes an image or video file as input, runs an anon…
Lidar-visual dataset with ground truth 3D map for SLAM/NeRF
Python bindings to the Apriltags library
AprilTag is a visual fiducial system popular for robotics research.
3D Gaussian Splatting (3DGS) on fisheye cameras
Ray tracing and hybrid rasterization of Gaussian particles
This package contains the original 2012 AlexNet code.
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]
Matterport3D is a pretty awesome dataset for RGB-D machine learning tasks :)
This is the official release for the paper "EFM3D A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https//arxiv.org/abs/2406.10224).
Roblox Foundation Model for 3D Intelligence
projectaria_tools is an C++/Python open-source toolkit to interact with Project Aria data
[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
The Arcade Learning Environment (ALE) -- a platform for AI research.
[ECCV 2024] Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats
SpatialLM: Large Language Model for Spatial Understanding