Stars
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Build a CNN network to predict 3D bounding box of car from 2D image.
Official code for NeurIPS 2023 SpotLight: VoxDet: Voxel Learning for Novel Instance Detection
Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
😇 A PyTorch-like deep learning framework. Just for fun.
Pytorch code for ICCV'23 paper. NEO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
A truly simple website template for academics
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World"
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Code release for "Learning to Detect Mobile Objects from LiDAR Scans Without Labels" [CVPR 2022]
Papers and Datasets about Point Cloud.
3D Object Detection for Autonomous Driving: A Comprehensive Survey (IJCV 2023)
This repository is an open-source PointPainting package which is easy to understand, deploy and run!
A quickstart and benchmark for pytorch distributed training.
You like pytorch? You like micrograd? You love tinygrad! ❤️