Stars
[TIP 24] The offical implementation of Efficient Small Object Detection on High-Resolution Images
Start building LLM-empowered multi-agent applications in an easier way.
[ECCV 2024] The official PyTorch implementation of the "Plain-Det: A Plain Multi-Dataset Object Detector".
[CVPR24] Official Implementation of GEM (Grounding Everything Module)
A curated list of awesome resources for generic object detection in aerial images.
This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
This repository is about downloading and using the UAVOD-10 dataset
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
wenyi5608 / GroundingDINO
Forked from IDEA-Research/GroundingDINOOfficial implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.
This repository is a paper digest of Transformer-related approaches in visual tracking tasks.
YOLO-MIF(YOLOv8-RGBT) is an improved version of YOLOv8 for object detection in gray-scale images, incorporating multi-information fusion to enhance detection accuracy. The detection of RGBT mode is…
[ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3
TensorRT implementation of Depth-Anything V1, V2
ONNX-compatible Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
the official pytorch implementation of “Mamba-YOLO:SSMs-based for Object Detection”
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
This is the offical repository for "DetFusion: A Detection-driven Infrared and Visible Image Fusion Network" (ACM MM 2022).
An open and scalable video surveillance system for anyone making this world a better and more peaceful place.