Stars
Dense Distinct Query for End-to-End Object Detection (CVPR2023)
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
PyTorch code and models for the DINOv2 self-supervised learning method.
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
EVA Series: Visual Representation Fantasies from BAAI
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Fine tuning grounding Dino
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Collection of 383 car logos images with few variations of sizes and JSON file for better usability.
We write your reusable computer vision tools. 💜
Vehicle logo detection (VLD) is a special and significant topic in object detection for vehicle identification system applications. Nevertheless, the range of the research and analysis for VLD are …
Implementation of paper - DEYO: DETR with YOLO for End-to-End Object Detection
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization
(TPAMI 2024) A Survey on Open Vocabulary Learning
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
⛹️ Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
YMIR, a streamlined model development product.
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Deep Learning for Person Re-identification: A Survey and Outlook
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.