Stars
SOTA Re-identification Methods and Toolbox
⛹️ Pytorch ReID: A tiny, friendly, strong PyTorch implementation of a person re-id / vehicle re-id baseline. Tutorial 👉 https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
👫 Joint Discriminative and Generative Learning for Person Re-identification. CVPR'19 (Oral) 👫
[ECCV24] Keypoint Promptable Re-Identification: SOTA ReID method robust to occlusions and multi-person ambiguity
YOLOv8 pruning using torch-pruning
Build multimodal language agents for fast prototyping and production
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
Scenic: A Jax Library for Computer Vision Research and Beyond
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
(CVPR 2022) PyTorch implementation of "Self-supervised transformers for unsupervised object discovery using normalized cut"
DIAGen: Semantically Diverse Image Augmentation with Generative Models for Few-Shot Learning (GCPR 2024)
Loopers is a graphical live looper, written in Rust, designed for ease of use and rock-solid stability
Easy-to-use finetuned YOLOv8 models.
NVIDIA DeepStream SDK 7.1 / 7.0 / 6.4 / 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 / 5.1 implementation for YOLO models
Effortless AI-assisted data labeling with support from YOLO, Segment Anything (SAM+SAM2), and MobileSAM.
This Repo is the official implementation of AgentCoder and AgentCoder+.
AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, ICASSP 2023
YOLO SHOW - YOLOv11 / YOLOv10 / YOLOv9 / YOLOv8 / YOLOv7 / YOLOv5 / RTDETR / SAM / MobileSAM / FastSAM YOLO GUI based on PySide6