Stars
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Awesome papers for markerless animal motion capture and 3D reconstruction.
The official dataset repository of "MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description". ECCV [Oral] 2024.
[ECCV 2024 Oral] PetFace: A Large-Scale Dataset and Benchmark for Animal Identification https://arxiv.org/abs/2407.13555
Incorporating VIsual LAyout Structures for Scientific Text Classification
A Unified Toolkit for Deep Learning Based Document Image Analysis
Given a scholarly PDF, extract figures, tables, captions, and section titles.
OpenMMLab Detection Toolbox and Benchmark
Animal identification using face recognition based methods
Code for CVPR22 paper: Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection.
Code for I3D Feature Extraction
[ICCV 2023] BlendFace: Re-designing Identity Encoders for Face-Swapping https://arxiv.org/abs/2307.10854
Inflated i3d network with inception backbone, weights transfered from tensorflow
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.