Stars
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
MobileNetV2-YoloV3-Nano: 0.5BFlops 3MB HUAWEI P40: 6ms/img, YoloFace-500k:0.1Bflops 420KB:fire::fire::fire:
⚡ A newly designed ultra lightweight anchor free target detection algorithm, weight only 250K parameters, reduces the time consumption by 10% compared with yolo-fastest, and the post-processing is …
[CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search
NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥
[ICCV2021] Official code for "Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition"
This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"
A deep learning library for video understanding research.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Deep learning techniques for skin segmentation on novel abdominal dataset. Work conducted as part of the development process of an autonomous robotic ultrasound system.
[AAAI 2021] (oral) Progressive One-shot Human Parsing, [TPAMI 2023] End-to-end One-shot Human Parsing
Code repository for Self-supervised Structure-sensitive Learning, CVPR'17
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Visualizer for neural network, deep learning and machine learning models
Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Resource scheduling and cluster management for AI
Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"
Learning Optical Flow from a Few Matches (CVPR 2021)
Code release for "STMask: Spatial Feature Calibration and Temporal Fusion for Effective One-stage Video Instance Segmentation"(CVPR2021)
Official codes of CVPR21 paper: Learning Normal Dynamics in Videos with Meta Prototype Network
Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks