Starred repositories
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…
[ICLR2025] DisPose: Disentangling Pose Guidance for Controllable Human Image Animation
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
SkyReels-V2: Infinite-length Film Generative model
[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation
Fold markdown documents by section.
Vim script for text filtering and alignment
Simplify navigation in large markdown files.
Instant Markdown previews from Vim
Proceed with text detection only in the selected area of the image
Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan
This is the official code repository for "MedMamba: Vision Mamba for Medical Image Classification"
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
VMamba: Visual State Space Models,code is based on mamba
OpenMMLab Pre-training Toolbox and Benchmark
[ECCV 2024] Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]
Configs and boilerplates for Label Studio's Machine Learning backend
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
PyTorch implementation of over 30 realtime semantic segmentations models, e.g. BiSeNetv1, BiSeNetv2, CGNet, ContextNet, DABNet, DDRNet, EDANet, ENet, ERFNet, ESPNet, ESPNetv2, FastSCNN, ICNet, LEDN…
SOLO and SOLOv2 for instance segmentation, ECCV 2020 & NeurIPS 2020.