-
Zhejiang University of Technology
-
04:05
(UTC +08:00)
Lists (3)
Sort Name ascending (A-Z)
Stars
📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Tutorials for Physics-Informed Neural Networks
2025年全网最全即插即用模块,免费分享!CVPR2025,AAAI2025,ICLR2025,TNNLS2025,arXiv2025......包含人工智能全领域(机器学习、深度学习等),适用于图像分类、目标检测、实例分割、语义分割、全景分割、姿态识别、医学图像分割、视频目标分割、图像抠图、图像编辑、单目标跟踪、多目标跟踪、行人重识别、RGBT、图像去噪、去雨、去雾、去阴影、去模糊、超分辨…
CVPR 2025: Frequency Dynamic Convolution for Dense Image Prediction
torchange - A Unified Change Representation Learning Benchmark Library
The pytorch implementation for "SNUNet-CD: A Densely Connected Siamese Network for Change Detection of VHR Images"
[CVPR 2025 Highlight] Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective.
[CVPR 2025] SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
DynamicEarth: How Far are We from Open-Vocabulary Change Detection?
GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]
MambaOut: Do We Really Need Mamba for Vision? (CVPR 2025)
[CVPR 2025] SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in Structures
[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis
[CVPR 25] Official Implementation (Pytorch) of "EfficientViM: Efficient Vision Mamba with Hidden State Mixer-based State Space Duality"
A collection of papers related to Geo-spatial Information Science in CVPR 2025.
Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan
1st place solution for "xView2: Assess Building Damage" challenge.
[CVPR25] Official implementation of `MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.'