Stars
[NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped, CVPR 2022
(IJCV2024 & ICCV2023) LSKNet: A Foundation Lightweight Backbone for Remote Sensing
Fine-grained object detection in satellite images
[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)
This is the pytorch implement of the paper "RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models"
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
[IEEE TIP 2024] TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution
Official code repository for ICLR 2024 paper "DiffusionSat: A Generative Foundation Model for Satellite Imagery"
[CVPR 2025] Official code repository for "Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach"
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
[NeurlPS2024] One-Step Effective Diffusion Network for Real-World Image Super-Resolution
(CVPR 2025) Adversarial Diffusion Compression for Real-World Image Super-Resolution [PyTorch]
Segment Anything in High Quality [NeurIPS 2023]
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Official inference repo for FLUX.1 models
Stable Diffusion web UI
WebUI extension for ControlNet
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
Official PyTorch Code for "ATPrompt: Textual Prompt Learning with Embedded Attributes"
GeoPixel: A Pixel Grounding Large Multimodal Model for Remote Sensing is specifically developed for high-resolution remote sensing image analysis, offering advanced multi-target pixel grounding cap…
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities