Stars
[CVPR‘ 2025 ] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
Official inference repo for FLUX.1 models
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Official Code for Stable Cascade
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
Karras et al. (2022) diffusion models for PyTorch
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
[CVPR 2024] CoSeR: Bridging Image and Language for Cognitive Super-Resolution
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Generative Models by Stability AI
[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Codes for CVPR2023 paper "DegAE: A New Pretraining Paradigm for Low-level Vision"
Taming Transformers for High-Resolution Image Synthesis
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
✨✨Latest Advances on Multimodal Large Language Models
[IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance