Stars
[WACV 2025] Official implementation of "RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation"
Simplest working implementation of Stylegan2, state of the art generative adversarial network, in Pytorch. Enabling everyone to experience disentanglement
PyTorch implementations of Generative Adversarial Networks.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Image-to-image translation with conditional adversarial nets
Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Layout Conditioned Image Generation, NeurIPS2024
diffusion-based layout-to-image generation model
[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
Dense matching library based on PyTorch
Code release for CVPR'24 submission 'OmniGlue'
FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba
Supervised Raw Video Denoising with a Benchmark Dataset on Dynamic Scenes. CVPR 2020
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR
Mora: More like Sora for Generalist Video Generation
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis