Stars
Official implementations for paper: VACE: All-in-One Video Creation and Editing
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
DreamO: A Unified Framework for Image Customization
Official implementation in ComfyUI of CVPR 2025 paper "HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis"
MAGI-1: Autoregressive Video Generation at Scale
Concat-ID: Towards Universal Identity-Preserving Video Synthesis
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Wan: Open and Advanced Large-Scale Video Generative Models
SkyReels V1: The first and most advanced open-source human-centric video foundation model
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
HunyuanVideo: A Systematic Framework For Large Video Generation Model
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
[CVPR 2025] Official implementation of "AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models"
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Official repository of In-Context LoRA for Diffusion Transformers
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Create images of a given character in different poses
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
A curated list of recent diffusion models for video generation, editing, and various other applications.
[CSUR] A Survey on Video Diffusion Models
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models