Stars
Nodes for image juxtaposition for Flux in ComfyUI
[ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations
Personalize Anything for Free with Diffusion Transformer
New generation of CLIP with fine-grained discrimination capability, ICML 2025
[CVPR 2025] Official Implementation of MotionPro: A Precise Motion Controller for Image-to-Video Generation
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
Official implementations for paper: VACE: All-in-One Video Creation and Editing
Official repo for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enou…
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
A SOTA open-source image editing model that aims to provide performance comparable to closed-source models such as GPT-4o and Gemini 2 Flash.
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
DreamO: A Unified Framework for Image Customization
🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
Image Deblurring by Exploring In-depth Properties of Transformer (IEEE TNNLS)
[NeurIPS 2024] One-Step Effective Diffusion Network for Real-World Image Super-Resolution
[CVPR 2025 Oral] Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model