Stars
Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving
V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction
Official inference repo for FLUX.1 models
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Wan: Open and Advanced Large-Scale Video Generative Models
Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)
An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional variability in sampling steps
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A Collection of Variational Autoencoders (VAE) in PyTorch.
The world's simplest facial recognition api for Python and the command line
State-of-the-art 2D and 3D Face Analysis Project
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Codes for ID-Specific Video Customized Diffusion
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper