Stars
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Transparent Image Layer Diffusion using Latent Transparency
Large World Model -- Modeling Text and Video with Millions Context
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
ComfyUI nodes for the Ultimate Stable Diffusion Upscale script by Coyote-A.
ComfyUI nodes for Stable Video Diffusion
OpenMMLab Text Detection, Recognition and Understanding Toolbox
ComfyUI's ControlNet Auxiliary Preprocessors
Nodes related to video workflows
ControlNet scheduling and masking nodes with sliding context support
Improved AnimateDiff for ComfyUI and Advanced Sampling Support
ShadowDiffusion (CVPR2023), Pytorch implementation
Industry leading face manipulation platform
This repository contains the code release for the SIGGRAPH 2020 paper "One Shot 3D Photography"
[CVPR2024] DisCo: Referring Human Dance Generation in Real World
[CVPR 2023] 3D Cinemagraphy from a Single Image