Stars
Training-free Regional Prompting for Diffusion Transformers 🔥
[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥
Consistency Distillation with Target Timestep Selection and Decoupled Guidance
https://wavespeed.ai/ Best inference performance optimization framework for Hugging Face Diffusers on NVIDIA GPUs.
[ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners"
OpenMMLab Text Detection, Recognition and Understanding Toolbox
StoryMaker: Towards consistent characters in text-to-image generation
DynamicPose, a simple and robust framework for animating human images.