Starred repositories
Create and edit images using your voice
Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
🐬DeepChat - A smart assistant that connects powerful AI to your personal world
[AAAI2025] DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Model Compression Toolbox for Large Language Models and Diffusion Models
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Official inference framework for 1-bit LLMs
Suna - Open Source Generalist AI Agent
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"
An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance
deepbeepmeep / Wan2GP
Forked from Wan-Video/Wan2.1
Wan 2.1 for the GPU Poor
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
The ultimate training toolkit for finetuning diffusion models
PyTorch implementation of "Stable-Hair: Real-World Hair Transfer via Diffusion Model" (AAAI 2025)
A pipeline parallel training script for diffusion models.
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise"
A set of nodes to edit videos using the Hunyuan Video model
Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"