Highlights
- Pro
Starred repositories
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability 8000 features and more, letting yo…
Janus-Series: Unified Multimodal Understanding and Generation Models
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT4.1/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Official repository of In-Context LoRA for Diffusion Transformers
Everything-Reactivity in ComfyUI (audio, MIDI, motion, proximity, and more).
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Real-time video and audio processing on Streamlit
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
OneTrainer is a one-stop solution for all your stable diffusion training needs.
A pytorch quantization backend for optimum
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Convert your videos to densepose and use it on MagicAnimate
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.