Stars
High quality training free inpaint for every stable diffusion model.
AlUlkesh / stable-diffusion-webui-images-browser
Forked from yfszzx/stable-diffusion-webui-images-browseran images browse for stable-diffusion-webui
an embedded package for Florence-2 for quick interrogator
A ComfyUI custom node designed for advanced image background removal and object, face, clothes, and fashion segmentation, utilizing multiple models including RMBG-2.0, INSPYRENET, BEN, BEN2, BiRefN…
A Gradio web UI for Large Language Models with support for multiple inference backends.
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Enhancements & experiments for ComfyUI, mostly focusing on UI features
Official repository of In-Context LoRA for Diffusion Transformers
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
So-VITS-SVC 本地部署使用帮助文档,提供Colab笔记本 So-VITS-SVC Local Deployment Document and provide Colab notebook
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
The ultimate training toolkit for finetuning diffusion models
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos ma…
Custom nodes that extend the capabilities of Comfyui
🐳 LeetCode 算法笔记:面试、刷题、学算法。在线阅读地址:https://datawhalechina.github.io/leetcode-notes/
Port of OpenAI's Whisper model in C/C++
💬 MaxKB is an open-source AI assistant for enterprise. It seamlessly integrates RAG pipelines, supports robust workflows, and provides MCP tool-use capabilities.
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
ComfyUI adaptation of IDM-VTON for virtual try-on.
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding