Starred Repositories
-
SkyReels-V2: Infinite-length Film Generative model
-
Chat log tool: easily use your own chat data.
-
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
-
FastGPT is a knowledge-based platform built on LLMs. It offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you develop and deploy complex question-answering systems without extensive setup or configuration.
-
Presentation Slides for Developers
-
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
-
🚀 Strapi is the leading open-source headless CMS. It’s 100% JavaScript/TypeScript, fully customizable, and developer-first.
-
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
-
A Training-free Iterative Framework for Long Story Visualization
-
Code Implementation of "PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data"
-
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
-
An open-sourced end-to-end VLM-based GUI Agent
-
A GUI Agent application based on UI-TARS (a Vision-Language Model) that allows you to control your computer using natural language.
-
Your AI Operator for Web, Android, Automation & Testing.
-
Toolkit for linearizing PDFs for LLM datasets/training
-
AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording
-
Memory-Guided Diffusion for Expressive Talking Video Generation
-
AI model that understands text & humanoids.
-
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
-
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
-
SkyReels V1: The first and most advanced open-source human-centric video foundation model
-
An LLM-based Web Navigating Agent (KDD'24)
-
WeChat robot that can connect to large models such as DeepSeek, Gemini, ChatGPT, ChatGLM, iFlytek Spark, and Tigerbot. WeChat hook (WeChat Robot Hook).
-
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
-
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
-
SOTA Open Source TTS
-
UI Library for Design Engineers. Animated components and effects you can copy and paste into your apps. Free. Open Source.
-
One UI for ChatGPT web, Midjourney, GPTs, Suno, Luma, Runway, Viggle, Flux, Ideogram, Realtime, Pika, and Udio; supports Web, PWA, Linux, Windows, and macOS simultaneously.