Stars
โกFlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. ๐ณDocker-friendly.โกAlways in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs,โฆ
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
This app creates or read parquet dataset
A TTS model capable of generating ultra-realistic dialogue in one pass.
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
FastAPI Implementation of Orpheus TTS streaming Chatbot
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
A simple screen parsing tool towards pure vision based GUI agent
A Unified Toolkit for Deep Learning Based Document Image Analysis
An accurate GUI element detection approach based on old-fashioned CV algorithms [Upgraded on 5/July/2021]
Multi-tier UIScrollView nested scrolling solution. ๐๐๐
A Python SDK with async I/O for CQHTTP (OneBot).
Template for telegram bots using aiogram, starlette-admin, telegram login widget, and FastAPI.
Personal Project: An admin panel to manage the telegram bot in Python Django
TeleAdminPanel: An all-in-one web-based dashboard for seamless Telegram bot management and analytics. ๐๐ค
Some awesome comfyui workflows in here, and they are built using the comfyui-easy-use node package.
Centrally manage your locked layers.้ไธญ็ฎก็ๆๆ็้ๅฎๅพๅฑใ
#1 Locally hosted web application that allows you to perform various operations on PDF files
Easily communicate between iOS/OSX devices using BLE
๐ฑ Collaborative List of Open-Source iOS Apps
๐ Accelerate inference and training of ๐ค Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
The first GitHub Copilot, Codeium and ChatGPT Xcode Source Editor Extension
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRโฆ
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image