Starred repositories
Powerful yet simple to use screenshot software 🖥️ 📸
A lightweight LMM-based Document Parsing Model
Containerization is a Swift package for running Linux containers on macOS.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
基于无障碍,高级选择器,订阅规则的自定义屏幕点击 Android 应用 | An Android APP with custom screen tapping based on Accessibility, Advanced Selectors, and Subscription Rules
Build AdGuard Home DNS server by Magisk.
Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"
The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
11 Lessons to Get Started Building AI Agents
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!
PyInjector - Inject Python code into python process.
A secure, efficient TCP/UDP tunneling solution that delivers fast, reliable access across network restrictions using pre-established TLS/TCP connections. 通用TCP/UDP隧道解决方案,免配置单文件多模式,采用控制数据双路分离架构,内置零延…
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
Access your entire server infrastructure from your local desktop
End to end, high speed, and privately self-host free version of Google Translate - 低占用速度快可私有部署的自由版 Google 翻译
A Model Context Protocol server for converting almost anything to Markdown
Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, advanced RAG, advanced summaries, scriptable, etc
Reverse Engineering: Decompiling Binary Code with Large Language Models