Stars
Python tool for converting files and office documents to Markdown.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
No fortress, purely open ground. OpenManus is Coming.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
A high-quality, one-stop open-source data extraction tool for converting PDF to Markdown and JSON.
Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
JumpServer is an open-source Privileged Access Management (PAM) tool that provides DevOps and IT teams with on-demand and secure access to SSH, RDP, Kubernetes, Database and RemoteApp endpoints thr…
Open-Sora: Democratizing Efficient Video Production for All
Convert PDF to markdown + JSON quickly with high accuracy
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
OCR, layout analysis, reading order, table recognition in 90+ languages
Build Real-Time Knowledge Graphs for AI Agents
ChatGLM3 series: Open Bilingual Chat LLMs
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
ASCII generator (image to text, image to image, video to video)
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
A lightweight LMM-based Document Parsing Model
🌐 WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor & WebShaper https://arxiv.org/abs/2507.15061 https://arxiv.org/pdf/2507.02592
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
Document (PDF, Word, PPTX ...) extraction and parsing API using state-of-the-art OCRs + Ollama-supported models. Anonymize documents. Remove PII. Convert any document or picture to structured …
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
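PDF2zh for Zotero | Zotero plugin for translating PDFs into Chinese
🌟 100+ original LLM / RL principle diagrams 📚, presented by the author of 《大模型算法》 (Large Model Algorithms)! 💥 (100+ LLM/RL Algorithm Maps)
A tool for batch-extracting data from Word documents (`.docx`) and exporting it to Excel files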