Stars
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
A faster LayoutReader model based on LayoutLMv3 that sorts OCR bounding boxes into reading order.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
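Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models.
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.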
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
🦜🔗 Build context-aware reasoning applications
Open-Sora: Democratizing Efficient Video Production for All
Generative Models by Stability AI
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
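RAG for local LLMs: chat with PDF/doc/txt files (ChatPDF). A pure native implementation of RAG built on a local LLM, an embedding model, and a reranker model; supports GraphRAG and requires no third-party agent libraries.
[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant
Stock: fetches stock data, computes stock indicators and chip distribution, recognizes stock patterns, and provides comprehensive stock screening, screening strategies, strategy validation and backtesting, and automated trading. Supports PC and mobile devices.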
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Model Context Protocol Servers
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
CryptMatrix / UnbalancedPSI
Forked from alibaba-edu/mpc4j
USENIX Security'24 - Unbalanced Circuit-PSI from Oblivious Key-Value Retrieval