Stars
Provides network connectivity to WSL 2 when blocked by VPN
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
Ragflow-Plus is a derivative (secondary development) version of Ragflow that makes it simpler and more practical
A powerful tool for creating fine-tuning datasets for LLM
Toolkit for linearizing PDFs for LLM datasets/training
No fortress, purely open ground. OpenManus is Coming.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
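Provides a practical interactive interface for large language models such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; support for local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…
OCR software, free and offline. Open-source, free, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in libraries for multiple languages.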
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Build GUI for your Python program with JavaScript, HTML, and CSS
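Chinese word segmentation module: inherits the basic algorithmic logic of jieba, with comprehensive code optimization, and additionally provides support for training the HMM algorithm.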
gesean / pkuseg-python
Forked from lancopku/pkuseg-python
The pkuseg toolkit for multi-domain Chinese word segmentation
gesean / pyhanlp
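Forked from hankcs/pyhanlp
Natural language processing for Chinese: word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin conversion, and simplified/traditional conversion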
wb