JumpServer is an open-source Privileged Access Management (PAM) tool that provides DevOps and IT teams with on-demand and secure access to SSH, RDP, Kubernetes, Database and RemoteApp endpoints thr…

Python 28,130 5,507 Updated Jul 24, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 26,915 2,623 Updated Apr 30, 2025

datalab-to / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 26,732 1,748 Updated Jul 22, 2025

BerriAI / litellm

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 25,888 3,577 Updated Jul 24, 2025

datalab-to / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,866 1,201 Updated Jul 22, 2025

getzep / graphiti

Build Real-Time Knowledge Graphs for AI Agents

Python 14,926 1,257 Updated Jul 23, 2025

THUDM / ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

Python 13,726 1,601 Updated Jan 13, 2025

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,682 760 Updated Jul 23, 2025

PaddlePaddle / PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Python 8,039 1,250 Updated Jul 3, 2024

vietnh1009 / ASCII-generator

ASCII generator (image to text, image to image, video to video)

Python 7,948 609 Updated Nov 22, 2024

adithya-s-k / omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 6,626 532 Updated Jun 11, 2025

apify / crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

Python 6,023 412 Updated Jul 24, 2025

andrewyng / translation-agent

Python 5,460 674 Updated Aug 4, 2024

Yuliang-Liu / MonkeyOCR

A lightweight LMM-based Document Parsing Model

Python 5,202 316 Updated Jul 23, 2025

Alibaba-NLP / WebAgent

🌐 WebAgent for Information Seeking built by Tongyi Lab: WebWalker & WebDancer & WebSailor & WebShaper https://arxiv.org/abs/2507.15061 https://arxiv.org/pdf/2507.02592

Python 4,951 366 Updated Jul 22, 2025

oomol-lab / pdf-craft

PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.

Python 3,114 179 Updated Jul 23, 2025

JefferyHcool / BiliNote

AI video note generation tool: let AI take notes on your videos

Python 2,989 334 Updated Jul 18, 2025

CatchTheTornado / text-extract-api

Document (PDF, Word, PPTX ...) extraction and parsing API using state-of-the-art OCRs + Ollama-supported models. Anonymize documents. Remove PII. Convert any document or picture to structured …

Python 2,757 229 Updated Jul 23, 2025