8000 SanjunLiu (Sanny Liu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View SanjunLiu's full-sized avatar

Block or report SanjunLiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An on-premises, OCR-free unstructured data extraction and benchmarking toolkit. (https://idp-leaderboard.org/)

Python 413 28 Updated May 17, 2025

Official repo of Griffon series including v1(ECCV 2024), v2, and G

Python 208 10 Updated Mar 29, 2025

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,252 513 Updated Jul 11, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,200 4,916 Updated May 21, 2025

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 10,463 1,743 Updated May 21, 2025

Efficient vision foundation models for high-resolution generation and perception.

Python 2,876 224 Updated Apr 24, 2025

ReadingBank: A Benchmark Dataset for Reading Order Detection

105 3 Updated Aug 26, 2024

Everything about the SmolLM2 and SmolVLM family of models

Python 2,406 141 Updated Mar 31, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,431 1,150 Updated May 20, 2025

MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding

Python 151 22 Updated Apr 1, 2025

Compute benchmark of table structure recognition.

Python 20 Updated Apr 23, 2024

A colored formatter for the python logging module

Python 914 94 Updated Oct 29, 2024

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,391 365 Updated May 20, 2025

Data annotation toolbox supports image, audio and video data.

Python 1,199 122 Updated May 20, 2025

Vision agent

Python 4,742 536 Updated May 19, 2025

"GraphAgent: Agentic Graph Language Assistant"

Jupyter Notebook 302 40 Updated Feb 8, 2025

Extract structured text from pdfs quickly

Python 479 48 Updated Feb 28, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 25,204 1,605 Updated May 20, 2025

A toolbox for skeleton-based action recognition.

Python 1,089 197 Updated Mar 17, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,975 307 Updated May 11, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 16,212 1,609 Updated May 11, 2025

📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.

Python 4,097 441 Updated May 16, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,530 757 Updated May 15, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,913 443 Updated Aug 7, 2024

UniTable: Towards a Unified Table Foundation Model

Jupyter Notebook 470 35 Updated Jun 4, 2024

基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。

Python 291 22 Updated May 7, 2025

目前已囊括232个大模型,覆盖chatgpt、gpt-4o、o3-mini、谷歌gemini、Claude3.5、智谱GLM-Zero、文心一言、qwen-max、百川、讯飞星火、商汤senseChat、minimax等商用模型, 以及DeepSeek-R1、qwq-32b、deepseek-v3、qwen2.5、llama3.3、phi-4、glm4、gemma3、mistral、书生in…

4,239 176 Updated May 17, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 34,830 3,221 Updated May 21, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 33,831 2,724 Updated May 19, 2025

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,242 100 Updated Apr 14, 2025
Next
0