SanjunLiu

Sanny Liu SanjunLiu

CV&&DL

7 followers · 36 following

Stars

NanoNets / docext

An on-premises, OCR-free unstructured data extraction and benchmarking toolkit. (https://idp-leaderboard.org/)

Python 413 28 Updated May 17, 2025

jefferyZhan / Griffon

Official repo of Griffon series including v1(ECCV 2024), v2, and G

Python 208 10 Updated Mar 29, 2025

clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,252 513 Updated Jul 11, 2024

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 34,200 4,916 Updated May 21, 2025

qubvel-org / segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 10,463 1,743 Updated May 21, 2025

mit-han-lab / efficientvit

Efficient vision foundation models for high-resolution generation and perception.

Python 2,876 224 Updated Apr 24, 2025

doc-analysis / ReadingBank

ReadingBank: A Benchmark Dataset for Reading Order Detection

105 3 Updated Aug 26, 2024

huggingface / smollm

Everything about the SmolLM2 and SmolVLM family of models

Python 2,406 141 Updated Mar 31, 2025

VikParuchuri / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,431 1,150 Updated May 20, 2025

aiming-lab / MDocAgent

MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding

Python 151 22 Updated Apr 1, 2025

SWHL / TableRecognitionMetric

Compute benchmark of table structure recognition.

Python 20 Updated Apr 23, 2024

borntyping / python-colorlog

A colored formatter for the python logging module

Python 914 94 Updated Oct 29, 2024

open-compass / VLMEvalKit

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 2,391 365 Updated May 20, 2025

opendatalab / labelU

Data annotation toolbox supports image, audio and video data.

Python 1,199 122 Updated May 20, 2025

landing-ai / vision-agent

Vision agent

Python 4,742 536 Updated May 19, 2025

HKUDS / GraphAgent

"GraphAgent: Agentic Graph Language Assistant"

Jupyter Notebook 302 40 Updated Feb 8, 2025

VikParuchuri / pdftext

Extract structured text from pdfs quickly

Python 479 48 Updated Feb 28, 2025

VikParuchuri / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 25,204 1,605 Updated May 20, 2025

kennymckormick / pyskl

A toolbox for skeleton-based action recognition.

Python 1,089 197 Updated Mar 17, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 4,975 307 Updated May 11, 2025

NirDiamant / RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 16,212 1,609 Updated May 11, 2025

RapidAI / RapidOCR

📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.

Python 4,097 441 Updated May 16, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,530 757 Updated May 15, 2025

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,913 443 Updated Aug 7, 2024

poloclub / unitable

UniTable: Towards a Unified Table Foundation Model

Jupyter Notebook 470 35 Updated Jun 4, 2024

RapidAI / RapidTable

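An inference library for sequence-based table recognition algorithms, integrating table recognition algorithms from PP-Structure, modelscope, and others.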

Python 291 22 Updated May 7, 2025

jeinlee1991 / chinese-llm-benchmark

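Currently covers 232 large models, including commercial models such as chatgpt, gpt-4o, o3-mini, Google gemini, Claude3.5, Zhipu GLM-Zero, ERNIE Bot, qwen-max, Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as DeepSeek-R1, qwq-32b, deepseek-v3, qwen2.5, llama3.3, phi-4, glm4, gemma3, mistral, Shusheng in…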

4,239 176 Updated May 17, 2025

milvus-io / milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 34,830 3,221 Updated May 21, 2025

opendatalab / MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

Python 33,831 2,724 Updated May 19, 2025

opendatalab / DocLayout-YOLO

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,242 100 Updated Apr 14, 2025
