8000 bonninr / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View bonninr's full-sized avatar

Block or report bonninr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 52,174 6,299 Updated Jun 12, 2025
4 Updated Nov 19, 2024

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 3,920 553 Updated May 12, 2025

🏀 Basketball Video Analysis: Leverage automated detection and tracking of players, ball, and team assignments using advanced object tracking, zero-shot classification, and keypoint detection with Y…

Jupyter Notebook 48 8 Updated May 6, 2025

visualization of 3d models

Python 3 1 Updated Apr 17, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 15,093 1,984 Updated Jun 12, 2025

[WACV 2025] Implementation of RGB2Point:3D Point Cloud Generation from Single RGB Images

Python 28 9 Updated Jun 2, 2025

CAD-Recode: Reverse Engineering CAD Code from Point Clouds

Jupyter Notebook 127 14 Updated Mar 19, 2025

Official repo and evaluation implementation of VSI-Bench

Python 503 28 Updated Feb 28, 2025

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,486 118 Updated Jun 2, 2025

Quickly and securely turn your code projects into LLM prompts, all locally on your own machine!

HTML 565 51 Updated Feb 27, 2025

Object detection in soccer scenes trained only with synthetic data (Blender renders)

Jupyter Notebook 7 Updated Sep 17, 2023

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,405 1,880 Updated Mar 26, 2025

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Python 2,398 212 Updated Apr 11, 2025

DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.dig…

Python 1,329 187 Updated Jun 10, 2025

A Rust library integrated with ONNXRuntime, providing a collection of Computer Vison and Vision-Language models.

Rust 174 24 Updated Jun 13, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 54,983 5,364 Updated Jun 13, 2025

Vision infrastructure to turn complex documents into RAG/LLM-ready data

Rust 2,205 132 Updated Jun 10, 2025
Dockerfile 93 40 Updated Jan 27, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 4 2 Updated Aug 9, 2024

A synthetic data generator for text recognition

Python 3,495 1,006 Updated Jul 18, 2024

Deep Learning based Image Segmentation Model to extract QR code regions from an Image

Jupyter Notebook 3 Updated Nov 10, 2021

The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""

Python 66 3 Updated Jun 5, 2025

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,806 512 Updated Jun 12, 2025

A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research works related to scene text detection, spotting, etc., includ…

TeX 86 4 Updated Nov 12, 2024

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 26,910 3,356 Updated Sep 24, 2024

Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools,…

Python 50,338 8,307 Updated Jun 13, 2025

A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.

Python 1,455 370 Updated Aug 1, 2024
Next
0