8000 tink2123 (xiaoting) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View tink2123's full-sized avatar
🎯
Focusing
🎯
Focusing
  • baidu
  • BeiJing

Block or report tink2123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

s1: Simple test-time scaling

Python 6,349 744 Updated Apr 4, 2025

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

10000 1,020 61 Updated Mar 9, 2025
Python 74 6 Updated Nov 8, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,270 4,359 Updated May 8, 2025

A reproduction of growing neural cellular automata using PyTorch.

Python 238 35 Updated Jul 16, 2023

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 48,555 5,905 Updated May 9, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,600 751 Updated May 8, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,797 1,736 Updated Feb 26, 2025

official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"

Python 146 8 Updated May 31, 2024

Minimal reproduction of DeepSeek R1-Zero

Python 11,720 1,482 Updated Apr 24, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,352 1,139 Updated May 8, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, …

Python 7,448 634 Updated May 9, 2025

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

JavaScript 124,129 16,604 Updated Apr 30, 2025

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 22,995 1,965 Updated May 9, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 31,434 3,532 Updated May 5, 2025

UniTable: Towards a Unified Table Foundation Model

Jupyter Notebook 465 34 Updated Jun 4, 2024

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,199 97 Updated Apr 14, 2025

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,756 121 Updated Apr 17, 2025

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,821 172 Updated Apr 25, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,533 661 Updated Feb 10, 2025

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,828 144 Updated Dec 30, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,380 1,402 Updated Mar 3, 2025

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 7,577 543 Updated Jan 3, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,288 731 Updated May 4, 2025

✨✨Latest Advances on Multimodal Large Language Models

14,966 960 Updated Apr 24, 2025

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

291 17 Updated Jun 21, 2024

🎨 数学公式识别增强版:中英文手写印刷公式、支持初级符号推导(数据结构基于 LaTeX 抽象语法树)Math Formula OCR Pro, supports handwrite, Chinese-mixed formulas and simple symbol reasoning (based on LaTeX AST).

Jupyter Notebook 1,217 236 Updated Jun 11, 2024

Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

Python 124 4 Updated Nov 13, 2023

ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.

Jupyter Notebook 364 53 Updated Aug 20, 2024
Next
0