tink2123

🎯

Focusing

xiaoting tink2123

🎯

Focusing

A giraffe with short legs

79 followers · 11 following

baidu
BeiJing

Achievements

x3 x3

Achievements

x3 x3

Lists (1)

Sort

✨ Inspiration

1 repository

Starred repositories

simplescaling / s1

s1: Simple test-time scaling

Python 6,349 744 Updated Apr 4, 2025

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

10000 1,020 61 Updated Mar 9, 2025

Fantasyele / LLaVA-KD

Python 74 6 Updated Nov 8, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 38,270 4,359 Updated May 8, 2025

chenmingxiang110 / Growing-Neural-Cellular-Automata

A reproduction of growing neural cellular automata using PyTorch.

Python 238 35 Updated Jul 16, 2023

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 48,555 5,905 Updated May 9, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,600 751 Updated May 8, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,797 1,736 Updated Feb 26, 2025

ucaslcl / Fox

official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"

Python 146 8 Updated May 31, 2024

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 11,720 1,482 Updated Apr 24, 2025

VikParuchuri / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 17,352 1,139 Updated May 8, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, …

Python 7,448 634 Updated May 9, 2025

deepseek-ai / DeepSeek-V3

Python 96,593 15,707 Updated Apr 9, 2025

f / awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

JavaScript 124,129 16,604 Updated Apr 30, 2025

Byaidu / PDFMathTranslate

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，提供 CLI/GUI/MCP/Docker/Zotero

Python 22,995 1,965 Updated May 9, 2025

Shubhamsaboo / awesome-llm-apps

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 31,434 3,532 Updated May 5, 2025

poloclub / unitable

UniTable: Towards a Unified Table Foundation Model

Jupyter Notebook 465 34 Updated Jun 4, 2024

opendatalab / DocLayout-YOLO

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,199 97 Updated Apr 14, 2025

Yuliang-Liu / Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,756 121 Updated Apr 17, 2025

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,821 172 Updated Apr 25, 2025

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,533 661 Updated Feb 10, 2025

Ucas-HaoranWei / Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,828 144 Updated Dec 30, 2024

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,380 1,402 Updated Mar 3, 2025

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 7,577 543 Updated Jan 3, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,288 731 Updated May 4, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

14,966 960 Updated Apr 24, 2025

PKU-DAIR / RAG-Survey

Forked from hymie122/RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

291 17 Updated Jun 21, 2024