Stars
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Making large AI models cheaper, faster and more accessible
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, …
Development Containers: Use a container as a full-featured development environment.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
PaddleSlim is an open-source library for deep model compression and architecture search.
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。
Handwriting Synthesis with RNNs ✏️
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
A synthetic data generator for text recognition
A toolbox of ocr models and algorithms based on MindSpore
🇨🇳🇬🇧Chinese and English word spelling corrector.(中文易错别字检测,中文拼写检测纠正。英文单词拼写校验工具)
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Gemma open-weight LLM library, from Google DeepMind