Highlights
- Pro
Lists (7)
Sort Name ascending (A-Z)
Starred repositories
The official GitHub page for the survey paper "A Survey of Large Language Models".
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
A modern, powerful, and user-friendly C++ language server built from scratch
每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈
Implementing DeepSeek R1's GRPO algorithm from scratch
《Effective Modern C++》- 完成翻译
An open protocol enabling communication and interoperability between opaque agentic applications.
🧑🚀 全世界最好的LLM资料总结(视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
LearningOS / 2025s-rustling-Days-gone
Forked from LearningOS/rustling-classroom-2025s-rustling-25S-templaterustling-classroom-2025s-rustling-25S-template created by GitHub Classroom
Learning Large Language Model (LLM)(大语言模型学习)
A repository sharing the literatures about large language models
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The official Python SDK for Model Context Protocol servers and clients
Model Context Protocol(MCP) 编程极速入门
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Examples and guides for using the OpenAI API