Stars
This is a reproduction of open-r1 that runs GRPO training on Qwen models at 0.5B, 1.5B, 3B, and 7B, and records some interesting observations along the way (a minimal GRPO training sketch is given after this list).
This project aims to share the technical principles behind large language models together with hands-on experience (LLM engineering and LLM application deployment).
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Example models using DeepSpeed
The simplest, fastest repository for training/finetuning medium-sized GPTs.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
A fork to add multimodal model training to open-r1
Solve Visual Understanding with Reinforced VLMs
WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge
🚀🚀 Train a 26M-parameter GPT entirely from scratch in just 2 hours! 🌏
Yelp Simulator for WWW'25 AgentSociety Challenge
Retrieval and Retrieval-augmented LLMs
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
A hands-on, step-by-step course on Hugging Face Transformers; the course videos are updated in sync on Bilibili and YouTube.
Fully open reproduction of DeepSeek-R1
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
A follow-up experiment to the mini_qwen project, exploring the causes of the repetition ("parroting") phenomenon in large language models and how common knowledge injection is during the fine-tuning stage.
A project for training a large language model from scratch, covering pre-training, fine-tuning, and direct preference optimization (DPO); the model has 1B parameters and supports both Chinese and English.
Train a 1B-parameter LLM on 1T tokens from scratch, by an individual.
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
A repository for pre-training from scratch plus SFT of a small-parameter Chinese LLaMA2; a single 24 GB GPU is enough to obtain a chat-llama2 with basic Chinese Q&A ability.
ChatLM-Chinese-0.2B, a 0.2B-parameter Chinese dialogue model, with fully open-sourced code for the entire pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks, with a triple-extraction (information extraction) fine-tuning example.
Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for plugging into LangChain to load a local knowledge base for retrieval-augmented generation (RAG).
A repository for individuals to experiment with and reproduce the LLM pre-training process.
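The first entry above centers on GRPO training of Qwen models. As a point of reference, here is a minimal sketch of such a run using TRL's GRPOTrainer (the trainer open-r1 builds on); the dataset, the length-based reward, and the checkpoint name are illustrative assumptions, not that repository's actual configuration.

```python
# Minimal GRPO sketch with TRL; dataset, reward, and model name are placeholders.
from datasets import load_dataset
from trl import GRPOConfig, GRPOTrainer

# Hypothetical reward: favor longer completions (a stand-in for the rule-based
# accuracy/format rewards typically used in open-r1-style training).
def reward_len(completions, **kwargs):
    return [float(len(c)) for c in completions]

dataset = load_dataset("trl-lib/tldr", split="train")  # toy prompt dataset

trainer = GRPOTrainer(
    model="Qwen/Qwen2.5-0.5B-Instruct",  # swap in 1.5B/3B/7B for the larger runs
    reward_funcs=reward_len,
    args=GRPOConfig(output_dir="qwen-grpo", num_generations=8),
    train_dataset=dataset,
)
trainer.train()
```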