Stars
GRPO training script for a Qwen model on the GSM8K dataset. This script trains a Qwen model with GRPO (Group Relative Policy Optimization) on the GSM8K (Grade School Math 8K) dataset.
A simple example of using the GRPO (Group Relative Policy Optimization) trainer from the TRL library to fine-tune the Qwen2.5-1.5B model on the TL;DR summarization dataset.
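A minimal sketch of what such a script looks like, following TRL's documented GRPOTrainer quickstart; the reward function, output directory, and exact checkpoint name are illustrative assumptions, not the contents of either repo above:

    # Minimal GRPO sketch (assumes trl>=0.14 and datasets are installed).
    from datasets import load_dataset
    from trl import GRPOConfig, GRPOTrainer

    # TL;DR prompts, matching the summarization example described above.
    dataset = load_dataset("trl-lib/tldr", split="train")

    # Illustrative reward: prefer completions close to 200 characters.
    def reward_len(completions, **kwargs):
        return [-abs(200 - len(c)) for c in completions]

    trainer = GRPOTrainer(
        model="Qwen/Qwen2.5-1.5B-Instruct",       # assumed checkpoint name
        reward_funcs=reward_len,
        args=GRPOConfig(output_dir="qwen-grpo"),  # hypothetical output path
        train_dataset=dataset,
    )
    trainer.train()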
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)
General technology for enabling AI capabilities w/ LLMs and MLLMs
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into LLM safety.
Chinese safety prompts for evaluating and improving the safety of LLMs.
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
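As a hedged illustration of the memory-efficient fine-tuning this enables, a minimal LoRA setup with Unsloth's FastLanguageModel might look like the following; the checkpoint name, rank, and target modules are assumptions for illustration, not the library's defaults:

    # Minimal Unsloth LoRA sketch (assumes the unsloth package is installed).
    from unsloth import FastLanguageModel

    # Load a 4-bit quantized base model; the checkpoint name is illustrative.
    model, tokenizer = FastLanguageModel.from_pretrained(
        model_name="unsloth/Qwen2.5-1.5B-Instruct",
        max_seq_length=2048,
        load_in_4bit=True,
    )

    # Attach LoRA adapters; rank and target modules are typical choices here.
    model = FastLanguageModel.get_peft_model(
        model,
        r=16,
        target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
        lora_alpha=16,
    )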
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
0-hero / vllm-experiments
Forked from vllm-project/vllm. The official vLLM implementation is available for Mixtral; this version supports the DiscoLM Mixtral models.
A Chinese guide to prompting ChatGPT, with usage guides for various scenarios and tips on getting it to follow your instructions.
A series of large language models trained from scratch by developers @01-ai
Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial LLMs, together with an efficient, lightweight training framework for vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.).
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A series of large language models developed by Baichuan Intelligent Technology
Llama Chinese community: a continuously updated collection of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama LLMs; fully open source and commercially usable.
Chinese-LLaMA 1&2 and Chinese-Falcon base models; the ChatFlow Chinese dialogue model; a Chinese OpenLLaMA model; NLP pretraining and instruction-tuning datasets.
A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed at low training cost, covering base models, vertical-domain fine-tunes and applications, datasets, and tutorials.
The kanchil (鼷鹿) is the world's smallest even-toed ungulate; this open-source project explores whether small models (under 6B parameters) can also be aligned with human preferences.
🎉 Repo for LaWGPT, a Chinese LLaMA tuned with Chinese legal knowledge.
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment.
Chinese legal LLaMA (LLaMA for the Chinese legal domain).
KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation