8000 LianxinRay / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View LianxinRay's full-sized avatar

Block or report LianxinRay

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GRPO Training Script for Qwen Model on GSM8K Dataset. This script trains a Qwen model using the GRPO (Generalized Reinforcement Policy Optimization) method on the GSM8K (Generalized Math 8K) datase…

Python 5 1 Updated May 24, 2025

A simple example of using GRPO (Group Relative Policy Optimization) trainer from TRL library to fine-tune Qwen2.5-1.5B model on the TL;DR summarization dataset.

Jupyter Notebook 8 1 Updated Feb 10, 2025

开源SFT数据集整理,随时补充

519 41 Updated Jun 2, 2023

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 9,362 793 Updated May 29, 2025

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Python 219 26 Updated Mar 13, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,012 311 Updated May 29, 2025

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 555 30 Updated Dec 9, 2024

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

1,416 72 Updated Jun 2, 2025

Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。

1,026 84 Updated Feb 27, 2024

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Python 652 61 Updated Jun 1, 2024

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Python 40,105 3,175 Updated Jun 6, 2025

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,169 213 Updated Oct 8, 2024

Official VLLM Implementation is available for mixtral. This version supports the DiscoLM mixtral models

Python 7 1 Updated Jan 15, 2024

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

55,327 13,569 Updated Jan 1, 2025

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

2,040 145 Updated Dec 26, 2024

Inference code for Llama models

Python 58,336 9,784 Updated Jan 26, 2025

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,830 493 Updated Nov 27, 2024

Generative Judge for Evaluating Alignment

Python 238 14 Updated Jan 18, 2024

聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)

Python 636 65 Updated Jun 30, 2023

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,536 538 Updated May 3, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 51,744 6,254 Updated Jun 6, 2025

A series of large language models developed by Baichuan Intelligent Technology

Python 4,125 295 Updated Nov 8, 2024

Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用

Python 14,599 1,306 Updated Apr 6, 2025

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Python 3,055 232 Updated Apr 14, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

20,279 1,952 Updated May 19, 2025

Kanchil(鼷鹿)是世界上最小的偶蹄目动物,这个开源项目意在探索小模型(6B以下)是否也能具备和人类偏好对齐的能力。

Python 113 5 Updated Apr 1, 2023

🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型

Python 5,980 553 Updated Jun 11, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,851 1,894 Updated Apr 30, 2024

中文法律LLaMA (LLaMA for Chinese legel domain)

Python 944 129 Updated Aug 28, 2024

KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation

Python 483 62 Updated May 8, 2023
Next
0