Stars
Train transformer language models with reinforcement learning.
An easy-to-use, scalable, high-performance RLHF framework based on Ray (PPO, GRPO, REINFORCE++, vLLM, dynamic sampling, async agent RL)
Implementing DeepSeek R1's GRPO algorithm from scratch
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Minimal reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
This project shares technical principles and hands-on experience with large language models (LLM engineering and real-world LLM application deployment)
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM. Best practices for training LLaMA models in Megatron-LM
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
Unsupervised text tokenizer for Neural Network-based text generation.
Hackable and optimized Transformers building blocks, supporting a composable construction.
A 13B large language model developed by Baichuan Intelligent Technology
A high-throughput and memory-efficient inference and serving engine for LLMs
ChatGLM2-6B: An Open Bilingual Chat LLM
Large Language Model Text Generation Inference
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
In Building Systems With The ChatGPT API, you will learn how to automate complex workflows using chain calls to a large language model.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Examples and guides for using the OpenAI API
RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). We are at RWKV-7 "Goose", combining the best of RNNs and transformers.
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.