8000 lonelydancer / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View lonelydancer's full-sized avatar

Block or report lonelydancer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train transformer language models with reinforcement learning.

Python 14,253 1,979 Updated Jun 19, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,131 693 Updated Jun 19, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,419 63 Updated Apr 18, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,826 278 Updated May 15, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,918 1,489 Updated Apr 24, 2025

Fully open reproduction of DeepSeek-R1

Python 24,842 2,297 Updated Jun 2, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,754 379 Updated Jun 18, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 18,654 2,220 Updated Jun 15, 2025

Best practice for training LLaMA models in Megatron-LM

Python 656 58 Updated Jan 2, 2024

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Python 3,203 365 Updated Jun 19, 2025

Inference code for Llama models

Python 58,395 9,778 Updated Jan 26, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,997 1,254 Updated Apr 1, 2025

Runs LLaMA with Extremely HIGH speed

C++ 90 10 Updated Nov 21, 2023

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,606 680 Updated Jun 18, 2025

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,971 238 Updated Sep 6, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 50,146 8,199 Updated Jun 19, 2025

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,727 1,837 Updated Jun 27, 2024

Large Language Model Text Generation Inference

Python 10,236 1,201 Updated Jun 19, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,860 1,891 Updated Apr 30, 2024

In Building Systems With The ChatGPT API, you will learn how to automate complex workflows using chain calls to a large language model.

Jupyter Notebook 59 57 Updated Jun 4, 2023

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,680 605 Updated Jul 25, 2023

Examples and guides for using the OpenAI API

MDX 64,736 10,685 Updated Jun 18, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,719 916 Updated Jun 17, 2025

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

29,567 3,382 Updated Jun 14, 2025

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python 2,790 257 Updated Dec 5, 2023

Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"

HTML 916 74 Updated Nov 25, 2023

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,196 3,686 Updated Jul 4, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 39,026 4,426 Updated Jun 19, 2025

EVA: Large-scale Pre-trained Chit-Chat Models

Python 307 50 Updated Mar 11, 2023
Next
0