Stars
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
Search-R1: An efficient, scalable RL training framework for LLMs that interleave reasoning and search-engine calls, built on veRL
My learning notes and code for ML systems (MLSys).
Democratizing Reinforcement Learning for LLMs
Fully open reproduction of DeepSeek-R1
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"
The group website repo for CUHK MoE Lab of High Confidence Software Technologies
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
A Survey on the Honesty of Large Language Models
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Project of ACL 2025 "MlingConf: A Comprehensive Investigation of Multilingual Confidence Estimation for Large Language Models"
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
Vite & Vue powered static site generator.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A modular graph-based Retrieval-Augmented Generation (RAG) system
雪之梦技术驿站: a personal GitBook blog built by snowdreams1006
A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
A tutorial on large language model (LLM) application development for beginner developers; read it online at https://datawhalechina.github.io/llm-universe/
《开源大模型食用指南》(Self-LLM): a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment
This repo records the evolution of LM-based dialogue systems. More details can be found in our survey paper: A Survey of the Evolution of Language Model-Based Dialogue Systems
Existing Literature about Machine Unlearning