An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,263 707 Updated Jun 19, 2025

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,794 135 Updated Jan 17, 2025

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 59,863 6,996 Updated Jul 4, 2025

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,205 50 Updated Nov 16, 2024

facebookresearch / searchformer

Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".

Jupyter Notebook 369 19 Updated Jun 11, 2024

yuzhimanhua / MATCH

MATCH: Metadata-Aware Text Classification in A Large Hierarchy (WWW'21)

Python 120 23 Updated Apr 2, 2024

yuzhimanhua / HiGitClass

HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)

Python 61 2 Updated Apr 2, 2024

yuzhimanhua / SEType

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains (AAAI'24)

Python 8 1 Updated Apr 2, 2024

yuzhimanhua / SciMult

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)

Python 11 Updated Aug 24, 2024

yuzhimanhua / MAPLE

The Effect of Metadata on Scientific Literature Tagging: A Cross-Field Cross-Model Study (WWW'23)

C++ 64 3 Updated May 27, 2023

yuzhimanhua / MetaCat

Minimally Supervised Categorization of Text with Metadata (SIGIR'20)

Python 46 3 Updated Apr 2, 2024

yuzhimanhua / MICoL

Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification (WWW'22)

Python 31 5 Updated Jun 21, 2025

yuzhimanhua / Multi-BioNER

Cross-type Biomedical Named Entity Recognition with Deep Multi-task Learning (Bioinformatics'19)

Python 135 28 Updated Jul 25, 2024

wangyu-ustc / LM4CV

The official implementation of the paper **Learning Concise and Descriptive Attributes for Visual Recognition**

Python 44 5 Updated Dec 14, 2023

wangyu-ustc / LVChat

The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**

Python 13 Updated Apr 15, 2024

wangyu-ustc / EditingLlama

Implementations of the model-editing baselines in the paper "MemoryLLM: Towards Self-Updatable Large Language Models".

Python 3 2 Updated May 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiusi Chen xiusic

Achievements

Achievements

Highlights

Block or report xiusic

Stars

xiusic / DecisionFlow

RM-R1-UIUC / RM-R1

Weixin-Liang / LLM-scientific-feedback

willccbb / verifiers

zhaochenyang20 / Awesome-ML-SYS-Tutorial

ernie-research / Tool-Augmented-Reward-Model

Zhou-Zoey / RMB-Reward-Model-Benchmark

BytedTsinghua-SIA / DAPO

xiusic / Rubric-RM

THU-KEG / RM-Bench

simplescaling / s1

volcengine / verl

agentica-project / rllm

lukeolson / illinois-letterhead

OpenRLHF / OpenRLHF