8000 xiusic (Xiusi Chen) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View xiusic's full-sized avatar

Highlights

  • Pro

Block or report xiusic

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 29 4 Updated May 29, 2025

RM-R1: Unleashing the Reasoning Potential of Reward Models

Python 113 9 Updated Jun 26, 2025

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Python 522 50 Updated Jan 11, 2024

Verifiers for LLM Reinforcement Learning

Python 1,421 175 Updated Jul 4, 2025

My learning notes/codes for ML SYS.

Python 2,746 169 Updated Jul 4, 2025

[ICLR'24 spotlight] Tool-Augmented Reward Modeling

Python 50 1 Updated Jun 6, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,401 59 Updated May 11, 2025
Jupyter Notebook 4 Updated May 4, 2025

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 55 2 Updated Jun 28, 2025

s1: Simple test-time scaling

Python 6,471 751 Updated Jun 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,335 1,715 Updated Jul 4, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,558 329 Updated Jul 3, 2025

Replicating the Illinois letterhead in latex

TeX 44 14 Updated Nov 8, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,263 707 Updated Jun 19, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,794 135 Updated Jan 17, 2025

🙌 OpenHands: Code Less, Make More

Python 59,863 6,996 Updated Jul 4, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,205 50 Updated Nov 16, 2024

Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".

Jupyter Notebook 369 19 Updated Jun 11, 2024

MATCH: Metadata-Aware Text Classification in A Large Hierarchy (WWW'21)

Python 120 23 Updated Apr 2, 2024

HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories (ICDM'19)

Python 61 2 Updated Apr 2, 2024

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains (AAAI'24)

Python 8 1 Updated Apr 2, 2024

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)

Python 11 Updated Aug 24, 2024

The Effect of Metadata on Scientific Literature Tagging: A Cross-Field Cross-Model Study (WWW'23)

C++ 64 3 Updated May 27, 2023

Minimally Supervised Categorization of Text with Metadata (SIGIR'20)

Python 46 3 Updated Apr 2, 2024

Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification (WWW'22)

Python 31 5 Updated Jun 21, 2025

Cross-type Biomedical Named Entity Recognition with Deep Multi-task Learning (Bioinformatics'19)

Python 135 28 Updated Jul 25, 2024

The official implementation of the paper **Learning Concise and Descriptive Attributes for Visual Recognition**

Python 44 5 Updated Dec 14, 2023

The official implementation of the paper **LVChat: Facilitating Long Video Comprehension**

Python 13 Updated Apr 15, 2024

Implementations of the model-editing baselines in the paper "MemoryLLM: Towards Self-Updatable Large Language Models".

Python 3 2 Updated May 15, 2024
Next
0