night-chen (Yuchen Zhuang) / Starred · GitHub

Highlights

  • Pro

Python 49 1 Updated May 13, 2025

Verifiers for LLM Reinforcement Learning

Python 1,250 152 Updated Jun 9, 2025

A version of verl to support tool use

Python 206 11 Updated Jun 9, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,347 306 Updated May 13, 2025
Python 16 2 Updated Apr 8, 2025

Awesome RL Reasoning Recipes ("Triple R")

645 38 Updated Jun 9, 2025

A collection of MCP servers.

54,130 4,104 Updated Jun 10, 2025

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 545 36 Updated May 27, 2025

LLM/VLM gaming agents and model evaluation through games.

Python 621 65 Updated Jun 9, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,883 1,489 Updated Apr 24, 2025

🌎💪 BrowserGym, a Gym environment for web task automation

Python 765 104 Updated Jun 5, 2025

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 343 65 Updated Jun 10, 2025

An agent benchmark with tasks in a simulated software company.

Python 392 54 Updated May 17, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,546 538 Updated May 3, 2024

A curated list of Diffusion Model in RL resources (continually updated)

1,188 61 Updated Feb 15, 2025

[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.

2,591 167 Updated Jun 5, 2025

O1 Replication Journey

1,992 65 Updated Jan 14, 2025

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,135 1,137 Updated Jun 9, 2025

[EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning

Python 33 Updated Dec 26, 2024

[ICLR2025] Kolmogorov-Arnold Transformer

Python 783 49 Updated Mar 23, 2025

Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models.

Python 16 2 Updated Nov 19, 2024

🙌 OpenHands: Code Less, Make More

Python 57,760 6,587 Updated Jun 10, 2025
Python 5 2 Updated Jan 8, 2025
Python 11 7 Updated Jan 12, 2025

[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".

Python 226 19 Updated Aug 28, 2024

Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective

Python 32 4 Updated Jan 31, 2025

[ACL 2024] This is the code for our paper "RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records".

Python 37 3 Updated Sep 19, 2024

Source code of MOLLEO

Python 44 3 Updated Apr 11, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 14,761 2,931 Updated Jun 10, 2025

[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.

Python 71 4 Updated Nov 27, 2024