-
Georgia Institute of Technology
- Atlanta
- night-chen.github.io
Highlights
- Pro
Stars
Democratizing Reinforcement Learning for LLMs
Awesome RL Reasoning Recipes ("Triple R")
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
LLM/VLM gaming agents and model evaluation through games.
Minimal reproduction of DeepSeek R1-Zero
🌎💪 BrowserGym, a Gym environment for web task automation
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.
An agent benchmark with tasks in a simulated software company.
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
A curated list of Diffusion Model in RL resources (continually updated)
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
[EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning
Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models.
[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".
Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective
[ACL 2024] This is the code for our paper ”RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records“.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.