-
Dartmouth College
- Hanover, NH
-
10:58
(UTC -12:00) - https://yefanzhou.github.io/
Starred repositories
Concise Reasoning via Reinforcement Learning
A final sanity checklist to help your CS paper get accepted, not desk rejected.
The simplest, fastest repository for training/finetuning small-sized VLMs.
Understanding R1-Zero-Like Training: A Critical Perspective
Implementing DeepSeek R1's GRPO algorithm from scratch
A curated list for Efficient Large Language Models
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]
A curated list of early exiting (LLM, CV, NLP, etc)
TokenSkip: Controllable Chain-of-Thought Compression in LLMs
WildEval / ZeroEval
Forked from allenai/WildBenchA simple unified framework for evaluating LLMs
Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Code for ICLR 2025 paper "Emergence of a High-Dimensional Abstraction Phase in Language Transformers"
Fully open reproduction of DeepSeek-R1
A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning materials.
Autonomous Agents (LLMs) research papers. Updated Daily.
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
This is the reading list of Large Language Model-Based Data Science Agent
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
This repository collects all relevant resources about interpretability in LLMs
awesome papers in LLM interpretability