8000 YefanZhou (Yefan) / Starred · GitHub

More Web Proxy on the site http://driver.im/

YefanZhou

Follow

🎯

Focusing

Yefan YefanZhou

🎯

Focusing

Follow

CS PhD Candidate @ Dartmouth College Ex-Master in EECS at UC Berkeley

25 followers · 26 following

Dartmouth College
Hanover, NH
10:58 (UTC -12:00)
https://yefanzhou.github.io/

Achievements

Achievements

Starred repositories

ai-wand / concise-reasoning

Concise Reasoning via Reinforcement Learning

Python 8 Updated Apr 16, 2025

yzhao062 / cs-paper-checklist

A final sanity checklist to help your CS paper get accepted, not desk rejected.

971 111 Updated May 7, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,863 222 Updated May 23, 2025

sail-sg / understand-r1-zero

Understanding R1-Zero-Like Training: A Critical Perspective

Python 945 43 Updated May 24, 2025

chunhuizng / audio-long-form-reasoner

Python 4 Updated Apr 23, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,362 52 Updated Apr 18, 2025

horseee / Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Python 1,677 134 Updated Apr 23, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,584 266 Updated Apr 10, 2025

McGill-NLP / nano-aha-moment

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 459 39 Updated May 23, 2025

SalesforceAIResearch / ContextualJudgeBench

Python 3 Updated Mar 21, 2025

Nota-NetsPresso / shortened-llm

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Python 80 12 Updated Sep 13, 2024

falcon-xu / early-exit-papers

A curated list of early exiting (LLM, CV, NLP, etc)

49 4 Updated Aug 21, 2024

hemingkx / TokenSkip

TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Python 142 6 Updated Mar 13, 2025

WildEval / ZeroEval

Forked from allenai/WildBench

A simple unified framework for evaluating LLMs

HTML 212 23 Updated Apr 14, 2025

LeslieTrue / SFTvsRL

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 275 15 Updated Apr 28, 2025

chengemily1 / id-llm-abstraction

Code for ICLR 2025 paper "Emergence of a High-Dimensional Abstraction Phase in Language Transformers"

Python 1 1 Updated Jan 23, 2025

open-thought / system-2-research

System 2 Reasoning Link Collection

834 74 Updated Mar 16, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,528 2,260 Updated May 23, 2025

samkhur006 / awesome-llm-planning-reasoning

A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning materials.

273 16 Updated Feb 28, 2025

tmgthb / Autonomous-Agents

Autonomous Agents (LLMs) research papers. Updated Daily.

813 41 Updated May 21, 2025

facebookresearch / searchformer

Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".

Jupyter Notebook 367 18 Updated Jun 11, 2024

Stephen-SMJ / Reading-List-of-Large-Language-Model-Based-Data-Science-Agent

This is the reading list of Large Language Model-Based Data Science Agent

16 Updated Feb 28, 2025

codefuse-ai / Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,521 162 Updated May 18, 2025

quantumiracle / Popular-RL-Algorithms

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,232 136 Updated Mar 13, 2025

feder-cr / Jobs_Applier_AI_Agent_AIHawk

AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

Python 28,193 4,225 Updated May 22, 2025

SamuelSchmidgall / AgentLaboratory

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 4,444 643 Updated Mar 27, 2025

lucidrains / coconut-pytorch

Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch

Python 168 8 Updated Dec 31, 2024

ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models

This repository collects all relevant resources about interpretability in LLMs

351 25 Updated Nov 1, 2024

zepingyu0512 / awesome-llm-understanding-mechanism

awesome papers in LLM interpretability

466 14 Updated May 24, 2025

BallisticLA / parla

Python Algorithms for Randomized Linear Algebra

Python 54 5 Updated May 3, 2023

Starred topics

jupyterlab-extension

0