YefanZhou (Yefan) / Starred · GitHub

Starred repositories

Concise Reasoning via Reinforcement Learning

Python 8 Updated Apr 16, 2025

A final sanity checklist to help your CS paper get accepted, not desk rejected.

971 111 Updated May 7, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 2,863 222 Updated May 23, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 945 43 Updated May 24, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,362 52 Updated Apr 18, 2025

A curated list for Efficient Large Language Models

Python 1,677 134 Updated Apr 23, 2025

Simple RL training for reasoning

Python 3,584 266 Updated Apr 10, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 459 39 Updated May 23, 2025

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Python 80 12 Updated Sep 13, 2024

A curated list of early exiting (LLM, CV, NLP, etc)

49 4 Updated Aug 21, 2024

TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Python 142 6 Updated Mar 13, 2025

A simple unified framework for evaluating LLMs

HTML 212 23 Updated Apr 14, 2025

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 275 15 Updated Apr 28, 2025

Code for ICLR 2025 paper "Emergence of a High-Dimensional Abstraction Phase in Language Transformers"

Python 1 1 Updated Jan 23, 2025

System 2 Reasoning Link Collection

834 74 Updated Mar 16, 2025

Fully open reproduction of DeepSeek-R1

Python 24,528 2,260 Updated May 23, 2025

A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning materials.

273 16 Updated Feb 28, 2025

Autonomous Agents (LLMs) research papers. Updated Daily.

813 41 Updated May 21, 2025

Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".

Jupyter Notebook 367 18 Updated Jun 11, 2024

This is a reading list on Large Language Model-Based Data Science Agents

16 Updated Feb 28, 2025

[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.

2,521 162 Updated May 18, 2025

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,232 136 Updated Mar 13, 2025

AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

Python 28,193 4,225 Updated May 22, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 4,444 643 Updated Mar 27, 2025

Implementation of 🥥 Coconut, Chain of Continuous Thought, in PyTorch

Python 168 8 Updated Dec 31, 2024

This repository collects all relevant resources about interpretability in LLMs

351 25 Updated Nov 1, 2024

Awesome papers in LLM interpretability

466 14 Updated May 24, 2025

Python Algorithms for Randomized Linear Algebra

Python 54 5 Updated May 3, 2023