XuandongZhao

😎

study

Xuandong Zhao XuandongZhao

😎

study

Postdoc@UC Berkeley CS

168 followers · 246 following

UC Berkeley
Berkeley
06:19 (UTC -07:00)
https://xuandongzhao.github.io/
@xuandongzhao
in/xuandong-zhao-a3270610b

Achievements

Highlights

Lists (1)

Sort

Research

3 repositories

Starred repositories

allenai / olmes

Reproducible, flexible LLM evaluations

Python 212 33 Updated May 8, 2025

openai / simple-evals

Python 3,697 365 Updated May 13, 2025

QingyangZhang / Label-Free-RLVR

204 4 Updated Jun 15, 2025

eric-ai-lab / GRIT

Official code for paper "GRIT: Teaching MLLMs to Think with Images"

Python 76 1 Updated Jun 15, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,525 1,342 Updated Jun 16, 2025

yzhao062 / cs-paper-checklist

A final sanity checklist to help your CS paper get accepted, not desk rejected.

1,261 124 Updated May 7, 2025

XiaomiMiMo / MiMo

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,453 62 Updated Jun 5, 2025

chinasatokolo / csGraduateFellowships

A curated list of fellowships for graduate students in Computer Science and related fields.

651 64 Updated Jun 10, 2025

RLHFlow / Minimal-RL

Python 204 9 Updated May 14, 2025

agentica-project / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,370 308 Updated May 13, 2025

inclusionAI / AReaL

Distributed RL System for LLM Reasoning

Python 1,750 85 Updated Jun 16, 2025

wicai24 / DOOR-Alignment

Python 10 1 Updated Apr 7, 2025

backprop07 / Self-Certainty

Implementation of self-certainty as an extention of ZeroEval Project

Python 17 1 Updated May 31, 2025

David-Li0406 / AI-Supervision-Risk

21 2 Updated Mar 17, 2025

llm-as-a-judge / Awesome-LLM-as-a-judge

356 18 Updated Jun 3, 2025

Zhen-Tan-dmml / LLM4Annotation

574 23 Updated Jun 3, 2025

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

Python 5,465 556 Updated Jun 10, 2025

PorUna-byte / PAR

Python 15 Updated Feb 27, 2025

avduarte333 / DIS-CO

Python 6 1 Updated Feb 26, 2025

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 8,028 869 Updated Apr 30, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,964 104 Updated Jun 2, 2025

philschmid / deep-learning-pytorch-huggingface

Jupyter Notebook 1,224 246 Updated Feb 27, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,766 8,013 Updated Jun 16, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,802 2,293 Updated Jun 2, 2025

deepseek-ai / DeepSeek-R1

90,102 11,637 Updated Apr 9, 2025

MiniMax-AI / MiniMax-01

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,797 221 Updated Jun 10, 2025

huggingface / smolagents

🤗 smolagents: a barebones library for agents that think in code.

Python 20,185 1,743 Updated Jun 15, 2025

facebookresearch / fastText

Library for fast text representation and classification.

HTML 26,256 4,773 Updated Mar 22, 2024

openpsi-project / ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 302 19 Updated Apr 24, 2025

deepseek-ai / DeepSeek-V3

Python 97,628 15,874 Updated Jun 16, 2025

Starred topics

counterfactual-regret-minimization