XuandongZhao (Xuandong Zhao) / Starred · GitHub
😎
study

Starred repositories

Reproducible, flexible LLM evaluations

Python 212 33 Updated May 8, 2025
Python 3,697 365 Updated May 13, 2025

Official code for paper "GRIT: Teaching MLLMs to Think with Images"

Python 76 1 Updated Jun 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,525 1,342 Updated Jun 16, 2025

A final sanity checklist to help your CS paper get accepted, not desk rejected.

1,261 124 Updated May 7, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,453 62 Updated Jun 5, 2025

A curated list of fellowships for graduate students in Computer Science and related fields.

651 64 Updated Jun 10, 2025
Python 204 9 Updated May 14, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,370 308 Updated May 13, 2025

Distributed RL System for LLM Reasoning

Python 1,750 85 Updated Jun 16, 2025
Python 10 1 Updated Apr 7, 2025

Implementation of self-certainty as an extension of the ZeroEval project

Python 17 1 Updated May 31, 2025

Agent S: an open agentic framework that uses computers like a human

Python 5,465 556 Updated Jun 10, 2025
Python 15 Updated Feb 27, 2025
Python 6 1 Updated Feb 26, 2025

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 8,028 869 Updated Apr 30, 2025

Official Repo for Open-Reasoner-Zero

Python 1,964 104 Updated Jun 2, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,766 8,013 Updated Jun 16, 2025

Fully open reproduction of DeepSeek-R1

Python 24,802 2,293 Updated Jun 2, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,797 221 Updated Jun 10, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 20,185 1,743 Updated Jun 15, 2025

Library for fast text representation and classification.

HTML 26,256 4,773 Updated Mar 22, 2024

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 302 19 Updated Apr 24, 2025