yuzhaouoe (Yu Zhao) / Starred · GitHub

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 2,809 213 Updated Jun 27, 2025
Shell 173 15 Updated Jun 27, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,058 1,655 Updated Jun 27, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

402 19 Updated Jun 23, 2025

GUI Grounding for Professional High-Resolution Computer Use

Python 216 25 Updated Jun 27, 2025

Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.

Python 187 32 Updated Jun 24, 2025

A research prototype of a human-centered web agent

Python 5,821 589 Updated Jun 26, 2025

Witness the aha moment of VLM with less than $3.

Python 3,806 289 Updated May 19, 2025

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,734 129 Updated May 30, 2025

The official repo of the paper "MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"

Python 117 9 Updated May 16, 2025

A collection of World Models papers for Autonomous Driving (and Robotics).

1,103 40 Updated Jun 22, 2025

A framework for training world models with virtual environments, complete with annotated environment dataset (RetroAct), exploration agent (AutoExplore Agent), and GenieRedux-G - an implementation …

Python 41 7 Updated Jun 15, 2025

Building a comprehensive and handy list of papers for GUI agents

Python 406 22 Updated Jun 27, 2025

Awesome Agent Training

166 11 Updated Jun 23, 2025

This repository introduces a comprehensive paper list, datasets, methods, and tools for memory research.

180 22 Updated Jun 5, 2025

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 720 69 Updated Apr 30, 2025

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 90,256 10,293 Updated Jun 25, 2025

open source interpretability platform 🧠

TypeScript 276 34 Updated Jun 23, 2025
Python 18 Updated Apr 16, 2025

A Bionic Reading Extension for Zotero with Verbs and Nouns Highlight

TypeScript 84 Updated Apr 11, 2025

[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?

Python 68 5 Updated Mar 18, 2025

Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models

Jupyter Notebook 8 Updated May 8, 2025

The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Python 33 1 Updated Mar 20, 2025

Paper list for Efficient Reasoning.

522 19 Updated Jun 23, 2025

Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)

Python 33 5 Updated Mar 7, 2025

Jacobian SAEs for sparsifying LLM computation, rather than just representations

Jupyter Notebook 4 1 Updated Jun 4, 2025

[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers

Python 69 3 Updated Jun 23, 2025

My learning notes/codes for ML SYS.

Python 2,678 168 Updated Jun 27, 2025

[ACL24] Official Repo of Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`

Python 74 16 Updated Mar 7, 2025

Fully open reproduction of DeepSeek-R1

Python 24,901 2,312 Updated Jun 26, 2025