Stars
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
verl: Volcano Engine Reinforcement Learning for LLMs
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
GUI Grounding for Professional High-Resolution Computer Use
Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.
A research prototype of a human-centered web agent
Witness the aha moment of VLM with less than $3.
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
The official repo of the paper "MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"
A collection of papers on World Models for Autonomous Driving (and Robotics).
A framework for training world models with virtual environments, complete with an annotated environment dataset (RetroAct), an exploration agent (AutoExplore Agent), and GenieRedux-G - an implementation …
Building a comprehensive and handy list of papers for GUI agents
This repository introduces a comprehensive paper list, plus datasets, methods, and tools for memory research.
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Programmer's guide to cooking at home (content in Simplified Chinese only).
open source interpretability platform 🧠
A Bionic Reading Extension for Zotero with Verbs and Nouns Highlight
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models
sail-sg / SkyLadder
Forked from jzhang38/TinyLlama. The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling.
Paper list for Efficient Reasoning.
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
Jacobian SAEs for sparsifying LLM computation, rather than just representations
[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers
My learning notes/codes for ML SYS.
[ACL24] Official Repo of Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`
Fully open reproduction of DeepSeek-R1