8000 QingyuanWuNothing (Caqybara) · GitHub

More Web Proxy on the site http://driver.im/

QingyuanWuNothing

Follow

🦫

Focusing

Caqybara QingyuanWuNothing

🦫

Focusing

Follow

PhD student

6 followers · 11 following

University of Southampton
Southampton, England
22:34 (UTC +01:00)
https://energetic-monday-07f.notion.site/Qingyuan-s-Homepage-1f877499bea980078c16e1489150440f?pvs=4

Achievements

Achievements

Highlights

Pro

QingyuanWuNothing/README.md

self-bootstrapping on the self-reinforcing mind

Pinned Loading

AD-RL AD-RL Public

AD-RL, Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays, ICML 2024, Poster

Python 5
VDPO VDPO Public

VDPO, Variational Delayed Policy Optimization, NeurIPS 2024, Spotlight

Python 5
DFBT DFBT Public

DFBT, Directly Forecasting Belief for Reinforcement Learning with Delays, ICML 2025, Poster

Python

0