GitHub - QingyuanWuNothing/DFBT: DFBT, Directly Forecasting Belief for Reinforcement Learning with Delays, ICML 2025, Poster
DFBT: Directly Forecasting Belief for Reinforcement Learning with Delays

0. Requirements

conda create -n dfbt python=3.10
conda activate dfbt
pip install -r requirement.yaml
pip install "cython<3"
pip install git+https://github.com/Farama-Foundation/d4rl@master#egg=d4rl

1. Training the directly forecasting belief

python3 scripts/train_dfbt.py --dataset_name halfcheetah --delay 128

2. Learning with the directly forecasting belief

python3 scripts/train_dfbt_sac.py --env_name HalfCheetah-v2 --delay 128
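
Both commands take a `--delay` argument. As a rough illustration of what a fixed delay means for the agent, the sketch below returns the observation from `delay` steps ago at every step. It assumes a constant observation delay and is not code from this repository: `DelayedObservationBuffer` and its `push` method are hypothetical names chosen for this example, and the integers stand in for real observations.

```python
from collections import deque


class DelayedObservationBuffer:
    """Hypothetical helper (not part of this repo): replays observations
    with a fixed delay, assuming the delay is constant over the episode."""

    def __init__(self, delay, initial_obs):
        self.delay = delay
        # Pre-fill the buffer so that the first `delay` reads return the
        # initial observation, since nothing newer has "arrived" yet.
        self.buffer = deque([initial_obs] * (delay + 1), maxlen=delay + 1)

    def push(self, obs):
        # Appending to a full deque silently drops the oldest entry,
        # so buffer[0] is always the observation from `delay` steps ago.
        self.buffer.append(obs)
        return self.buffer[0]


# Toy usage: integers stand in for observations, delay of 2 steps.
buf = DelayedObservationBuffer(delay=2, initial_obs=0)
seen = [buf.push(t) for t in (1, 2, 3, 4)]
print(seen)  # prints [0, 0, 1, 2]
```

With `--delay 128`, the same idea would mean the agent acts on information that is 128 steps old, which is the gap a belief model has to bridge.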

Belief Visualization

[Figure grid, images not reproduced. Columns: Environment, Truth, DATS, D-Dreamer, D-SAC, DFBT-SAC (ours). Rows: HalfCheetah-v2, Hopper-v2, Walker2d-v2.]

Citation

@inproceedings{wu2025directly,
  title={Directly Forecasting Belief for Reinforcement Learning with Delays},
  author={Wu, Qingyuan and Wang, Yuhui and Zhan, Simon Sinong and Wang, Yixuan and Lin, Chung-Wei and Lv, Chen and Zhu, Qi and Schmidhuber, J{\"u}rgen and Huang, Chao},
  booktitle={International Conference on Machine Learning},
  year={2025},
  organization={PMLR}
}

Acknowledgement

  1. CleanRL: https://github.com/vwxyzjn/cleanrl
  2. SAC: https://github.com/haarnoja/sac
  3. CORL: https://github.com/tinkoff-ai/CORL
