Distributional Reinforcement Learning

This repository is by Pierre-Alexandre K. and Paul-Ambroise D. and contains the PyTorch source code to reproduce the results of Bellemare and al. ["A Distributional Perspective on Reinforcement Learning"](https://arxiv.org/abs/1707.06887).

Requirements

- Python 3.6 - Torch - OpenAI gym

Results

We used the categorical algorithm to solve [CartPole-v0](https://gym.openai.com/envs/CartPole-v0/).

The following results were not optimized over different hyperparameters, so there is room for improvement.

The evolution of the distribution for the [0, 0, 0, 0] state is the following:

Discussion

We want to extend the work of Bellemare and al. to continuous action using either ICNN, CEM or NAF to handle continuous actions. An ICNN implementation is yet available but needs optimization.

Implicit : étendre aux actions continues https://arxiv.org/pdf/1806.06923.pdf QUOTA : https://arxiv.org/pdf/1811.02073.pdf Quantile regression : c51 qrdqn DISTRIBUTED DISTRIBUTIONAL DETERMINISTIC POLICY GRADIENTS: https://openreview.net/pdf?id=SyZipzbCb

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
agents		agents
models		models
results		results
utils		utils
.gitignore		.gitignore
Analysis.ipynb		Analysis.ipynb
Create_gif.ipynb		Create_gif.ipynb
DQN.ipynb		DQN.ipynb
README.md		README.md
bayes_by_backprop.py		bayes_by_backprop.py
distributional_dqn.py		distributional_dqn.py
dqn.py		dqn.py
example_dqn.py		example_dqn.py
icnn_dqn.py		icnn_dqn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Distributional Reinforcement Learning

Requirements

Results

Discussion

About

Uh oh!

Releases

Packages

Languages

cuijiaxun/distributional_rl

Folders and files

Latest commit

History

Repository files navigation

Distributional Reinforcement Learning

Requirements

Results

Discussion

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages