History Compression via Language Models in Reinforcement Learning

Fabian Paischer^{1 2}, Thomas Adler¹, Vihang Patil¹, Angela Bitto-Nemling^{1 3}, Markus Holzleitner¹, Sebastian Lehner^{1 2}, Hamid Eghbal-zadeh¹, Sepp Hochreiter^{1 2 3}

¹ LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Austria
² ELLIS Unit Linz
³ Institute of Advanced Research in Artificial Intelligence (IARAI)

This is the repository for the paper: History Compression via Language Models in Reinforcement Learning.

Detailed blog post on this paper at this link.

To reproduce our results, first clone the repository and install the conda environment by

git clone https://github.com/ml-jku/helm.git
cd helm
conda env create -f environment.yml

After installing the conda environment you can train HELM on the KeyCorridor environment by

python main.py

A new directory ./experiments/HELM/MiniGrid-KeyCorridorS3R1-v0 will be created in which all log files and checkpoints will be stored.

All changeable parameters are stored in the config.json file and can be adjusted via command line arguments as:

python main.py --var KEY=VALUE

For example, if you would like to train on RandomMaze-v0:

python main.py --var env=RandomMaze-v0

or on the Procgen environment maze:

python main.py --var env=maze

Note that by default the Procgen environments are created in the memory distribution mode, thus only the six environments as mentioned in the paper can be trained on, all others do not support the memory mode.

By default a Tensorboard log is created which can be visualized by

tensorboard --logdir ./experiments

LICENSE

MIT LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
envs		envs
trainers		trainers
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.json		config.json
environment.yml		environment.yml
experiment.py		experiment.py
main.py		main.py
model.py		model.py
utils.py		utils.py
variables.py		variables.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

History Compression via Language Models in Reinforcement Learning

LICENSE

About

Uh oh!

Releases

Packages

Languages

License

ibagur/helm

Folders and files

Latest commit

History

Repository files navigation

History Compression via Language Models in Reinforcement Learning

LICENSE

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages