Adds checkpoint frequencies for serial and batch Agents. by prabhatnagarajan · Pull Request #525 · chainer/chainerrl · GitHub
More Web Proxy on the site http://driver.im/
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
We may want to checkpoint intermediate networks during training. This PR modifies one example, and the experiments API to allow that for serial agents.
One use case of this update could be T-REX (https://arxiv.org/abs/1904.06387), which generates synthetic demonstrations by taking different checkpoints of a neural network and using them to generate demonstrations, which may be ranked according to the checkpoint order.
prabhatnagarajan
changed the title
Adds checkpoint frequencies for training serial agents
Adds checkpoint frequencies for Serial and Batch Agents.
Aug 20, 2019
I ran python train_ppo_ale.py --gpu -1 --checkpoint-frequency 20 --steps 100
and got the following output:
INFO:chainerrl.experiments.train_agent_batch:Saved the agent to results/20190820T214258.089467/40_checkpoint
INFO:chainerrl.experiments.train_agent_batch:Saved the agent to results/20190820T214258.089467/60_checkpoint
INFO:chainerrl.experiments.train_agent_batch:Saved the agent to results/20190820T214258.089467/80_checkpoint
INFO:chainerrl.experiments.train_agent_batch:Saved the agent to results/20190820T214258.089467/100_checkpoint
INFO:chainerrl.experiments.train_agent_batch:Saved the agent to results/20190820T214258.089467/104_finish
I also checked some of the directories, and they indeed exist.
INFO:chainerrl.experiments.train_agent:Saved the agent to results/20190820T214138.012278/40_checkpoint
INFO:chainerrl.experiments.train_agent:Saved the agent to results/20190820T214138.012278/60_checkpoint
INFO:chainerrl.experiments.train_agent:Saved the agent to results/20190820T214138.012278/80_checkpoint
INFO:chainerrl.experiments.train_agent:outdir:results/20190820T214138.012278 step:100 episode:0 R:2.0```
I checked that the directories were created, and that there were files inside them.
prabhatnagarajan
changed the title
Adds checkpoint frequencies for Serial and Batch Agents.
Adds checkpoint frequencies for serial and batch Agents.
Aug 21, 2019
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
We may want to checkpoint intermediate networks during training. This PR modifies one example, and the experiments API to allow that for serial agents.
One use case of this update could be T-REX (https://arxiv.org/abs/1904.06387), which generates synthetic demonstrations by taking different checkpoints of a neural network and using them to generate demonstrations, which may be ranked according to the checkpoint order.