Tags · ashok-arora/tianshou

v0.5.0

update version to 0.5.0 (thu-ml#826)

Mar 13, 2023
f0afdea
zip
tar.gz

v0.4.11

fix info not pass issue in PGPolicy (thu-ml#787)

close thu-ml#775

Dec 24, 2022
1037627
zip
tar.gz

v0.4.10

bump version to 0.4.10 (thu-ml#757)

Oct 17, 2022
41ae346
zip
tar.gz

v0.4.9

bump version to 0.4.9 (thu-ml#684)

Jul 4, 2022
6505484
zip
tar.gz

v0.4.8

Add vecenv wrappers for obs_norm to support running mujoco experiment…

… with envpool (thu-ml#628)

- add VectorEnvWrapper and VectorEnvNormObs
- obs_rms store in policy save/load
- align mujoco scripts with atari: obs_norm, envpool, wandb and README

May 5, 2022
2a7c151
zip
tar.gz

v0.4.7

rename save_fn to save_best_fn to avoid ambiguity (thu-ml#575)

This PR also introduces `tianshou.utils.deprecation` for a unified deprecation wrapper.

Mar 21, 2022
2a9c928
zip
tar.gz

v0.4.6

Add VizDoom PPO example and results (thu-ml#533)

* update vizdoom ppo example

* update README with results

Feb 25, 2022
97df511
zip
tar.gz

v0.4.6.post1

fix conda support and keep API compatibility (thu-ml#536)

* loose constrains

* fix nni issue (thu-ml#478)

* fix coverage

Feb 25, 2022
c248b4f
zip
tar.gz

v0.4.5

Fix critic network for Discrete CRR (thu-ml#485)

- Fixes an inconsistency in the implementation of Discrete CRR. Now it uses `Critic` class for its critic, following conventions in other actor-critic policies;
- Updates several offline policies to use `ActorCritic` class for its optimizer to eliminate randomness caused by parameter sharing between actor and critic;
- Add `writer.flush()` in TensorboardLogger to ensure real-time result;
- Enable `test_collector=None` in 3 trainers to turn off testing during training;
- Updates the Atari offline results in README.md;
- Moves Atari offline RL examples to `examples/offline`; tests to `test/offline` per review comments.

Nov 28, 2021
3592f45
zip
tar.gz

v0.4.4

bump to 0.4.4

Oct 13, 2021
b9eedc5
zip
tar.gz

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v0.5.0

v0.4.11

v0.4.10

v0.4.9

v0.4.8

v0.4.7

v0.4.6

v0.4.6.post1

v0.4.5

v0.4.4

Tags: ashok-arora/tianshou