Tags · lunzizoo/tianshou

v0.4.1

Fix SAC loss explode (thu-ml#333)

* change SAC action_bound_method to "clip" (tanh is hardcoded in forward)

* docstring update

* modelbase -> modelbased

Apr 4, 2021
dd4a011
zip
tar.gz

v0.4.0

Merge pull request thu-ml#302 from thu-ml/dev

v0.4.0

Mar 2, 2021
389bdb7
zip
tar.gz

v0.3.2

v0.3.2 (thu-ml#292)

Throw a warning in ListReplayBuffer.

This version update is needed because of thu-ml#289, the previous v0.3.1 cannot work well under torch<=1.6.0 with cuda environment.

Feb 16, 2021
cb65b56
zip
tar.gz

v0.3.1

Add offline trainer and discrete BCQ algorithm (thu-ml#263)

The result needs to be tuned after `done` issue fixed.

Co-authored-by: n+e <trinkle23897@gmail.com>

Jan 20, 2021
a511cb4
zip
tar.gz

v0.3.0.post1

specify the meaning of logits in documentation (thu-ml#238)

Oct 8, 2020
b364f1a
zip
tar.gz

v0.3.0

change API of train_fn and test_fn (thu-ml#229)

train_fn(epoch) -> train_fn(epoch, num_env_step)
test_fn(epoch) -> test_fn(epoch, num_env_step)

Sep 26, 2020
710966e
zip
tar.gz

v0.3.0rc0

add PSRL policy (thu-ml#202)

Add PSRL policy in tianshou/policy/modelbase/psrl.py.

Co-authored-by: n+e <trinkle23897@cmu.edu>

Sep 23, 2020
dcfcbb3
zip
tar.gz

v0.2.7

fix critical bugs in MAPolicy and docs update (thu-ml#207)

- fix a bug in MAPolicy: `buffer.rew = Batch()` doesn't change `buffer.rew` (thanks mypy)
- polish examples/box2d/bipedal_hardcore_sac.py
- several docs update
- format setup.py and bump version to 0.2.7

Sep 8, 2020
64af7ea
zip
tar.gz

v0.2.6

code refactor for venv (thu-ml#179)

- Refacor code to remove duplicate code

- Enable async simulation for all vector envs

- Remove `collector.close` and rename `VectorEnv` to `DummyVectorEnv`

The abstraction of vector env changed.

Prior to this pr, each vector env is almost independent.

After this pr, each env is wrapped into a worker, and vector envs differ with their worker type. In fact, users can just use `BaseVectorEnv` with different workers, I keep `SubprocVectorEnv`, `ShmemVectorEnv` for backward compatibility.

Co-authored-by: n+e <463003665@qq.com>
Co-authored-by: magicly <magicly007@gmail.com>

Aug 19, 2020
a9f9940
zip
tar.gz

v0.2.5

docs fix and v0.2.5 (thu-ml#156)

* pre

* update docs

* update docs

* $ in bash

* size -> hidden_layer_size

* doctest

* doctest again

* filter a warning

* fix bug

* fix examples

* test fail

* test succ

Jul 22, 2020
bd9c3c7
zip
tar.gz

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v0.4.1

v0.4.0

v0.3.2

v0.3.1

v0.3.0.post1

v0.3.0

v0.3.0rc0

v0.2.7

v0.2.6

v0.2.5

Tags: lunzizoo/tianshou