Tags · thu-ml/tianshou

v0.4.6

Add VizDoom PPO example and results (#533)

* update vizdoom ppo example

* update README with results

Feb 25, 2022
97df511
zip
tar.gz
Notes
Downloads

v0.4.6.post1

fix conda support and keep API compatibility (#536)

* loose constrains

* fix nni issue (#478)

* fix coverage

Feb 25, 2022
c248b4f
zip
tar.gz
Notes
Downloads

v0.4.5

Fix critic network for Discrete CRR (#485)

- Fixes an inconsistency in the implementation of Discrete CRR. Now it uses `Critic` class for its critic, following conventions in other actor-critic policies;
- Updates several offline policies to use `ActorCritic` class for its optimizer to eliminate randomness caused by parameter sharing between actor and critic;
- Add `writer.flush()` in TensorboardLogger to ensure real-time result;
- Enable `test_collector=None` in 3 trainers to turn off testing during training;
- Updates the Atari offline results in README.md;
- Moves Atari offline RL examples to `examples/offline`; tests to `test/offline` per review comments.

Nov 28, 2021
3592f45
zip
tar.gz
Notes
Downloads

v0.4.4

bump to 0.4.4

Oct 13, 2021
b9eedc5
zip
tar.gz
Notes
Downloads

v0.4.3

bump to v0.4.3 (#432)

* add makefile
* bump version
* add isort and yapf
* update contributing.md
* update PR template
* spelling check

Sep 2, 2021
fc251ab
zip
tar.gz
Notes
Downloads

v0.4.2

add vizdoom example, bump version to 0.4.2 (#384)

Jun 26, 2021
ebaca6f
zip
tar.gz
Notes
Downloads

v0.4.1

Fix SAC loss explode (#333)

* change SAC action_bound_method to "clip" (tanh is hardcoded in forward)

* docstring update

* modelbase -> modelbased

Apr 4, 2021
dd4a011
zip
tar.gz
Notes
Downloads

v0.4.0

Merge pull request #302 from thu-ml/dev

v0.4.0

Mar 2, 2021
389bdb7
zip
tar.gz
Notes
Downloads

v0.3.2

v0.3.2 (#292)

Throw a warning in ListReplayBuffer.

This version update is needed because of #289, the previous v0.3.1 cannot work well under torch<=1.6.0 with cuda environment.

Feb 16, 2021
cb65b56
zip
tar.gz
Notes
Downloads

v0.3.1

Add offline trainer and discrete BCQ algorithm (#263)

The result needs to be tuned after `done` issue fixed.

Co-authored-by: n+e <trinkle23897@gmail.com>

Jan 20, 2021
a511cb4
zip
tar.gz
Notes
Downloads

Previous Next

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v0.4.6

v0.4.6.post1

v0.4.5

v0.4.4

v0.4.3

v0.4.2

v0.4.1

v0.4.0

v0.3.2

v0.3.1

Tags: thu-ml/tianshou