
Support deterministic evaluation for onpolicy algorithms #354


Merged · 7 commits merged into thu-ml:master on Apr 27, 2021

Conversation

ultmaster (Contributor)

  • I have marked all applicable categories:
    • exception-raising fix
    • algorithm implementation fix
    • documentation modification
    • new feature
  • If applicable, I have mentioned the relevant/related issue(s): Deterministic sampling for PGPolicy #353 (see the sketch after this checklist)

Less important but also useful:

  • I have visited the source website
  • I have searched through the issue tracker for duplicates
  • I have mentioned version numbers, operating system and environment, where applicable:
    import tianshou, torch, numpy, sys
    print(tianshou.__version__, torch.__version__, numpy.__version__, sys.version, sys.platform)
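
For context on the feature itself: deterministic evaluation means the policy acts greedily at test time, taking the mode of its action distribution rather than sampling from it, so repeated evaluation runs over the same states are reproducible. Below is a minimal sketch of the idea using PyTorch distributions; the helper name select_action and the deterministic flag are illustrative assumptions, not the exact code merged in this PR.

    import torch
    from torch.distributions import Categorical, Normal

    def select_action(dist, deterministic: bool) -> torch.Tensor:
        """Pick an action from a policy's action distribution."""
        # Sampling keeps exploration noise during training rollouts.
        if not deterministic:
            return dist.sample()
        # Deterministic evaluation: act on the mode of the distribution.
        if isinstance(dist, Categorical):
            # Discrete case: the mode is the highest-probability action.
            return dist.probs.argmax(dim=-1)
        if isinstance(dist, Normal):
            # Gaussian case: the mode coincides with the mean.
            return dist.mean
        raise NotImplementedError(f"no deterministic rule for {type(dist).__name__}")

    # The same logits always yield action 1 under deterministic evaluation.
    dist = Categorical(logits=torch.tensor([[0.1, 2.0, 0.3]]))
    print(select_action(dist, deterministic=True))   # tensor([1]) every time

Judging by the impacted files in the Codecov report below, the change touches the shared policy base plus pg.py, a2c.py, ppo.py, npg.py, and trpo.py, which matches the retitled scope of "onpolicy algorithms".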


codecov-commenter commented Apr 25, 2021

Codecov Report

Merging #354 (2faf4a0) into master (ff4d3cd) will increase coverage by 0.01%.
The diff coverage is 100.00%.


@@            Coverage Diff             @@
##           master     #354      +/-   ##
==========================================
+ Coverage   94.46%   94.48%   +0.01%     
==========================================
  Files          53       53              
  Lines        3413     3424      +11     
==========================================
+ Hits         3224     3235      +11     
  Misses        189      189              
Flag        Coverage Δ
unittests   94.48% <100.00%> (+0.01%) ⬆️

Flags with carried forward coverage won't be shown.

Impacted Files                       Coverage Δ
tianshou/policy/modelfree/a2c.py     96.87% <ø> (ø)
tianshou/policy/modelfree/npg.py     98.85% <ø> (ø)
tianshou/policy/modelfree/ppo.py     90.90% <ø> (ø)
tianshou/policy/modelfree/trpo.py    93.33% <ø> (ø)
tianshou/policy/base.py              79.56% <100.00%> (+0.93%) ⬆️
tianshou/policy/imitation/base.py    100.00% <100.00%> (ø)
tianshou/policy/modelfree/pg.py      96.42% <100.00%> (+0.42%) ⬆️

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ff4d3cd...2faf4a0. Read the comment docs.

@Trinkle23897 Trinkle23897 requested review from ChenDRAG and danagi April 25, 2021 12:50
@Trinkle23897 Trinkle23897 changed the title Support deterministic evaluation for PG Support deterministic evaluation for onpolicy algorithms Apr 25, 2021
@Trinkle23897 Trinkle23897 linked an issue Apr 26, 2021 that may be closed by this pull request
@Trinkle23897 Trinkle23897 merged commit f4e05d5 into thu-ml:master Apr 27, 2021
BFAnas pushed a commit to BFAnas/tianshou that referenced this pull request on May 5, 2024
Labels: none yet
Projects: none yet

Successfully merging this pull request may close these issues:
Deterministic sampling for PGPolicy #353

5 participants