fix critical bugs in MAPolicy and docs update #207

Trinkle23897 · 2020-09-06T23:27:40Z

cherry-pick from #200

fix a bug in MAPolicy: buffer.rew = Batch() doesn't change buffer.rew (thanks mypy)
polish examples/box2d/bipedal_hardcore_sac.py
several docs update
format setup.py and bump version to 0.2.7

codecov-commenter · 2020-09-06T23:35:48Z

Codecov Report

Merging #207 into master will decrease coverage by 0.49%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #207      +/-   ##
==========================================
- Coverage   94.09%   93.60%   -0.50%     
==========================================
  Files          40       40              
  Lines        2457     2454       -3     
==========================================
- Hits         2312     2297      -15     
- Misses        145      157      +12

Flag	Coverage Δ
#unittests	`93.60% <100.00%> (-0.50%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
tianshou/policy/imitation/base.py	`100.00% <ø> (ø)`
tianshou/policy/multiagent/mapolicy.py	`93.75% <100.00%> (ø)`
tianshou/trainer/onpolicy.py	`77.61% <0.00%> (-17.92%)`	⬇️
tianshou/env/worker/subproc.py	`91.09% <0.00%> (-0.07%)`	⬇️
tianshou/data/collector.py	`91.44% <0.00%> (-0.05%)`	⬇️
tianshou/data/buffer.py	`98.98% <0.00%> (-0.01%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 380e9e9...96a2cc8. Read the comment docs.

duburcqa · 2020-09-08T12:48:20Z

tianshou/policy/multiagent/mapolicy.py

-            save_rew, buffer.rew = buffer.rew, Batch()
+            # Since we do not override buffer.__setattr__, here we use _meta to
+            # change buffer.rew, otherwise buffer.rew = Batch() has no effect.
+            save_rew, buffer._meta.rew = buffer.rew, Batch()


It would be nice to find a way to avoid such trick in future release because it is error prone.

- fix a bug in MAPolicy: `buffer.rew = Batch()` doesn't change `buffer.rew` (thanks mypy) - polish examples/box2d/bipedal_hardcore_sac.py - several docs update - format setup.py and bump version to 0.2.7

fix bugs in MAPolicy

a512150

Trinkle23897 requested a review from youkaichao September 6, 2020 23:27

Trinkle23897 added 2 commits September 7, 2020 19:37

sync bipedal results

660faea

version test

f8275ed

Trinkle23897 linked an issue Sep 8, 2020 that may be closed by this pull request

How to support multi-agent reinforcement learning #121

Open

8 tasks

bump to v0.2.7

62c2bf3

Trinkle23897 requested a review from duburcqa September 8, 2020 12:18

Trinkle23897 added 2 commits September 8, 2020 20:36

polish

e376bbc

revert

96a2cc8

duburcqa approved these changes Sep 8, 2020

View reviewed changes

duburcqa reviewed Sep 8, 2020

View reviewed changes

Trinkle23897 merged commit 64af7ea into thu-ml:master Sep 8, 2020

Trinkle23897 deleted the fix-mapolicy branch September 8, 2020 13:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix critical bugs in MAPolicy and docs update #207

fix critical bugs in MAPolicy and docs update #207

Uh oh!

Trinkle23897 commented Sep 6, 2020 •

edited

Loading

Uh oh!

codecov-commenter commented Sep 6, 2020 •

edited

Loading

Uh oh!

duburcqa Sep 8, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix critical bugs in MAPolicy and docs update #207

fix critical bugs in MAPolicy and docs update #207

Uh oh!

Conversation

Trinkle23897 commented Sep 6, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov-commenter commented Sep 6, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

duburcqa Sep 8, 2020

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Trinkle23897 commented Sep 6, 2020 •

edited

Loading

codecov-commenter commented Sep 6, 2020 •

edited

Loading