Trainer refactor : flexible logger #295

ChenDRAG · 2021-02-21T08:03:29Z

This is the 4th commit of 6 commits mentioned in #274, which features refactor of trainer to fix #161. You can check #274 for more detail.
To avoid large commit as in #280, I will do this in 2 pr. first(#293 ) focus on some definition change of trainer to make it more friendly to use and be consistent with typical usage in research papers. Second pr focus on refactor of logging method to solve bug of nan reward and log interval. After these two pr, hopefully fundamental change of tianshou/data is finished. We then can concentrate on building benchmarks of tianshou finally.
This is the second pr.

This pr also fixes the bug that 'rew' is nan in tqdm visualization.

Not finished yet, lacking test and docs update. Open to discussion

tianshou/test.py

codecov-io · 2021-02-22T11:58:38Z

Codecov Report

Merging #295 (de2ecaf) into dev (e99e1b0) will decrease coverage by 0.05%.
The diff coverage is 96.07%.

@@            Coverage Diff             @@
##              dev     #295      +/-   ##
==========================================
- Coverage   93.94%   93.89%   -0.06%     
==========================================
  Files          45       47       +2     
  Lines        3202     3241      +39     
==========================================
+ Hits         3008     3043      +35     
- Misses        194      198       +4

Flag	Coverage Δ
unittests	`93.89% <96.07%> (-0.06%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
tianshou/data/buffer.py	`93.29% <ø> (-0.02%)`	⬇️
tianshou/data/collector.py	`94.89% <ø> (ø)`
tianshou/utils/log_tools.py	`93.22% <93.22%> (-6.78%)`	⬇️
tianshou/policy/base.py	`76.80% <100.00%> (ø)`
tianshou/trainer/offline.py	`100.00% <100.00%> (ø)`
tianshou/trainer/offpolicy.py	`100.00% <100.00%> (ø)`
tianshou/trainer/onpolicy.py	`97.01% <100.00%> (-0.05%)`	⬇️
tianshou/trainer/utils.py	`100.00% <100.00%> (ø)`
tianshou/utils/__init__.py	`100.00% <100.00%> (ø)`
... and 1 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e99e1b0...de2ecaf. Read the comment docs.

examples/mujoco/runnable/halfcheetahBullet_v0_sac.py

tianshou/trainer/utils.py

ChenDRAG · 2021-02-23T13:50:30Z

Ready I think

examples/atari/atari_c51.py

examples/atari/atari_dqn.py

Co-authored-by: n+e <trinkle23897@qq.com>

This PR focus on refactor of logging method to solve bug of nan reward and log interval. After these two pr, hopefully fundamental change of tianshou/data is finished. We then can concentrate on building benchmarks of tianshou finally. Things changed: 1. trainer now accepts logger (BasicLogger or LazyLogger) instead of writer; 2. remove utils.SummaryWriter;

Trinkle23897 reviewed Feb 21, 2021

View reviewed changes

tianshou/test.py Outdated Show resolved Hide resolved

rebase

12f053c

Trinkle23897 force-pushed the trainer_logger branch from 79ef514 to 12f053c Compare February 22, 2021 11:28

ChenDRAG added 3 commits February 22, 2021 19:33

pep8

e5d2d0a

pep8

6165dd0

doc update

87952e2

ChenDRAG added 2 commits February 22, 2021 20:00

type checkout

4ba58fc

pep8

66101f6

ChenDRAG requested review from Trinkle23897, danagi and duburcqa February 23, 2021 05:05

Trinkle23897 added 2 commits February 23, 2021 17:41

some update

ef87ff9

fix import

ee42e7f

Trinkle23897 reviewed Feb 23, 2021

View reviewed changes

examples/mujoco/runnable/halfcheetahBullet_v0_sac.py Show resolved Hide resolved

fix

0e833ac

ChenDRAG commented Feb 23, 2021

View reviewed changes

tianshou/trainer/utils.py Show resolved Hide resolved

fix doc

44a23c0

Trinkle23897 reviewed Feb 24, 2021

View reviewed changes

examples/atari/atari_c51.py Outdated Show resolved Hide resolved

Trinkle23897 reviewed Feb 24, 2021

View reviewed changes

examples/atari/atari_dqn.py Outdated Show resolved Hide resolved

ChenDRAG and others added 4 commits February 24, 2021 12:36

Update examples/atari/atari_dqn.py

ec4ff48

Co-authored-by: n+e <trinkle23897@qq.com>

update

de2ecaf

final

950c7f4

verbose two lines

581acaf

Trinkle23897 approved these changes Feb 24, 2021

View reviewed changes

ChenDRAG merged commit f0129c9 into thu-ml:dev Feb 24, 2021

Trinkle23897 linked an issue Apr 21, 2021 that may be closed by this pull request

Plans of releasing mujoco benchmark with ddpg/sac/td3 on Tianshou #274

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Trainer refactor : flexible logger #295

Trainer refactor : flexible logger #295

Uh oh!

ChenDRAG commented Feb 21, 2021

Uh oh!

Uh oh!

codecov-io commented Feb 22, 2021 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

ChenDRAG commented Feb 23, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Trainer refactor : flexible logger #295

Trainer refactor : flexible logger #295

Uh oh!

Conversation

ChenDRAG commented Feb 21, 2021

Uh oh!

Uh oh!

codecov-io commented Feb 22, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

ChenDRAG commented Feb 23, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov-io commented Feb 22, 2021 •

edited

Loading