-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Trainer refactor : flexible logger #295
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
79ef514
to
12f053c
Compare
Codecov Report
@@ Coverage Diff @@
## dev #295 +/- ##
==========================================
- Coverage 93.94% 93.89% -0.06%
==========================================
Files 45 47 +2
Lines 3202 3241 +39
==========================================
+ Hits 3008 3043 +35
- Misses 194 198 +4
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
Ready I think |
Co-authored-by: n+e <trinkle23897@qq.com>
This PR focus on refactor of logging method to solve bug of nan reward and log interval. After these two pr, hopefully fundamental change of tianshou/data is finished. We then can concentrate on building benchmarks of tianshou finally. Things changed: 1. trainer now accepts logger (BasicLogger or LazyLogger) instead of writer; 2. remove utils.SummaryWriter;
This PR focus on refactor of logging method to solve bug of nan reward and log interval. After these two pr, hopefully fundamental change of tianshou/data is finished. We then can concentrate on building benchmarks of tianshou finally. Things changed: 1. trainer now accepts logger (BasicLogger or LazyLogger) instead of writer; 2. remove utils.SummaryWriter;
This is the 4th commit of 6 commits mentioned in #274, which features refactor of trainer to fix #161. You can check #274 for more detail.
To avoid large commit as in #280, I will do this in 2 pr. first(#293 ) focus on some definition change of trainer to make it more friendly to use and be consistent with typical usage in research papers. Second pr focus on refactor of logging method to solve bug of nan reward and log interval. After these two pr, hopefully fundamental change of tianshou/data is finished. We then can concentrate on building benchmarks of tianshou finally.
This is the second pr.
This pr also fixes the bug that 'rew' is nan in tqdm visualization.
Not finished yet, lacking test and docs update. Open to discussion