Numba acceleration #193

Trinkle23897 · 2020-08-27T06:48:12Z

Training FPS improvement (base commit is 94bfb32):
test_pdqn: 1660 (without numba) -> 1930
discrete/test_ppo: 5100 -> 5170

since nstep has little impact on overall performance, the unit test result is:
GAE: 4.1s -> 0.057s
nstep: 0.3s -> 0.15s (little improvement)

Others:

fix a bug in ttt set_eps
keep only sumtree in segment tree implementation
dirty fix for asyncVenv check_id test

tianshou/data/utils/segtree.py

duburcqa · 2020-08-27T16:52:08Z

Could you add more information about the relevance of improving the efficiency of 'segtree' ? How much time is spent in it ?

Trinkle23897 · 2020-08-27T23:22:19Z

Could you add more information about the relevance of improving the efficiency of 'segtree' ? How much time is spent in it ?

Sure, posted on the top.

Trinkle23897 · 2020-08-31T05:10:52Z

@duburcqa ready for review, any comment is appreciated!

tianshou/data/utils/segtree.py

tianshou/policy/base.py

duburcqa · 2020-08-31T09:11:46Z

tianshou/policy/base.py

+    return target_q
+
+
+def _compile():


Why do you want to precompile the function ? I don't see the point. It goes against the JIT (Just-in-Time compilation) principle. I would use instead AOT (Ahead-of-Time compilation) if you really think it is better to pre-compile whose methods.

Here is the official documentation, including examples. Note that it is possible to define the shape of the input arrays without passing examples for both JIT and AOT.

I post the reason in the docstring. Since I want to compare the training time of several versions, if we do not compile the main function pattern ahead of time, the first-time compilation time will add onto the training time in the collector, and thus it will show that Numba slows down the whole process (from 2050 fps to 1800 fps).

I don't think AOT is a good choice since we cannot determine some arguments' type ahead of time, for example, v_s_ in GAE.

Then I would rather move _compile at the beginning of the brenchmark, rather than here. I thing it makes more sense, because it is really just a benchmarking issue.

I don't think AOT is a good choice since we cannot determine some arguments' type ahead of time, for example, v_s_ in GAE.

I don't now, it is strange. Compilation support variable size numpy array, just the number of dimensions must be known in advance. Knowing that you can also precompile several signatures if necessary.

duburcqa

Done with review!

Trinkle23897

I think _compile should under tianshou.utils and include also SegTree. I'll change it.

duburcqa · 2020-08-31T17:44:25Z

Nice ! Good to go now :)

README.md

test/base/test_env.py

test/base/test_returns.py

youkaichao · 2020-09-01T01:13:31Z

test/continuous/test_td3.py

+    from tianshou.utils import pre_compile
+    pre_compile()  # exclude compilation time to get the correct train_speed


That's a bit strange for newcomers. Maybe pre_compile can be called during tianshou.init .

I think so, but this conflicts with @duburcqa #193 (comment)

Then I would rather move _compile at the beginning of the brenchmark, rather than here. I thing it makes more sense, because it is really just a benchmarking issue.

There are many benchmarks in tianshou, but some of the benchmarks (in test folder) are also examples for beginners. Therefore, I think it is not a good solution to add pre-compile for every benchmark. Pre-compile in tianshou.__init__.py should be enough.

@duburcqa

I get your point but it is an anti-pattern. JIT is for jit. If precompile is actually able to precompile all you need, then I would use, it works just the same as jit and it is actually intenses to do what you are looking for.

Well AOT is not exactly as portable as JIT in practice because of some limitations. You can put it in init if you want but I really don't like the idea. I would rather keep it as it is right now. Maybe a more comprehensive name for beginner would be enough, like 'init_profiler'. It is not exactly what it does but it is what it is designed for. From this name plus a comment it would be clear it is there on the CI.

But beginners may build up their own code and benchmark and say, "Hey, your provided result is not compatible with my own test", something like #169 and #196. That's a sad thing.
Also, maybe more functions will be added to the future version. Perhaps they need to be compiled too.

I don't think it is a sad thing. People have never been able to properly benchmark a code, you can see this in all but one in a million posts related to stuff this on stackoverflow. It is a petty to lower our standard to match people programming knowledge, especially from people not contributing but only complaining. But I get your point too and I understand your point of view. At the end it can be changed in the future pretty easily, it is not an code design issue so it is not a big deal. As I see you seem to both agree on this point, maybe you should go for it.

Trinkle23897 · 2020-09-01T07:59:19Z

I have no more commits on this pr.

Training FPS improvement (base commit is 94bfb32): test_pdqn: 1660 (without numba) -> 1930 discrete/test_ppo: 5100 -> 5170 since nstep has little impact on overall performance, the unit test result is: GAE: 4.1s -> 0.057s nstep: 0.3s -> 0.15s (little improvement) Others: - fix a bug in ttt set_eps - keep only sumtree in segment tree implementation - dirty fix for asyncVenv check_id test

add numba to setup.py

8301798

Trinkle23897 marked this pull request as draft August 27, 2020 06:48

Trinkle23897 added 2 commits August 27, 2020 18:50

numba version of segtree.get_prefix_sum_idx

78eaab2

full numba version of segtree

3a6601c

youkaichao marked this pull request as ready for review August 27, 2020 14:20

youkaichao marked this pull request as draft August 27, 2020 14:23

youkaichao marked this pull request as ready for review August 27, 2020 14:24

youkaichao marked this pull request as draft August 27, 2020 14:24

duburcqa reviewed Aug 27, 2020

View reviewed changes

tianshou/data/utils/segtree.py Show resolved Hide resolved

Trinkle23897 added 7 commits August 29, 2020 17:34

change the order

4039028

fix import error and ttt set_eps

ac68400

numba GAE

c37d862

Merge branch 'master' into numba

c7e8028

nstep numba has negative impact

75f0b9f

compile numba jit script during __init__

410f221

dirty fix venv check_id

faecb83

Trinkle23897 marked this pull request as ready for review August 30, 2020 09:57

Trinkle23897 requested a review from youkaichao August 30, 2020 09:57

readme

b8994b7

Trinkle23897 requested a review from duburcqa August 30, 2020 13:36

add a branch of GAE pre-compile

4557967

duburcqa reviewed Aug 31, 2020

View reviewed changes

tianshou/data/utils/segtree.py Show resolved Hide resolved

duburcqa reviewed Aug 31, 2020

View reviewed changes

tianshou/data/utils/segtree.py Outdated Show resolved Hide resolved

duburcqa reviewed Aug 31, 2020

View reviewed changes

tianshou/policy/base.py Show resolved Hide resolved

duburcqa reviewed Aug 31, 2020

View reviewed changes

update comments

7a53efc

Trinkle23897 added 2 commits August 31, 2020 17:44

fix test check_id

5dde892

compile in test benchmark

2bceb5f

Trinkle23897 commented Aug 31, 2020

View reviewed changes

utils/compile.py

81eb9bd

youkaichao reviewed Sep 1, 2020

View reviewed changes

Trinkle23897 added 2 commits September 1, 2020 09:23

docs

9c5f656

pre_compile in init

8df5cb5

duburcqa approved these changes Sep 1, 2020

View reviewed changes

youkaichao approved these changes Sep 2, 2020

View reviewed changes

Trinkle23897 merged commit a746187 into thu-ml:master Sep 2, 2020

Trinkle23897 deleted the numba branch September 2, 2020 04:50

		from tianshou.utils import pre_compile
		pre_compile() # exclude compilation time to get the correct train_speed

Numba acceleration #193

Numba acceleration #193

Uh oh!

Conversation

Trinkle23897 commented Aug 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

duburcqa commented Aug 27, 2020

Uh oh!

Trinkle23897 commented Aug 27, 2020

Uh oh!

Trinkle23897 commented Aug 31, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

duburcqa Aug 31, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Trinkle23897 Aug 31, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Trinkle23897 Aug 31, 2020

Choose a reason for hiding this comment

Uh oh!

duburcqa Aug 31, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

duburcqa Aug 31, 2020

Choose a reason for hiding this comment

Uh oh!

duburcqa left a comment

Choose a reason for hiding this comment

Uh oh!

Trinkle23897 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

duburcqa commented Aug 31, 2020

Uh oh!

Uh oh!

Uh oh!

Uh oh!

youkaichao Sep 1, 2020

Choose a reason for hiding this comment

Uh oh!

Trinkle23897 Sep 1, 2020

Choose a reason for hiding this comment

Uh oh!

youkaichao Sep 1, 2020

Choose a reason for hiding this comment

Uh oh!

duburcqa Sep 1, 2020

Choose a reason for hiding this comment

Uh oh!

duburcqa Sep 1, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Trinkle23897 Sep 1, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

duburcqa Sep 1, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Trinkle23897 commented Sep 1, 2020

Uh oh!

Uh oh!

Trinkle23897 commented Aug 27, 2020 •

edited

Loading

duburcqa Aug 31, 2020 •

edited

Loading

Trinkle23897 Aug 31, 2020 •

edited

Loading

duburcqa Aug 31, 2020 •

edited

Loading

Trinkle23897 left a comment •

edited

Loading

duburcqa Sep 1, 2020 •

edited

Loading

Trinkle23897 Sep 1, 2020 •

edited

Loading

duburcqa Sep 1, 2020 •

edited

Loading