code refactor for venv #179

youkaichao · 2020-08-04T13:17:09Z

Refacor code to remove duplicate code
Enable async simulation for all vector envs
Remove collector.close and rename VectorEnv to DummyVectorEnv

The abstraction of vector env changed.

Prior to this pr, each vector env is almost independent.

After this pr, each env is wrapped into a worker, and vector envs differ with their worker type. In fact, users can just use BaseVectorEnv with different workers, I keep SubprocVectorEnv, ShmemVectorEnv for backward compatibility.

…th_il.py

test/throughput/test_collector_profile.py

tianshou/env/venvs.py

README.md

docs/tutorials/dqn.rst

duburcqa · 2020-08-16T11:05:02Z

It is true that timeout should be supported for dynamic environments. In fact, action distribution depends on policy, which may change the time cost distribution of step functions.

I would add that in m'y opinion it is usually more common to deal with such dynamic environment than a static one. For example in my case I have to deal with dynamic environments, never static. It comes from the way the state is integrated over time. Some state requires to refine the computation to guarantee the accuracy, while for others it is not necessary.

duburcqa · 2020-08-16T11:10:00Z

Is it ready for review ? I didn't spent much time thinking to a better way to handle methods such as wait to avoid relying on static methods of the workers themself. I'm suite confident that using metaclass (in a way similar to metaprograming in c++) could do the trick but it is tricky to design and lack for clarity even for devs. So I don't know what to think about this.

Trinkle23897 · 2020-08-16T11:13:28Z

Is it ready for review?

Yes, of course!

And I have another concern: can we reduce the async overhead as much as possible? I tested with wait_num == env_num for both sync and async mode (use test_drqn.py), and the result shows that the speed of async is 80% of sync. This could be the future work.

test/base/test_env.py

youkaichao · 2020-08-16T11:24:32Z

I would add that in m'y opinion it is usually more common to deal with such dynamic environment than a static one.

Agree. Only toy environments are static. Almost all realistic environments are dynamic.

duburcqa · 2020-08-18T04:59:34Z

I'll have a look later today

tianshou/env/worker/subproc.py

duburcqa · 2020-08-19T05:55:32Z

As I said, I should be clear from the documentation that the vector env are not usual pool, a sense that it does not really handle parallelism directly. Being asynchronous or not is to the resort of the sole worker and at its discretion, with no explicit mechanism for the pool to know this information in advance (hence the unusual wait mechanism).

youkaichao · 2020-08-19T06:45:26Z

Being asynchronous or not is to the resort of the sole worker and at its discretion, with no explicit mechanism for the pool to know this information in advance

I don't agree. The workers have send_action / get_results interfaces so that they support asynchronous simulation. It is up to the user / vec env whether to use asynchronous or not. I don't think "the unusual wait mechanism" exists.

duburcqa · 2020-08-19T07:05:37Z

The workers have send_action / get_results interfaces so that they support asynchronous simulation. It is up to the user / vec env whether to use asynchronous or not.

No it is not. It requires the user to use a worker that support async behavior. It is not asynchronous by itself.

I don't think "the unusual wait mechanism" exists.

Are you kidding me ? Show me a single well-known library using the same "calling static method from workers" strategy to implement wait method.

youkaichao · 2020-08-19T07:15:32Z

Are you kidding me ? Show me a single well-known library using the same "calling static method from workers" strategy to implement wait method.

I admit the current implementation requires the hack of calling static method from workers.

To remove the hack, each worker should implement a ready interface, which returns True if it is ready to return results. But I don't know how to implement it for ray, and I think it would be difficult to implement. Therefore, I rely on the readily implemented connection.wait and ray.wait.

The ready interface enables waiting by while-loop. An efficient wait should rely on something like semaphore, i.e. sleep on a set of semaphores. But I doubt something general like sleeping on a set of semaphores exists. Most likely, sleeping on some signal is worker-dependent.

duburcqa · 2020-08-19T13:08:31Z

I agree it is tricky to implement and I'm fine with the current implementation. It could be refactor later if necessary, or if we have time for that.

- Refacor code to remove duplicate code - Enable async simulation for all vector envs - Remove `collector.close` and rename `VectorEnv` to `DummyVectorEnv` The abstraction of vector env changed. Prior to this pr, each vector env is almost independent. After this pr, each env is wrapped into a worker, and vector envs differ with their worker type. In fact, users can just use `BaseVectorEnv` with different workers, I keep `SubprocVectorEnv`, `ShmemVectorEnv` for backward compatibility. Co-authored-by: n+e <463003665@qq.com> Co-authored-by: magicly <magicly007@gmail.com>

youkaichao added 2 commits August 4, 2020 19:18

env code refactor

4b20cfa

enable async simulation for all vec env

a2a543a

youkaichao changed the title ~~code refactor for env~~ WIP: code refactor for env Aug 4, 2020

Trinkle23897 linked an issue Aug 4, 2020 that may be closed by this pull request

Async Sampling #103

Closed

8 tasks

youkaichao added 8 commits August 4, 2020 21:31

add venvs

7e50d22

doc fix

8eda6c8

fix ray import

01628a1

fix close in profile & make run_once specific to vec env

87fc0f0

correctly close and bugfix

556f094

bugfix for _batch_set_item and is_async

5bcc4f4

bugfix for incorrectly re-used collector in test_sac_with_il.py

4706fb0

bugfix for incorrectly re-used collector in test/discrete/test_a2c_wi…

836325d

…th_il.py

youkaichao requested a review from Trinkle23897 August 4, 2020 16:19

youkaichao commented Aug 4, 2020

View reviewed changes

test/throughput/test_collector_profile.py Outdated Show resolved Hide resolved

youkaichao changed the title ~~WIP: code refactor for env~~ code refactor for env Aug 4, 2020

improve for send_action and get_result

8bd0d74

Trinkle23897 mentioned this pull request Aug 5, 2020

fix #166, assure close in vec env #178

Closed

Trinkle23897 linked an issue Aug 5, 2020 that may be closed by this pull request

SubprocVectorEnv is not released when collector is closed #166

Closed

Trinkle23897 reviewed Aug 5, 2020

View reviewed changes

test/throughput/test_collector_profile.py Outdated Show resolved Hide resolved

tianshou/env/venvs.py Outdated Show resolved Hide resolved

youkaichao and others added 11 commits August 5, 2020 09:24

rename VectorEnv to ForLoopVectorEnv and bugfix

7a5322d

bugfix for sync simulation

4c38319

pep8 fix

dd98f06

rename ForLoopVectorEnv to DummyVectorEnv

dac13ce

Merge branch 'dev' into worker

78a11d1

change examples venv

0e356b5

update docs

74d8961

update docs

abfb337

remove collector.close

b6b2003

fix close

01f1a96

try import

51f0cb4

update doc for parallel simulation

e1da7f0

youkaichao commented Aug 16, 2020

View reviewed changes

README.md Outdated Show resolved Hide resolved

README.md Show resolved Hide resolved

docs/tutorials/dqn.rst Outdated Show resolved Hide resolved

youkaichao mentioned this pull request Aug 16, 2020

Pickle compatible for replay buffer and improve buffer.get #182

Merged

Trinkle23897 added 2 commits August 16, 2020 16:26

Merge branch 'dev' into worker

f6a1e28

docs

5ce2ae4

Trinkle23897 reviewed Aug 16, 2020

View reviewed changes

test/base/test_env.py Outdated Show resolved Hide resolved

Trinkle23897 added 3 commits August 17, 2020 08:11

add warning instead of remove

82ead96

0.2.6

e7372ec

stable test async

cee76e4

Trinkle23897 requested a review from duburcqa August 17, 2020 14:52

Trinkle23897 added 2 commits August 18, 2020 15:16

remove gym.Env from BaseWorker

ba77aac

change to parser.parse_args()

2080e6a

duburcqa reviewed Aug 19, 2020

View reviewed changes

tianshou/env/worker/subproc.py Outdated Show resolved Hide resolved

duburcqa previously approved these changes Aug 19, 2020

View reviewed changes

move _setup_buf

091f39d

Trinkle23897 dismissed duburcqa’s stale review via 091f39d August 19, 2020 06:41

Trinkle23897 approved these changes Aug 19, 2020

View reviewed changes

Trinkle23897 merged commit a9f9940 into thu-ml:dev Aug 19, 2020

youkaichao deleted the worker branch August 19, 2020 07:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

code refactor for venv #179

code refactor for venv #179

Uh oh!

youkaichao commented Aug 4, 2020 •

edited by Trinkle23897

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

duburcqa commented Aug 16, 2020 •

edited

Loading

Uh oh!

duburcqa commented Aug 16, 2020

Uh oh!

Trinkle23897 commented Aug 16, 2020 •

edited

Loading

Uh oh!

Uh oh!

youkaichao commented Aug 16, 2020

Uh oh!

duburcqa commented Aug 18, 2020

Uh oh!

Uh oh!

duburcqa commented Aug 19, 2020

Uh oh!

youkaichao commented Aug 19, 2020

Uh oh!

duburcqa commented Aug 19, 2020

Uh oh!

youkaichao commented Aug 19, 2020 •

edited

Loading

Uh oh!

duburcqa commented Aug 19, 2020

Uh oh!

Uh oh!

code refactor for venv #179

code refactor for venv #179

Uh oh!

Conversation

youkaichao commented Aug 4, 2020 • edited by Trinkle23897 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

duburcqa commented Aug 16, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

duburcqa commented Aug 16, 2020

Uh oh!

Trinkle23897 commented Aug 16, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

youkaichao commented Aug 16, 2020

Uh oh!

duburcqa commented Aug 18, 2020

Uh oh!

Uh oh!

duburcqa commented Aug 19, 2020

Uh oh!

youkaichao commented Aug 19, 2020

Uh oh!

duburcqa commented Aug 19, 2020

Uh oh!

youkaichao commented Aug 19, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

duburcqa commented Aug 19, 2020

Uh oh!

Uh oh!

youkaichao commented Aug 4, 2020 •

edited by Trinkle23897

Loading

duburcqa commented Aug 16, 2020 •

edited

Loading

Trinkle23897 commented Aug 16, 2020 •

edited

Loading

youkaichao commented Aug 19, 2020 •

edited

Loading