Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
make format
(required)make commit-checks
(required)As I mentioned in #494, to support CTDE scheme or other schemes that need global information, the returned
info
should contain more information.By the way, I am trying to implement some basic MARL-communication algorithms based on tianshou. Inspired by #399 (comment), I have the following design:
agent_num
agents andenv_num
envs should act as a venv withagent_num
xenv_num
envs.env_id
=agent_num
xenv_num
+env_num
should be contained ininfo
.env_id
ininfo
indicates which agents are taking actions.agent_num
xenv_num
buffers, the indices used for each agent should be the same.agent_num
agents and the centralized module.