Why is Collector mapping randomly sampled actions using map_action? #512

Hi everyone. It seems that in [Collector](https://github.com/thu-ml/tianshou/blob/bc53ead273f6f9d3788a78ecc739249eeb96b8c6/tianshou/data/collector.py), if ``random`` is true, actions are sampled from the action space using ``self._action_space[i].sample()``.

We then apply ``action_remap = self.policy.map_action(self.data.act)`` to them, just as we do to actions generated by the policy.

Is this correct? It seems to me that actions sampled from the action space are already on the environment's scale, so squashing and rescaling them again probably changes their distribution.
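To illustrate the concern, here is a minimal NumPy sketch. ``map_action_sketch`` is a hypothetical stand-in, not Tianshou's actual code: it assumes ``map_action`` first bounds the action to [-1, 1] (by clipping or ``tanh``) and then linearly rescales it to the space's ``[low, high]``. The bounds ``[-2, 2]`` and the sample size are made up for illustration.

```python
import numpy as np


def map_action_sketch(act, low, high, bound_method="clip", scaling=True):
    """Hypothetical stand-in for policy.map_action (assumed behaviour).

    Assumption: the action is first bounded to [-1, 1], then rescaled
    linearly to [low, high].
    """
    act = np.asarray(act, dtype=np.float64)
    if bound_method == "clip":
        act = np.clip(act, -1.0, 1.0)
    elif bound_method == "tanh":
        act = np.tanh(act)
    if scaling:
        act = low + (high - low) * (act + 1.0) / 2.0
    return act


rng = np.random.default_rng(0)
low, high = -2.0, 2.0  # illustrative bounds of a 1-D Box-like space

# Stand-in for action_space.sample(): uniform over [low, high].
sampled = rng.uniform(low, high, size=100_000)

# Remap the already-valid samples, as the collector would do.
remapped_clip = map_action_sketch(sampled, low, high, "clip")
remapped_tanh = map_action_sketch(sampled, low, high, "tanh")

# Fraction of remapped actions that ended up exactly on a bound.
frac_at_bounds = np.mean(np.isclose(np.abs(remapped_clip), high))

print("std of sampled actions:  ", sampled.std())
print("std after clip + rescale:", remapped_clip.std())
print("std after tanh + rescale:", remapped_tanh.std())
print("fraction at the bounds (clip):", frac_at_bounds)
```

Under these assumptions the remapped actions are no longer uniform: with clipping, every sample whose magnitude exceeds 1 is pushed onto a bound, and with ``tanh`` the mass is shifted toward the bounds without reaching them.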
