这是indexloc提供的服务,不要输入任何密码
Skip to content

Noisy network implementation #194

@GIS-PuppetMaster

Description

@GIS-PuppetMaster

My expected action is a tensor with softmax activation. However, when I use TD3, the current implementation will add this action_bias to the softmax-activated tensor, which breaks the action distribution.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementFeature that is not a new algorithm or an algorithm enhancement

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions