fix #98, support #99 #102

Merged 9 commits on Jun 27, 2020

danagi (Collaborator) commented Jun 27, 2020

  • I have marked all applicable categories:
    • exception-raising fix
    • algorithm implementation fix
    • documentation modification
    • new feature
  • If applicable, I have mentioned the relevant/related issue(s)

Less important but also useful:

  • I have visited the source website, and in particular read the known issues
  • I have searched through the issue tracker and issue categories for duplicates
  • I have mentioned version numbers, operating system and environment, where applicable:
    import tianshou, torch, sys
    print(tianshou.__version__, torch.__version__, sys.version, sys.platform)

Trinkle23897 merged commit 60cfc37 into thu-ml:master on Jun 27, 2020
BFAnas pushed a commit to BFAnas/tianshou that referenced this pull request on May 5, 2024:
* Add automatic alpha tuning and exploration noise for SAC.
Add the classes BaseNoise and GaussianNoise to represent exploration noise.
Add a new SAC test on MountainCarContinuous-v0,
which should benefit from the two new features above.

* Add exploration noise to the collector; fix the examples to adapt to this change.

* Fix thu-ml#98.

* Enable off-policy algorithms to update multiple times in one step (thu-ml#99).
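
The commit message names BaseNoise and GaussianNoise, but this page does not show their code. Below is a minimal sketch of what such an exploration-noise abstraction might look like. The constructor parameters (`mu`, `sigma`, `seed`), the `__call__(size)` signature, and the helper `add_exploration_noise` are illustrative assumptions, not Tianshou's actual API, and the sketch uses only the standard library instead of NumPy or PyTorch.

```python
import random
from abc import ABC, abstractmethod
from typing import List, Optional, Sequence


class BaseNoise(ABC):
    """Hypothetical interface for an exploration-noise source."""

    @abstractmethod
    def __call__(self, size: int) -> List[float]:
        """Return `size` noise samples."""


class GaussianNoise(BaseNoise):
    """Independent Gaussian samples with mean `mu` and std `sigma`."""

    def __init__(self, mu: float = 0.0, sigma: float = 1.0,
                 seed: Optional[int] = None) -> None:
        if sigma < 0:
            raise ValueError("sigma must be non-negative")
        self.mu = mu
        self.sigma = sigma
        # A private generator keeps the noise reproducible per instance.
        self._rng = random.Random(seed)

    def __call__(self, size: int) -> List[float]:
        return [self._rng.gauss(self.mu, self.sigma) for _ in range(size)]


def add_exploration_noise(action: Sequence[float], noise: BaseNoise,
                          low: float = -1.0, high: float = 1.0) -> List[float]:
    """Add one noise sample per action dimension, then clip to [low, high]."""
    samples = noise(len(action))
    return [min(high, max(low, a + n)) for a, n in zip(action, samples)]
```

In a continuous-control task such as MountainCarContinuous-v0, a helper like this would be applied to the policy's action only while collecting training data, not during evaluation.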
Development

Successfully merging this pull request may close these issues:

  • support for multiple updates in one step
  • ActorProb has redundant activation function in SAC examples
2 participants