ppo benchmark released #330

Merged
merged 7 commits into from
Mar 30, 2021

@ChenDRAG (Collaborator) commented Mar 28, 2021

PPO SOTA benchmark finally ready. See #307 for details.

| Environment | Tianshou | ikostrikov/pytorch-a2c-ppo-acktr-gail | PPO paper | baselines | spinningup(pytorch) |
|---|---|---|---|---|---|
| Ant | 3258.4+-1079.3 | N | N | N | ~650 |
| HalfCheetah | 5783.9+-1244.0 | ~3120 | ~1800 | ~1700 | ~1670 |
| Hopper | 2609.3+-700.8 | ~2300 | ~2330 | ~2400 | ~1850 |
| Walker2d | 3588.5+-756.6 | ~4000 | ~3460 | ~3510 | ~1230 |
| Swimmer | 66.7+-99.1 | N | ~108 | ~111 | ~120 |
| Humanoid | 787.1+-193.5 | N | N | N | N |
| Reacher | -4.1+-0.3 | ~-5 | ~-7 | ~-6 | N |
| InvertedPendulum | 1000.0+-0.0 | N | ~1000 | ~940 | N |
| InvertedDoublePendulum | 9231.3+-270.4 | N | ~8000 | ~7350 | N |
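
The Tianshou column reports each result in a `mean+-std` form. As a rough illustration of how such an entry could be assembled from per-seed returns, here is a minimal sketch. The helper name `summarize_returns`, the input values, and the choice of population standard deviation are all assumptions made for illustration; the PR does not state how the numbers were aggregated (see #307 for the actual details).

```python
import statistics


def summarize_returns(returns):
    """Format a list of per-seed returns as 'mean+-std' with one decimal place.

    Hypothetical helper: not part of Tianshou.
    """
    mean = statistics.fmean(returns)
    # Population standard deviation is an assumption; the benchmark
    # may use a different estimator.
    std = statistics.pstdev(returns)
    return f"{mean:.1f}+-{std:.1f}"


# Made-up per-seed returns, NOT taken from the benchmark runs.
example_returns = [1000.0, 1000.0, 1000.0]
print(summarize_returns(example_returns))
```

With identical returns across seeds the spread is zero, which is the shape of the InvertedPendulum entry in the table.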

@codecov-io commented Mar 28, 2021

Codecov Report

Merging #330 (8f7427c) into master (5d580c3) will decrease coverage by 0.00%.
The diff coverage is 0.00%.

@@            Coverage Diff             @@
##           master     #330      +/-   ##
==========================================
- Coverage   93.98%   93.98%   -0.01%     
==========================================
  Files          51       51              
  Lines        3274     3273       -1     
==========================================
- Hits         3077     3076       -1     
  Misses        197      197              
| Flag | Coverage Δ |
|---|---|
| unittests | 93.98% <0.00%> (-0.01%) ⬇️ |

| Impacted Files | Coverage Δ |
|---|---|
| tianshou/policy/modelfree/ppo.py | 90.90% <0.00%> (+1.35%) ⬆️ |
| tianshou/trainer/onpolicy.py | 95.58% <0.00%> (-1.48%) ⬇️ |
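
The headline percentages in the coverage diff can be sanity-checked from the Hits and Lines rows. The sketch below assumes coverage is simply hits divided by lines (the report lists no partial lines); `coverage_percent` is a hypothetical helper, not a Codecov API.

```python
def coverage_percent(hits, lines):
    """Return line coverage as a percentage (assumed formula: hits / lines)."""
    return 100.0 * hits / lines


# Values copied from the Hits and Lines rows of the coverage diff.
before = coverage_percent(3077, 3274)  # master
after = coverage_percent(3076, 3273)   # this PR

print(f"{before:.2f}% -> {after:.2f}%")
```

Both values round to 93.98%, which is consistent with the report showing the same coverage before and after while still flagging a slight decrease.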

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 5d580c3...8f7427c.

@Trinkle23897 Trinkle23897 requested a review from danagi March 28, 2021 13:23
danagi previously approved these changes Mar 28, 2021
@Trinkle23897 Trinkle23897 merged commit 6426a39 into thu-ml:master Mar 30, 2021
BFAnas pushed a commit to BFAnas/tianshou that referenced this pull request May 5, 2024
Development

Successfully merging this pull request may close these issues.

Plans of releasing mujoco benchmark of onpolicy algorithms(VPG, A2C, PPO)