[CI] Fix libs workflows #2800

Merged: 25 commits, Feb 28, 2025

vmoens (Collaborator) commented Feb 20, 2025

No description provided.

pytorch-bot bot commented Feb 20, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2800

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the "CLA Signed" label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Feb 20, 2025.
@vmoens added the "CI" (has to do with CI setup, e.g. wheels & builds, tests), "Environments" (adds or modifies an environment wrapper), and "Data" (data-related PR, will launch data-related jobs) labels and removed the "CLA Signed" label on Feb 20, 2025.
@facebook-github-bot added the "CLA Signed" label again on Feb 20, 2025.

github-actions bot commented Feb 20, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}23$. Worsened: $\large\color{#d91a1a}5$.

Detailed results:
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.6176s 0.5321s 1.8792 Ops/s 1.9502 Ops/s $\color{#d91a1a}-3.64\%$
test_transformed 1.1420s 1.0527s 0.9500 Ops/s 0.9818 Ops/s $\color{#d91a1a}-3.24\%$
test_serial 1.6539s 1.5647s 0.6391 Ops/s 0.6560 Ops/s $\color{#d91a1a}-2.59\%$
test_parallel 1.3892s 1.3141s 0.7610 Ops/s 0.7678 Ops/s $\color{#d91a1a}-0.89\%$
test_step_mdp_speed[True-True-True-True-True] 0.1901ms 31.8128μs 31.4339 KOps/s 32.2017 KOps/s $\color{#d91a1a}-2.38\%$
test_step_mdp_speed[True-True-True-True-False] 45.4240μs 18.3221μs 54.5790 KOps/s 55.1509 KOps/s $\color{#d91a1a}-1.04\%$
test_step_mdp_speed[True-True-True-False-True] 45.5540μs 17.6565μs 56.6362 KOps/s 57.2429 KOps/s $\color{#d91a1a}-1.06\%$
test_step_mdp_speed[True-True-True-False-False] 38.5410μs 10.2913μs 97.1695 KOps/s 99.0314 KOps/s $\color{#d91a1a}-1.88\%$
test_step_mdp_speed[True-True-False-True-True] 0.5044ms 33.6712μs 29.6989 KOps/s 29.7896 KOps/s $\color{#d91a1a}-0.30\%$
test_step_mdp_speed[True-True-False-True-False] 55.3130μs 20.3796μs 49.0687 KOps/s 49.6637 KOps/s $\color{#d91a1a}-1.20\%$
test_step_mdp_speed[True-True-False-False-True] 56.6460μs 19.6172μs 50.9758 KOps/s 50.6235 KOps/s $\color{#35bf28}+0.70\%$
test_step_mdp_speed[True-True-False-False-False] 39.3830μs 12.3968μs 80.6662 KOps/s 82.5391 KOps/s $\color{#d91a1a}-2.27\%$
test_step_mdp_speed[True-False-True-True-True] 78.0650μs 35.7519μs 27.9705 KOps/s 28.0084 KOps/s $\color{#d91a1a}-0.14\%$
test_step_mdp_speed[True-False-True-True-False] 55.6840μs 22.5262μs 44.3927 KOps/s 45.1828 KOps/s $\color{#d91a1a}-1.75\%$
test_step_mdp_speed[True-False-True-False-True] 50.8240μs 19.7095μs 50.7371 KOps/s 50.9936 KOps/s $\color{#d91a1a}-0.50\%$
test_step_mdp_speed[True-False-True-False-False] 33.9030μs 12.3097μs 81.2365 KOps/s 82.2535 KOps/s $\color{#d91a1a}-1.24\%$
test_step_mdp_speed[True-False-False-True-True] 75.8310μs 37.4671μs 26.6901 KOps/s 26.6821 KOps/s $\color{#35bf28}+0.03\%$
test_step_mdp_speed[True-False-False-True-False] 54.9620μs 24.3091μs 41.1368 KOps/s 41.7268 KOps/s $\color{#d91a1a}-1.41\%$
test_step_mdp_speed[True-False-False-False-True] 53.4890μs 21.4093μs 46.7087 KOps/s 46.0847 KOps/s $\color{#35bf28}+1.35\%$
test_step_mdp_speed[True-False-False-False-False] 42.3590μs 14.2800μs 70.0279 KOps/s 71.5885 KOps/s $\color{#d91a1a}-2.18\%$
test_step_mdp_speed[False-True-True-True-True] 70.4610μs 36.0495μs 27.7396 KOps/s 28.2189 KOps/s $\color{#d91a1a}-1.70\%$
test_step_mdp_speed[False-True-True-True-False] 53.6900μs 22.6306μs 44.1879 KOps/s 45.3288 KOps/s $\color{#d91a1a}-2.52\%$
test_step_mdp_speed[False-True-True-False-True] 77.5040μs 23.3301μs 42.8631 KOps/s 44.6145 KOps/s $\color{#d91a1a}-3.93\%$
test_step_mdp_speed[False-True-True-False-False] 58.5360μs 13.6907μs 73.0425 KOps/s 73.5912 KOps/s $\color{#d91a1a}-0.75\%$
test_step_mdp_speed[False-True-False-True-True] 0.6275ms 37.1895μs 26.8893 KOps/s 27.0406 KOps/s $\color{#d91a1a}-0.56\%$
test_step_mdp_speed[False-True-False-True-False] 49.0010μs 24.2792μs 41.1874 KOps/s 41.9872 KOps/s $\color{#d91a1a}-1.90\%$
test_step_mdp_speed[False-True-False-False-True] 2.6837ms 24.2598μs 41.2204 KOps/s 40.6687 KOps/s $\color{#35bf28}+1.36\%$
test_step_mdp_speed[False-True-False-False-False] 43.0300μs 15.6387μs 63.9439 KOps/s 64.8748 KOps/s $\color{#d91a1a}-1.43\%$
test_step_mdp_speed[False-False-True-True-True] 73.2270μs 39.3892μs 25.3877 KOps/s 25.3583 KOps/s $\color{#35bf28}+0.12\%$
test_step_mdp_speed[False-False-True-True-False] 59.9010μs 26.4141μs 37.8586 KOps/s 38.5926 KOps/s $\color{#d91a1a}-1.90\%$
test_step_mdp_speed[False-False-True-False-True] 51.9260μs 24.5040μs 40.8096 KOps/s 38.3313 KOps/s $\textbf{\color{#35bf28}+6.47\%}$
test_step_mdp_speed[False-False-True-False-False] 37.1800μs 15.7708μs 63.4082 KOps/s 64.6917 KOps/s $\color{#d91a1a}-1.98\%$
test_step_mdp_speed[False-False-False-True-True] 82.7440μs 41.5415μs 24.0723 KOps/s 24.4928 KOps/s $\color{#d91a1a}-1.72\%$
test_step_mdp_speed[False-False-False-True-False] 58.1380μs 28.3673μs 35.2519 KOps/s 36.5160 KOps/s $\color{#d91a1a}-3.46\%$
test_step_mdp_speed[False-False-False-False-True] 55.0330μs 26.7849μs 37.3345 KOps/s 38.3103 KOps/s $\color{#d91a1a}-2.55\%$
test_step_mdp_speed[False-False-False-False-False] 35.9760μs 17.6444μs 56.6752 KOps/s 58.3053 KOps/s $\color{#d91a1a}-2.80\%$
test_values[generalized_advantage_estimate-True-True] 12.6379ms 9.7807ms 102.2423 Ops/s 102.6533 Ops/s $\color{#d91a1a}-0.40\%$
test_values[vec_generalized_advantage_estimate-True-True] 28.6457ms 25.9634ms 38.5158 Ops/s 41.4970 Ops/s $\textbf{\color{#d91a1a}-7.18\%}$
test_values[td0_return_estimate-False-False] 0.2349ms 0.1742ms 5.7390 KOps/s 5.6454 KOps/s $\color{#35bf28}+1.66\%$
test_values[td1_return_estimate-False-False] 26.1403ms 23.4952ms 42.5619 Ops/s 41.2349 Ops/s $\color{#35bf28}+3.22\%$
test_values[vec_td1_return_estimate-False-False] 29.4235ms 26.1156ms 38.2913 Ops/s 41.4148 Ops/s $\textbf{\color{#d91a1a}-7.54\%}$
test_values[td_lambda_return_estimate-True-False] 36.8613ms 34.1541ms 29.2790 Ops/s 28.3624 Ops/s $\color{#35bf28}+3.23\%$
test_values[vec_td_lambda_return_estimate-True-False] 28.2081ms 26.1867ms 38.1873 Ops/s 41.4908 Ops/s $\textbf{\color{#d91a1a}-7.96\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.5505ms 8.3859ms 119.2476 Ops/s 117.0875 Ops/s $\color{#35bf28}+1.84\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.6484ms 1.8881ms 529.6252 Ops/s 504.5722 Ops/s $\color{#35bf28}+4.97\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 1.1880ms 0.3840ms 2.6042 KOps/s 2.6734 KOps/s $\color{#d91a1a}-2.59\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 47.0706ms 45.9951ms 21.7414 Ops/s 24.4936 Ops/s $\textbf{\color{#d91a1a}-11.24\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.3778ms 3.4316ms 291.4115 Ops/s 289.2726 Ops/s $\color{#35bf28}+0.74\%$
test_dqn_speed[False-None] 6.0057ms 1.4295ms 699.5311 Ops/s 691.4847 Ops/s $\color{#35bf28}+1.16\%$
test_dqn_speed[False-backward] 2.0250ms 1.9263ms 519.1221 Ops/s 511.0722 Ops/s $\color{#35bf28}+1.58\%$
test_dqn_speed[True-None] 0.6991ms 0.4718ms 2.1197 KOps/s 2.0296 KOps/s $\color{#35bf28}+4.44\%$
test_dqn_speed[True-backward] 0.9327ms 0.9026ms 1.1079 KOps/s 1.0654 KOps/s $\color{#35bf28}+3.99\%$
test_dqn_speed[reduce-overhead-None] 0.5936ms 0.4797ms 2.0846 KOps/s 2.0249 KOps/s $\color{#35bf28}+2.95\%$
test_dqn_speed[reduce-overhead-backward] 1.0229ms 0.9109ms 1.0978 KOps/s 1.0653 KOps/s $\color{#35bf28}+3.05\%$
test_ddpg_speed[False-None] 3.2398ms 2.9059ms 344.1331 Ops/s 337.2406 Ops/s $\color{#35bf28}+2.04\%$
test_ddpg_speed[False-backward] 4.1483ms 4.0436ms 247.3036 Ops/s 237.3572 Ops/s $\color{#35bf28}+4.19\%$
test_ddpg_speed[True-None] 1.5706ms 1.2176ms 821.3171 Ops/s 796.9148 Ops/s $\color{#35bf28}+3.06\%$
test_ddpg_speed[True-backward] 2.1678ms 2.1061ms 474.8034 Ops/s 462.3922 Ops/s $\color{#35bf28}+2.68\%$
test_ddpg_speed[reduce-overhead-None] 1.4484ms 1.2140ms 823.7524 Ops/s 798.4943 Ops/s $\color{#35bf28}+3.16\%$
test_ddpg_speed[reduce-overhead-backward] 3.0108ms 2.1622ms 462.4939 Ops/s 464.7164 Ops/s $\color{#d91a1a}-0.48\%$
test_sac_speed[False-None] 9.5215ms 8.0606ms 124.0609 Ops/s 122.6947 Ops/s $\color{#35bf28}+1.11\%$
test_sac_speed[False-backward] 12.1018ms 10.8941ms 91.7932 Ops/s 91.0815 Ops/s $\color{#35bf28}+0.78\%$
test_sac_speed[True-None] 3.1237ms 2.0878ms 478.9785 Ops/s 473.6052 Ops/s $\color{#35bf28}+1.13\%$
test_sac_speed[True-backward] 4.1273ms 3.8208ms 261.7232 Ops/s 257.9315 Ops/s $\color{#35bf28}+1.47\%$
test_sac_speed[reduce-overhead-None] 2.6133ms 2.0767ms 481.5239 Ops/s 446.9960 Ops/s $\textbf{\color{#35bf28}+7.72\%}$
test_sac_speed[reduce-overhead-backward] 4.8586ms 3.8789ms 257.8045 Ops/s 260.5021 Ops/s $\color{#d91a1a}-1.04\%$
test_redq_speed[False-None] 17.9792ms 13.4394ms 74.4081 Ops/s 74.3131 Ops/s $\color{#35bf28}+0.13\%$
test_redq_speed[False-backward] 26.1005ms 22.7897ms 43.8795 Ops/s 43.3756 Ops/s $\color{#35bf28}+1.16\%$
test_redq_speed[True-None] 6.0153ms 5.1280ms 195.0083 Ops/s 192.6267 Ops/s $\color{#35bf28}+1.24\%$
test_redq_speed[True-backward] 13.1082ms 12.4092ms 80.5853 Ops/s 75.9988 Ops/s $\textbf{\color{#35bf28}+6.03\%}$
test_redq_speed[reduce-overhead-None] 5.7586ms 5.0075ms 199.6986 Ops/s 203.3361 Ops/s $\color{#d91a1a}-1.79\%$
test_redq_speed[reduce-overhead-backward] 13.2732ms 12.5847ms 79.4617 Ops/s 75.2112 Ops/s $\textbf{\color{#35bf28}+5.65\%}$
test_redq_deprec_speed[False-None] 13.4651ms 12.8728ms 77.6829 Ops/s 73.4339 Ops/s $\textbf{\color{#35bf28}+5.79\%}$
test_redq_deprec_speed[False-backward] 19.6228ms 18.7130ms 53.4389 Ops/s 51.1534 Ops/s $\color{#35bf28}+4.47\%$
test_redq_deprec_speed[True-None] 4.9193ms 3.8791ms 257.7944 Ops/s 255.3542 Ops/s $\color{#35bf28}+0.96\%$
test_redq_deprec_speed[True-backward] 9.7041ms 8.4693ms 118.0735 Ops/s 120.4857 Ops/s $\color{#d91a1a}-2.00\%$
test_redq_deprec_speed[reduce-overhead-None] 4.5760ms 3.9352ms 254.1183 Ops/s 258.2939 Ops/s $\color{#d91a1a}-1.62\%$
test_redq_deprec_speed[reduce-overhead-backward] 9.3148ms 8.4668ms 118.1078 Ops/s 108.0733 Ops/s $\textbf{\color{#35bf28}+9.28\%}$
test_td3_speed[False-None] 8.4628ms 8.1526ms 122.6602 Ops/s 118.8661 Ops/s $\color{#35bf28}+3.19\%$
test_td3_speed[False-backward] 11.0733ms 10.4934ms 95.2980 Ops/s 91.4695 Ops/s $\color{#35bf28}+4.19\%$
test_td3_speed[True-None] 1.9283ms 1.8270ms 547.3349 Ops/s 542.8660 Ops/s $\color{#35bf28}+0.82\%$
test_td3_speed[True-backward] 3.4849ms 3.3956ms 294.4962 Ops/s 290.9650 Ops/s $\color{#35bf28}+1.21\%$
test_td3_speed[reduce-overhead-None] 2.0495ms 1.7857ms 560.0201 Ops/s 539.9913 Ops/s $\color{#35bf28}+3.71\%$
test_td3_speed[reduce-overhead-backward] 3.7317ms 3.4012ms 294.0131 Ops/s 288.8738 Ops/s $\color{#35bf28}+1.78\%$
test_cql_speed[False-None] 39.2835ms 36.1166ms 27.6881 Ops/s 26.5430 Ops/s $\color{#35bf28}+4.31\%$
test_cql_speed[False-backward] 52.2558ms 47.7961ms 20.9222 Ops/s 20.2684 Ops/s $\color{#35bf28}+3.23\%$
test_cql_speed[True-None] 16.6574ms 16.0003ms 62.4990 Ops/s 62.2971 Ops/s $\color{#35bf28}+0.32\%$
test_cql_speed[True-backward] 25.1265ms 23.5110ms 42.5332 Ops/s 43.7469 Ops/s $\color{#d91a1a}-2.77\%$
test_cql_speed[reduce-overhead-None] 16.5981ms 16.1048ms 62.0933 Ops/s 61.8089 Ops/s $\color{#35bf28}+0.46\%$
test_cql_speed[reduce-overhead-backward] 23.8598ms 23.0602ms 43.3647 Ops/s 43.4150 Ops/s $\color{#d91a1a}-0.12\%$
test_a2c_speed[False-None] 8.7868ms 7.2166ms 138.5695 Ops/s 136.5884 Ops/s $\color{#35bf28}+1.45\%$
test_a2c_speed[False-backward] 15.8932ms 14.4234ms 69.3317 Ops/s 68.4445 Ops/s $\color{#35bf28}+1.30\%$
test_a2c_speed[True-None] 4.4997ms 3.7512ms 266.5796 Ops/s 266.6229 Ops/s $\color{#d91a1a}-0.02\%$
test_a2c_speed[True-backward] 11.0755ms 10.2124ms 97.9200 Ops/s 97.8414 Ops/s $\color{#35bf28}+0.08\%$
test_a2c_speed[reduce-overhead-None] 4.4061ms 3.7664ms 265.5025 Ops/s 266.7976 Ops/s $\color{#d91a1a}-0.49\%$
test_a2c_speed[reduce-overhead-backward] 11.4415ms 10.1818ms 98.2141 Ops/s 95.9353 Ops/s $\color{#35bf28}+2.38\%$
test_ppo_speed[False-None] 8.4208ms 7.5280ms 132.8379 Ops/s 128.9654 Ops/s $\color{#35bf28}+3.00\%$
test_ppo_speed[False-backward] 16.4934ms 14.9205ms 67.0220 Ops/s 64.9007 Ops/s $\color{#35bf28}+3.27\%$
test_ppo_speed[True-None] 4.8314ms 4.0934ms 244.2946 Ops/s 211.8122 Ops/s $\textbf{\color{#35bf28}+15.34\%}$
test_ppo_speed[True-backward] 10.4252ms 10.0416ms 99.5856 Ops/s 89.9974 Ops/s $\textbf{\color{#35bf28}+10.65\%}$
test_ppo_speed[reduce-overhead-None] 4.7193ms 4.0792ms 245.1477 Ops/s 233.9062 Ops/s $\color{#35bf28}+4.81\%$
test_ppo_speed[reduce-overhead-backward] 10.9222ms 10.0798ms 99.2087 Ops/s 95.2981 Ops/s $\color{#35bf28}+4.10\%$
test_reinforce_speed[False-None] 7.3261ms 6.5402ms 152.9004 Ops/s 145.2639 Ops/s $\textbf{\color{#35bf28}+5.26\%}$
test_reinforce_speed[False-backward] 9.9658ms 9.7719ms 102.3344 Ops/s 98.6253 Ops/s $\color{#35bf28}+3.76\%$
test_reinforce_speed[True-None] 3.8220ms 3.1271ms 319.7844 Ops/s 316.3480 Ops/s $\color{#35bf28}+1.09\%$
test_reinforce_speed[True-backward] 9.4733ms 8.9991ms 111.1217 Ops/s 101.4945 Ops/s $\textbf{\color{#35bf28}+9.49\%}$
test_reinforce_speed[reduce-overhead-None] 3.8014ms 3.0668ms 326.0732 Ops/s 291.9713 Ops/s $\textbf{\color{#35bf28}+11.68\%}$
test_reinforce_speed[reduce-overhead-backward] 9.6642ms 9.1216ms 109.6299 Ops/s 98.7518 Ops/s $\textbf{\color{#35bf28}+11.02\%}$
test_iql_speed[False-None] 34.8043ms 32.7923ms 30.4950 Ops/s 28.3223 Ops/s $\textbf{\color{#35bf28}+7.67\%}$
test_iql_speed[False-backward] 47.1211ms 45.5896ms 21.9348 Ops/s 21.1674 Ops/s $\color{#35bf28}+3.63\%$
test_iql_speed[True-None] 12.0372ms 11.0750ms 90.2934 Ops/s 84.1951 Ops/s $\textbf{\color{#35bf28}+7.24\%}$
test_iql_speed[True-backward] 23.0520ms 21.8905ms 45.6818 Ops/s 45.2794 Ops/s $\color{#35bf28}+0.89\%$
test_iql_speed[reduce-overhead-None] 11.8628ms 11.1539ms 89.6550 Ops/s 81.9762 Ops/s $\textbf{\color{#35bf28}+9.37\%}$
test_iql_speed[reduce-overhead-backward] 24.1118ms 22.2715ms 44.9005 Ops/s 45.2146 Ops/s $\color{#d91a1a}-0.69\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.5236ms 4.9565ms 201.7561 Ops/s 203.4175 Ops/s $\color{#d91a1a}-0.82\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7901ms 0.5250ms 1.9049 KOps/s 1.9127 KOps/s $\color{#d91a1a}-0.41\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7646ms 0.5025ms 1.9901 KOps/s 2.0375 KOps/s $\color{#d91a1a}-2.33\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.6427ms 4.7502ms 210.5154 Ops/s 221.2174 Ops/s $\color{#d91a1a}-4.84\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.1496ms 0.5068ms 1.9732 KOps/s 1.9584 KOps/s $\color{#35bf28}+0.76\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6948ms 0.4830ms 2.0705 KOps/s 2.0543 KOps/s $\color{#35bf28}+0.79\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.8561ms 1.6507ms 605.7923 Ops/s 601.7384 Ops/s $\color{#35bf28}+0.67\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.2053ms 1.5727ms 635.8600 Ops/s 628.1463 Ops/s $\color{#35bf28}+1.23\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 4.8419ms 4.5839ms 218.1532 Ops/s 212.5111 Ops/s $\color{#35bf28}+2.65\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.4986ms 0.6516ms 1.5347 KOps/s 1.5317 KOps/s $\color{#35bf28}+0.20\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8484ms 0.6297ms 1.5880 KOps/s 1.5747 KOps/s $\color{#35bf28}+0.84\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 4.9328ms 4.5040ms 222.0266 Ops/s 209.5941 Ops/s $\textbf{\color{#35bf28}+5.93\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.2074ms 0.5172ms 1.9336 KOps/s 1.9270 KOps/s $\color{#35bf28}+0.34\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7165ms 0.4966ms 2.0138 KOps/s 1.9691 KOps/s $\color{#35bf28}+2.27\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 4.9435ms 4.4558ms 224.4263 Ops/s 206.6225 Ops/s $\textbf{\color{#35bf28}+8.62\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.0460ms 0.5091ms 1.9643 KOps/s 1.9467 KOps/s $\color{#35bf28}+0.90\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7302ms 0.4806ms 2.0807 KOps/s 2.0717 KOps/s $\color{#35bf28}+0.44\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.7917ms 4.5999ms 217.3939 Ops/s 201.2842 Ops/s $\textbf{\color{#35bf28}+8.00\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.6711ms 0.6521ms 1.5336 KOps/s 1.4877 KOps/s $\color{#35bf28}+3.08\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8775ms 0.6295ms 1.5887 KOps/s 1.5503 KOps/s $\color{#35bf28}+2.47\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.7001ms 4.2190ms 237.0250 Ops/s 231.2376 Ops/s $\color{#35bf28}+2.50\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.4498ms 2.2857ms 437.4970 Ops/s 447.1988 Ops/s $\color{#d91a1a}-2.17\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 6.1458ms 1.4585ms 685.6147 Ops/s 633.1200 Ops/s $\textbf{\color{#35bf28}+8.29\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4298s 12.7832ms 78.2279 Ops/s 229.4712 Ops/s $\textbf{\color{#d91a1a}-65.91\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 6.6011ms 2.3016ms 434.4808 Ops/s 428.2712 Ops/s $\color{#35bf28}+1.45\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 1.9150ms 1.2592ms 794.1810 Ops/s 678.5497 Ops/s $\textbf{\color{#35bf28}+17.04\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 5.6939ms 4.3916ms 227.7072 Ops/s 32.0607 Ops/s $\textbf{\color{#35bf28}+610.24\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 8.0848ms 2.4570ms 406.9973 Ops/s 392.2417 Ops/s $\color{#35bf28}+3.76\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.6744ms 1.4885ms 671.8041 Ops/s 617.4172 Ops/s $\textbf{\color{#35bf28}+8.81\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 12.2009ms 11.7997ms 84.7478 Ops/s 76.3924 Ops/s $\textbf{\color{#35bf28}+10.94\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 15.4213ms 14.5396ms 68.7776 Ops/s 67.7788 Ops/s $\color{#35bf28}+1.47\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 20.8588ms 20.5264ms 48.7178 Ops/s 46.4637 Ops/s $\color{#35bf28}+4.85\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 15.4389ms 14.6005ms 68.4907 Ops/s 66.5621 Ops/s $\color{#35bf28}+2.90\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 22.5321ms 20.5413ms 48.6824 Ops/s 46.9329 Ops/s $\color{#35bf28}+3.73\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 17.6813ms 16.0382ms 62.3510 Ops/s 61.0805 Ops/s $\color{#35bf28}+2.08\%$
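
The Change column above appears to be the relative difference between the Ops value measured on this PR and the Ops value on repo HEAD, after converting both to the same unit. In the CPU table, the 23 bold green and 5 bold red entries match the Improved and Worsened totals, which suggests that only changes beyond roughly ±5% are counted. The sketch below reproduces that arithmetic under those assumptions. The helper names and the 5% threshold are hypothetical, inferred from the table, and not taken from the benchmark workflow itself.

```python
# Hypothetical helpers: names and the 5% threshold are assumptions inferred
# from the table, not the actual code used by the benchmark workflow.

# Units as they appear in the Ops columns of the table.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3}


def to_ops_per_sec(value: float, unit: str) -> float:
    """Convert a table entry such as (1.0541, "KOps/s") to plain Ops/s."""
    return value * UNIT_SCALE[unit]


def percent_change(pr_ops: float, head_ops: float) -> float:
    """Relative change of the PR throughput versus repo HEAD, in percent."""
    return (pr_ops - head_ops) / head_ops * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a change; the threshold is an assumed cut-off for bold entries."""
    if change_pct > threshold:
        return "improved"
    if change_pct < -threshold:
        return "worsened"
    return "unchanged"


# CPU test_simple: 1.8792 Ops/s on the PR versus 1.9502 Ops/s on HEAD.
change = percent_change(1.8792, 1.9502)
print(f"{change:+.2f}% -> {classify(change)}")

# CPU test_ppo_speed[True-None]: 244.2946 Ops/s versus 211.8122 Ops/s.
change = percent_change(244.2946, 211.8122)
print(f"{change:+.2f}% -> {classify(change)}")
```

Because throughput (Ops/s) is compared rather than latency, a positive change means the PR is faster than HEAD. Rows that mix units, such as KOps/s against Ops/s, need the conversion step before the comparison.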

github-actions bot commented Feb 20, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}13$. Worsened: $\large\color{#d91a1a}14$.

Detailed results:
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.9239s 0.8311s 1.2033 Ops/s 1.2028 Ops/s $\color{#35bf28}+0.04\%$
test_transformed 1.5626s 1.4696s 0.6805 Ops/s 0.6864 Ops/s $\color{#d91a1a}-0.86\%$
test_serial 2.4569s 2.3578s 0.4241 Ops/s 0.4286 Ops/s $\color{#d91a1a}-1.04\%$
test_parallel 1.9547s 1.8574s 0.5384 Ops/s 0.5301 Ops/s $\color{#35bf28}+1.56\%$
test_step_mdp_speed[True-True-True-True-True] 0.1258ms 39.1245μs 25.5594 KOps/s 25.4737 KOps/s $\color{#35bf28}+0.34\%$
test_step_mdp_speed[True-True-True-True-False] 53.3810μs 23.6803μs 42.2292 KOps/s 43.1985 KOps/s $\color{#d91a1a}-2.24\%$
test_step_mdp_speed[True-True-True-False-True] 57.6910μs 21.9875μs 45.4803 KOps/s 45.4690 KOps/s $\color{#35bf28}+0.02\%$
test_step_mdp_speed[True-True-True-False-False] 39.4710μs 12.9686μs 77.1093 KOps/s 76.6018 KOps/s $\color{#35bf28}+0.66\%$
test_step_mdp_speed[True-True-False-True-True] 0.1083ms 42.0535μs 23.7793 KOps/s 23.8569 KOps/s $\color{#d91a1a}-0.33\%$
test_step_mdp_speed[True-True-False-True-False] 56.8210μs 25.5547μs 39.1317 KOps/s 38.9215 KOps/s $\color{#35bf28}+0.54\%$
test_step_mdp_speed[True-True-False-False-True] 94.8110μs 24.2006μs 41.3212 KOps/s 41.5523 KOps/s $\color{#d91a1a}-0.56\%$
test_step_mdp_speed[True-True-False-False-False] 42.1110μs 15.2179μs 65.7119 KOps/s 65.3415 KOps/s $\color{#35bf28}+0.57\%$
test_step_mdp_speed[True-False-True-True-True] 71.6510μs 44.5948μs 22.4242 KOps/s 22.6232 KOps/s $\color{#d91a1a}-0.88\%$
test_step_mdp_speed[True-False-True-True-False] 55.2010μs 28.0706μs 35.6245 KOps/s 35.8800 KOps/s $\color{#d91a1a}-0.71\%$
test_step_mdp_speed[True-False-True-False-True] 60.7110μs 24.1815μs 41.3539 KOps/s 40.7783 KOps/s $\color{#35bf28}+1.41\%$
test_step_mdp_speed[True-False-True-False-False] 38.6600μs 15.3664μs 65.0772 KOps/s 66.2086 KOps/s $\color{#d91a1a}-1.71\%$
test_step_mdp_speed[True-False-False-True-True] 75.2110μs 46.4439μs 21.5313 KOps/s 21.4816 KOps/s $\color{#35bf28}+0.23\%$
test_step_mdp_speed[True-False-False-True-False] 61.0420μs 30.2492μs 33.0587 KOps/s 33.3042 KOps/s $\color{#d91a1a}-0.74\%$
test_step_mdp_speed[True-False-False-False-True] 56.0600μs 26.6011μs 37.5924 KOps/s 38.1119 KOps/s $\color{#d91a1a}-1.36\%$
test_step_mdp_speed[True-False-False-False-False] 49.2910μs 17.5235μs 57.0663 KOps/s 57.5595 KOps/s $\color{#d91a1a}-0.86\%$
test_step_mdp_speed[False-True-True-True-True] 88.3320μs 44.7668μs 22.3380 KOps/s 22.7332 KOps/s $\color{#d91a1a}-1.74\%$
test_step_mdp_speed[False-True-True-True-False] 59.8710μs 28.4231μs 35.1826 KOps/s 35.8642 KOps/s $\color{#d91a1a}-1.90\%$
test_step_mdp_speed[False-True-True-False-True] 68.2110μs 28.4066μs 35.2030 KOps/s 36.3173 KOps/s $\color{#d91a1a}-3.07\%$
test_step_mdp_speed[False-True-True-False-False] 51.9910μs 16.8628μs 59.3021 KOps/s 59.5003 KOps/s $\color{#d91a1a}-0.33\%$
test_step_mdp_speed[False-True-False-True-True] 77.4720μs 46.2643μs 21.6149 KOps/s 21.4748 KOps/s $\color{#35bf28}+0.65\%$
test_step_mdp_speed[False-True-False-True-False] 57.6010μs 30.2079μs 33.1039 KOps/s 32.9738 KOps/s $\color{#35bf28}+0.39\%$
test_step_mdp_speed[False-True-False-False-True] 3.3536ms 30.7892μs 32.4790 KOps/s 32.3041 KOps/s $\color{#35bf28}+0.54\%$
test_step_mdp_speed[False-True-False-False-False] 50.9810μs 19.3283μs 51.7376 KOps/s 51.4360 KOps/s $\color{#35bf28}+0.59\%$
test_step_mdp_speed[False-False-True-True-True] 80.7510μs 48.7339μs 20.5196 KOps/s 20.3998 KOps/s $\color{#35bf28}+0.59\%$
test_step_mdp_speed[False-False-True-True-False] 61.8010μs 32.4452μs 30.8212 KOps/s 31.0539 KOps/s $\color{#d91a1a}-0.75\%$
test_step_mdp_speed[False-False-True-False-True] 70.0410μs 30.0416μs 33.2872 KOps/s 33.4686 KOps/s $\color{#d91a1a}-0.54\%$
test_step_mdp_speed[False-False-True-False-False] 58.5210μs 19.0722μs 52.4323 KOps/s 51.6266 KOps/s $\color{#35bf28}+1.56\%$
test_step_mdp_speed[False-False-False-True-True] 89.8210μs 50.7698μs 19.6967 KOps/s 19.7055 KOps/s $\color{#d91a1a}-0.04\%$
test_step_mdp_speed[False-False-False-True-False] 77.3420μs 34.5357μs 28.9555 KOps/s 28.9055 KOps/s $\color{#35bf28}+0.17\%$
test_step_mdp_speed[False-False-False-False-True] 66.2110μs 31.7720μs 31.4742 KOps/s 31.3837 KOps/s $\color{#35bf28}+0.29\%$
test_step_mdp_speed[False-False-False-False-False] 56.9110μs 21.1132μs 47.3638 KOps/s 47.6179 KOps/s $\color{#d91a1a}-0.53\%$
test_values[generalized_advantage_estimate-True-True] 26.8123ms 25.7351ms 38.8575 Ops/s 38.9845 Ops/s $\color{#d91a1a}-0.33\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1046s 2.9866ms 334.8293 Ops/s 342.5718 Ops/s $\color{#d91a1a}-2.26\%$
test_values[td0_return_estimate-False-False] 0.1025ms 78.2651μs 12.7771 KOps/s 12.6025 KOps/s $\color{#35bf28}+1.39\%$
test_values[td1_return_estimate-False-False] 61.6438ms 57.0980ms 17.5137 Ops/s 17.2268 Ops/s $\color{#35bf28}+1.67\%$
test_values[vec_td1_return_estimate-False-False] 1.2599ms 1.0787ms 927.0167 Ops/s 920.7348 Ops/s $\color{#35bf28}+0.68\%$
test_values[td_lambda_return_estimate-True-False] 97.6790ms 93.4490ms 10.7010 Ops/s 10.6695 Ops/s $\color{#35bf28}+0.30\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.2285ms 1.0754ms 929.8900 Ops/s 920.9864 Ops/s $\color{#35bf28}+0.97\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 27.7689ms 25.6264ms 39.0223 Ops/s 38.2915 Ops/s $\color{#35bf28}+1.91\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0646ms 0.7495ms 1.3343 KOps/s 1.2895 KOps/s $\color{#35bf28}+3.47\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7489ms 0.6667ms 1.4999 KOps/s 1.5035 KOps/s $\color{#d91a1a}-0.24\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5913ms 1.4847ms 673.5513 Ops/s 672.6442 Ops/s $\color{#35bf28}+0.13\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7326ms 0.6797ms 1.4713 KOps/s 1.4697 KOps/s $\color{#35bf28}+0.11\%$
test_dqn_speed[False-None] 1.6827ms 1.5939ms 627.3752 Ops/s 662.9745 Ops/s $\textbf{\color{#d91a1a}-5.37\%}$
test_dqn_speed[False-backward] 2.3636ms 2.1253ms 470.5162 Ops/s 472.8331 Ops/s $\color{#d91a1a}-0.49\%$
test_dqn_speed[True-None] 0.9821ms 0.5661ms 1.7665 KOps/s 1.7938 KOps/s $\color{#d91a1a}-1.52\%$
test_dqn_speed[True-backward] 1.2141ms 1.1360ms 880.2810 Ops/s 803.9744 Ops/s $\textbf{\color{#35bf28}+9.49\%}$
test_dqn_speed[reduce-overhead-None] 0.6343ms 0.5642ms 1.7725 KOps/s 1.7504 KOps/s $\color{#35bf28}+1.27\%$
test_dqn_speed[reduce-overhead-backward] 1.0392ms 0.9487ms 1.0541 KOps/s 923.0215 Ops/s $\textbf{\color{#35bf28}+14.20\%}$
test_ddpg_speed[False-None] 3.2609ms 2.8438ms 351.6422 Ops/s 352.0096 Ops/s $\color{#d91a1a}-0.10\%$
test_ddpg_speed[False-backward] 4.5285ms 4.0910ms 244.4386 Ops/s 236.5973 Ops/s $\color{#35bf28}+3.31\%$
test_ddpg_speed[True-None] 1.7056ms 1.3270ms 753.5924 Ops/s 746.4091 Ops/s $\color{#35bf28}+0.96\%$
test_ddpg_speed[True-backward] 2.5853ms 2.4148ms 414.1170 Ops/s 385.1699 Ops/s $\textbf{\color{#35bf28}+7.52\%}$
test_ddpg_speed[reduce-overhead-None] 1.7582ms 1.3350ms 749.0722 Ops/s 735.8796 Ops/s $\color{#35bf28}+1.79\%$
test_ddpg_speed[reduce-overhead-backward] 1.9372ms 1.8779ms 532.5034 Ops/s 487.1629 Ops/s $\textbf{\color{#35bf28}+9.31\%}$
test_sac_speed[False-None] 8.5110ms 8.0856ms 123.6767 Ops/s 122.9851 Ops/s $\color{#35bf28}+0.56\%$
test_sac_speed[False-backward] 11.5137ms 10.9997ms 90.9114 Ops/s 88.5944 Ops/s $\color{#35bf28}+2.62\%$
test_sac_speed[True-None] 1.9732ms 1.8396ms 543.5929 Ops/s 538.5366 Ops/s $\color{#35bf28}+0.94\%$
test_sac_speed[True-backward] 3.6499ms 3.5612ms 280.8072 Ops/s 265.5329 Ops/s $\textbf{\color{#35bf28}+5.75\%}$
test_sac_speed[reduce-overhead-None] 21.5725ms 11.9922ms 83.3875 Ops/s 81.5107 Ops/s $\color{#35bf28}+2.30\%$
test_sac_speed[reduce-overhead-backward] 1.6543ms 1.5793ms 633.2008 Ops/s 554.5981 Ops/s $\textbf{\color{#35bf28}+14.17\%}$
test_redq_speed[False-None] 8.1129ms 7.6534ms 130.6610 Ops/s 129.3580 Ops/s $\color{#35bf28}+1.01\%$
test_redq_speed[False-backward] 12.0218ms 11.4792ms 87.1140 Ops/s 84.0683 Ops/s $\color{#35bf28}+3.62\%$
test_redq_speed[True-None] 2.4404ms 2.3102ms 432.8660 Ops/s 406.9928 Ops/s $\textbf{\color{#35bf28}+6.36\%}$
test_redq_speed[True-backward] 4.6573ms 4.1701ms 239.8025 Ops/s 233.6341 Ops/s $\color{#35bf28}+2.64\%$
test_redq_speed[reduce-overhead-None] 2.3924ms 2.3316ms 428.8944 Ops/s 419.9952 Ops/s $\color{#35bf28}+2.12\%$
test_redq_speed[reduce-overhead-backward] 4.3818ms 4.1998ms 238.1090 Ops/s 230.7770 Ops/s $\color{#35bf28}+3.18\%$
test_redq_deprec_speed[False-None] 9.3975ms 9.0682ms 110.2758 Ops/s 109.2920 Ops/s $\color{#35bf28}+0.90\%$
test_redq_deprec_speed[False-backward] 13.2607ms 12.2973ms 81.3184 Ops/s 80.7307 Ops/s $\color{#35bf28}+0.73\%$
test_redq_deprec_speed[True-None] 2.7609ms 2.6208ms 381.5608 Ops/s 375.3495 Ops/s $\color{#35bf28}+1.65\%$
test_redq_deprec_speed[True-backward] 4.6479ms 4.4740ms 223.5158 Ops/s 214.4568 Ops/s $\color{#35bf28}+4.22\%$
test_redq_deprec_speed[reduce-overhead-None] 2.8501ms 2.6214ms 381.4767 Ops/s 371.8537 Ops/s $\color{#35bf28}+2.59\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.5796ms 4.4400ms 225.2238 Ops/s 220.1031 Ops/s $\color{#35bf28}+2.33\%$
test_td3_speed[False-None] 8.2352ms 7.9701ms 125.4685 Ops/s 124.7033 Ops/s $\color{#35bf28}+0.61\%$
test_td3_speed[False-backward] 11.0701ms 10.5498ms 94.7885 Ops/s 95.2746 Ops/s $\color{#d91a1a}-0.51\%$
test_td3_speed[True-None] 1.7408ms 1.6930ms 590.6669 Ops/s 584.8291 Ops/s $\color{#35bf28}+1.00\%$
test_td3_speed[True-backward] 3.4899ms 3.3217ms 301.0507 Ops/s 292.6430 Ops/s $\color{#35bf28}+2.87\%$
test_td3_speed[reduce-overhead-None] 52.4112ms 26.5932ms 37.6036 Ops/s 37.8982 Ops/s $\color{#d91a1a}-0.78\%$
test_td3_speed[reduce-overhead-backward] 1.3642ms 1.3075ms 764.7921 Ops/s 750.3113 Ops/s $\color{#35bf28}+1.93\%$
test_cql_speed[False-None] 17.9132ms 17.0743ms 58.5675 Ops/s 59.3375 Ops/s $\color{#d91a1a}-1.30\%$
test_cql_speed[False-backward] 22.6375ms 21.8792ms 45.7056 Ops/s 45.4311 Ops/s $\color{#35bf28}+0.60\%$
test_cql_speed[True-None] 3.4926ms 3.2675ms 306.0456 Ops/s 297.0816 Ops/s $\color{#35bf28}+3.02\%$
test_cql_speed[True-backward] 6.0525ms 5.6455ms 177.1331 Ops/s 176.0057 Ops/s $\color{#35bf28}+0.64\%$
test_cql_speed[reduce-overhead-None] 21.1228ms 13.2162ms 75.6646 Ops/s 73.6749 Ops/s $\color{#35bf28}+2.70\%$
test_cql_speed[reduce-overhead-backward] 2.1307ms 1.9846ms 503.8741 Ops/s 497.1346 Ops/s $\color{#35bf28}+1.36\%$
test_a2c_speed[False-None] 3.3592ms 3.1725ms 315.2050 Ops/s 310.9445 Ops/s $\color{#35bf28}+1.37\%$
test_a2c_speed[False-backward] 6.9604ms 6.2750ms 159.3619 Ops/s 155.9113 Ops/s $\color{#35bf28}+2.21\%$
test_a2c_speed[True-None] 1.4168ms 1.3374ms 747.6916 Ops/s 719.3121 Ops/s $\color{#35bf28}+3.95\%$
test_a2c_speed[True-backward] 3.2874ms 3.0719ms 325.5328 Ops/s 330.9838 Ops/s $\color{#d91a1a}-1.65\%$
test_a2c_speed[reduce-overhead-None] 16.2146ms 9.1402ms 109.4071 Ops/s 112.3046 Ops/s $\color{#d91a1a}-2.58\%$
test_a2c_speed[reduce-overhead-backward] 1.6979ms 1.5912ms 628.4560 Ops/s 664.6238 Ops/s $\textbf{\color{#d91a1a}-5.44\%}$
test_ppo_speed[False-None] 3.7951ms 3.6776ms 271.9159 Ops/s 257.9990 Ops/s $\textbf{\color{#35bf28}+5.39\%}$
test_ppo_speed[False-backward] 7.2709ms 6.9824ms 143.2181 Ops/s 144.1131 Ops/s $\color{#d91a1a}-0.62\%$
test_ppo_speed[True-None] 1.5694ms 1.4087ms 709.8672 Ops/s 697.7135 Ops/s $\color{#35bf28}+1.74\%$
test_ppo_speed[True-backward] 3.2661ms 3.2106ms 311.4635 Ops/s 317.6943 Ops/s $\color{#d91a1a}-1.96\%$
test_ppo_speed[reduce-overhead-None] 1.0216ms 0.9633ms 1.0381 KOps/s 1.0431 KOps/s $\color{#d91a1a}-0.48\%$
test_ppo_speed[reduce-overhead-backward] 1.7171ms 1.5611ms 640.5870 Ops/s 616.7075 Ops/s $\color{#35bf28}+3.87\%$
test_reinforce_speed[False-None] 2.3470ms 2.2598ms 442.5232 Ops/s 439.7269 Ops/s $\color{#35bf28}+0.64\%$
test_reinforce_speed[False-backward] 3.8024ms 3.4153ms 292.8018 Ops/s 296.0354 Ops/s $\color{#d91a1a}-1.09\%$
test_reinforce_speed[True-None] 1.4550ms 1.2832ms 779.2936 Ops/s 768.9182 Ops/s $\color{#35bf28}+1.35\%$
test_reinforce_speed[True-backward] 3.1435ms 3.0632ms 326.4586 Ops/s 339.7296 Ops/s $\color{#d91a1a}-3.91\%$
test_reinforce_speed[reduce-overhead-None] 18.1530ms 10.0534ms 99.4686 Ops/s 98.2520 Ops/s $\color{#35bf28}+1.24\%$
test_reinforce_speed[reduce-overhead-backward] 1.7500ms 1.6263ms 614.9088 Ops/s 652.9031 Ops/s $\textbf{\color{#d91a1a}-5.82\%}$
test_iql_speed[False-None] 9.6642ms 9.2103ms 108.5746 Ops/s 107.3972 Ops/s $\color{#35bf28}+1.10\%$
test_iql_speed[False-backward] 13.7552ms 13.1370ms 76.1208 Ops/s 76.0978 Ops/s $\color{#35bf28}+0.03\%$
test_iql_speed[True-None] 2.3433ms 2.2105ms 452.3965 Ops/s 433.7211 Ops/s $\color{#35bf28}+4.31\%$
test_iql_speed[True-backward] 5.0538ms 4.9005ms 204.0600 Ops/s 198.0270 Ops/s $\color{#35bf28}+3.05\%$
test_iql_speed[reduce-overhead-None] 0.5124s 13.0313ms 76.7381 Ops/s 88.9307 Ops/s $\textbf{\color{#d91a1a}-13.71\%}$
test_iql_speed[reduce-overhead-backward] 2.0897ms 2.0408ms 489.9992 Ops/s 478.4741 Ops/s $\color{#35bf28}+2.41\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.7289ms 6.2206ms 160.7550 Ops/s 156.8684 Ops/s $\color{#35bf28}+2.48\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.5309ms 0.2899ms 3.4491 KOps/s 3.5827 KOps/s $\color{#d91a1a}-3.73\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5682ms 0.3286ms 3.0432 KOps/s 3.8063 KOps/s $\textbf{\color{#d91a1a}-20.05\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.2383ms 5.9629ms 167.7025 Ops/s 164.5690 Ops/s $\color{#35bf28}+1.90\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9030ms 0.3068ms 3.2592 KOps/s 3.1708 KOps/s $\color{#35bf28}+2.79\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5116ms 0.2664ms 3.7542 KOps/s 3.7943 KOps/s $\color{#d91a1a}-1.06\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.6188ms 1.4209ms 703.7963 Ops/s 749.1211 Ops/s $\textbf{\color{#d91a1a}-6.05\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6394ms 1.3595ms 735.5817 Ops/s 782.2755 Ops/s $\textbf{\color{#d91a1a}-5.97\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.2991ms 6.1260ms 163.2387 Ops/s 159.1968 Ops/s $\color{#35bf28}+2.54\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.7564ms 0.4935ms 2.0264 KOps/s 2.3810 KOps/s $\textbf{\color{#d91a1a}-14.89\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7648ms 0.4795ms 2.0855 KOps/s 2.2923 KOps/s $\textbf{\color{#d91a1a}-9.02\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.2369ms 6.0625ms 164.9498 Ops/s 162.7970 Ops/s $\color{#35bf28}+1.32\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.3617ms 0.3436ms 2.9103 KOps/s 3.0650 KOps/s $\textbf{\color{#d91a1a}-5.05\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5648ms 0.3036ms 3.2942 KOps/s 3.6223 KOps/s $\textbf{\color{#d91a1a}-9.06\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 8.1695ms 5.9668ms 167.5938 Ops/s 165.0658 Ops/s $\color{#35bf28}+1.53\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.0823ms 0.3147ms 3.1779 KOps/s 3.0378 KOps/s $\color{#35bf28}+4.61\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6130ms 0.2964ms 3.3737 KOps/s 3.4826 KOps/s $\color{#d91a1a}-3.13\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.2771ms 6.1330ms 163.0523 Ops/s 159.3496 Ops/s $\color{#35bf28}+2.32\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.0887ms 0.4369ms 2.2889 KOps/s 2.0018 KOps/s $\textbf{\color{#35bf28}+14.34\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6173ms 0.4036ms 2.4776 KOps/s 2.0703 KOps/s $\textbf{\color{#35bf28}+19.67\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.1307ms 5.5455ms 180.3263 Ops/s 177.1630 Ops/s $\color{#35bf28}+1.79\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.7661ms 2.1094ms 474.0656 Ops/s 432.7247 Ops/s $\textbf{\color{#35bf28}+9.55\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.0022ms 1.2504ms 799.7483 Ops/s 781.1793 Ops/s $\color{#35bf28}+2.38\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4819s 15.1914ms 65.8267 Ops/s 178.9473 Ops/s $\textbf{\color{#d91a1a}-63.21\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 10.4132ms 2.1100ms 473.9402 Ops/s 426.1246 Ops/s $\textbf{\color{#35bf28}+11.22\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 3.3968ms 1.1820ms 845.9999 Ops/s 809.5087 Ops/s $\color{#35bf28}+4.51\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 9.6453ms 5.8209ms 171.7957 Ops/s 29.3418 Ops/s $\textbf{\color{#35bf28}+485.50\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 8.3255ms 2.2454ms 445.3641 Ops/s 442.4182 Ops/s $\color{#35bf28}+0.67\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 9.2232ms 1.4619ms 684.0553 Ops/s 711.7255 Ops/s $\color{#d91a1a}-3.89\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 14.3259ms 13.8837ms 72.0271 Ops/s 70.6838 Ops/s $\color{#35bf28}+1.90\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 20.6973ms 18.1954ms 54.9589 Ops/s 58.6886 Ops/s $\textbf{\color{#d91a1a}-6.36\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.6381ms 18.1806ms 55.0038 Ops/s 53.1551 Ops/s $\color{#35bf28}+3.48\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.9904ms 17.2531ms 57.9607 Ops/s 57.0562 Ops/s $\color{#35bf28}+1.59\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 18.6750ms 18.1559ms 55.0785 Ops/s 53.3408 Ops/s $\color{#35bf28}+3.26\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 0.4197s 26.7904ms 37.3268 Ops/s 53.2544 Ops/s $\textbf{\color{#d91a1a}-29.91\%}$
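
For readers checking the table, the Change column appears to be the relative difference between the PR's Ops/s and the Ops/s on repo HEAD, and Ops/s appears to be the reciprocal of the mean time. The sketch below is a minimal illustration of that arithmetic, not the benchmark bot's actual code; the helper names `relative_change` and `ops_from_mean` are hypothetical, and the figures are copied from three rows of the table above.

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change in throughput of the PR run relative to repo HEAD."""
    return (ops_pr - ops_head) / ops_head * 100.0


def ops_from_mean(mean_seconds: float) -> float:
    """Operations per second implied by a mean duration given in seconds."""
    return 1.0 / mean_seconds


# (Ops/s on this PR, Ops/s on repo HEAD), copied from the table rows above.
rows = {
    "test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400]": (65.8267, 178.9473),
    "test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400]": (171.7957, 29.3418),
    "test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False]": (37.3268, 53.2544),
}

for name, (ops_pr, ops_head) in rows.items():
    # Print the signed percentage with two decimals, as in the Change column.
    print(f"{name}: {relative_change(ops_pr, ops_head):+.2f}%")

# Mean of 26.7904 ms for the last row, converted to seconds.
print(f"{ops_from_mean(26.7904e-3):.4f} Ops/s")
```

Under these assumptions the computed percentages agree with the reported Change values for the three rows. Rows whose Max is far above their Mean (for example 0.4819s against 15.1914ms) suggest that a few slow iterations may be driving the largest swings, so those entries are worth re-running before being read as real regressions.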

@vmoens force-pushed the fix-env-ci branch 6 times, most recently from 0d46818 to ac64cae on February 25, 2025 09:44
@vmoens merged commit 8dd1be7 into main on Feb 28, 2025
23 of 58 checks passed
vmoens pushed a commit that referenced this pull request Mar 8, 2025
(cherry picked from commit 8dd1be7)
vmoens pushed a commit that referenced this pull request Mar 10, 2025
(cherry picked from commit 8dd1be7)
@vmoens deleted the fix-env-ci branch on May 14, 2025 09:12
Labels
CI: Has to do with CI setup (e.g. wheels & builds, tests...)
CLA Signed: This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Data: Data-related PR, will launch data-related jobs
Environments: Adds or modifies an environment wrapper
2 participants