+
Skip to content

Conversation

vmoens
Copy link
Collaborator

@vmoens vmoens commented Feb 12, 2025

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Feb 12, 2025
ghstack-source-id: 0b4f78a
Pull Request resolved: #2786
Copy link

pytorch-bot bot commented Feb 12, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2786

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 12, 2025
Copy link

github-actions bot commented Feb 12, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}5$. Worsened: $\large\color{#d91a1a}3$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.6112s 0.5187s 1.9277 Ops/s 1.9168 Ops/s $\color{#35bf28}+0.57\%$
test_transformed 1.1017s 1.0137s 0.9865 Ops/s 0.9724 Ops/s $\color{#35bf28}+1.46\%$
test_serial 1.5902s 1.5005s 0.6665 Ops/s 0.6502 Ops/s $\color{#35bf28}+2.49\%$
test_parallel 1.3775s 1.2921s 0.7739 Ops/s 0.7710 Ops/s $\color{#35bf28}+0.38\%$
test_step_mdp_speed[True-True-True-True-True] 0.1588ms 30.6069μs 32.6724 KOps/s 33.3091 KOps/s $\color{#d91a1a}-1.91\%$
test_step_mdp_speed[True-True-True-True-False] 66.9640μs 18.2910μs 54.6716 KOps/s 56.5121 KOps/s $\color{#d91a1a}-3.26\%$
test_step_mdp_speed[True-True-True-False-True] 55.9640μs 17.3624μs 57.5957 KOps/s 58.8626 KOps/s $\color{#d91a1a}-2.15\%$
test_step_mdp_speed[True-True-True-False-False] 38.8220μs 10.2404μs 97.6520 KOps/s 100.1933 KOps/s $\color{#d91a1a}-2.54\%$
test_step_mdp_speed[True-True-False-True-True] 74.1670μs 32.9885μs 30.3136 KOps/s 30.9436 KOps/s $\color{#d91a1a}-2.04\%$
test_step_mdp_speed[True-True-False-True-False] 67.2450μs 20.1593μs 49.6050 KOps/s 50.8071 KOps/s $\color{#d91a1a}-2.37\%$
test_step_mdp_speed[True-True-False-False-True] 53.5490μs 19.3507μs 51.6777 KOps/s 52.8025 KOps/s $\color{#d91a1a}-2.13\%$
test_step_mdp_speed[True-True-False-False-False] 41.8180μs 12.1810μs 82.0948 KOps/s 84.0995 KOps/s $\color{#d91a1a}-2.38\%$
test_step_mdp_speed[True-False-True-True-True] 77.7250μs 34.6036μs 28.8987 KOps/s 29.4427 KOps/s $\color{#d91a1a}-1.85\%$
test_step_mdp_speed[True-False-True-True-False] 56.8960μs 22.2003μs 45.0444 KOps/s 46.4727 KOps/s $\color{#d91a1a}-3.07\%$
test_step_mdp_speed[True-False-True-False-True] 64.2090μs 19.3069μs 51.7951 KOps/s 53.0934 KOps/s $\color{#d91a1a}-2.45\%$
test_step_mdp_speed[True-False-True-False-False] 39.9140μs 12.1750μs 82.1355 KOps/s 84.4374 KOps/s $\color{#d91a1a}-2.73\%$
test_step_mdp_speed[True-False-False-True-True] 0.7523ms 36.0969μs 27.7032 KOps/s 28.2336 KOps/s $\color{#d91a1a}-1.88\%$
test_step_mdp_speed[True-False-False-True-False] 68.9780μs 23.7739μs 42.0629 KOps/s 43.1245 KOps/s $\color{#d91a1a}-2.46\%$
test_step_mdp_speed[True-False-False-False-True] 53.8700μs 20.8709μs 47.9135 KOps/s 48.7416 KOps/s $\color{#d91a1a}-1.70\%$
test_step_mdp_speed[True-False-False-False-False] 42.7900μs 13.9796μs 71.5327 KOps/s 73.6577 KOps/s $\color{#d91a1a}-2.88\%$
test_step_mdp_speed[False-True-True-True-True] 66.0730μs 34.5096μs 28.9775 KOps/s 29.3934 KOps/s $\color{#d91a1a}-1.42\%$
test_step_mdp_speed[False-True-True-True-False] 59.4800μs 22.0911μs 45.2671 KOps/s 46.2350 KOps/s $\color{#d91a1a}-2.09\%$
test_step_mdp_speed[False-True-True-False-True] 56.9350μs 22.2095μs 45.0257 KOps/s 45.9868 KOps/s $\color{#d91a1a}-2.09\%$
test_step_mdp_speed[False-True-True-False-False] 43.4610μs 13.5520μs 73.7900 KOps/s 74.8729 KOps/s $\color{#d91a1a}-1.45\%$
test_step_mdp_speed[False-True-False-True-True] 70.9920μs 35.9984μs 27.7790 KOps/s 28.0124 KOps/s $\color{#d91a1a}-0.83\%$
test_step_mdp_speed[False-True-False-True-False] 69.3890μs 23.7894μs 42.0355 KOps/s 42.6220 KOps/s $\color{#d91a1a}-1.38\%$
test_step_mdp_speed[False-True-False-False-True] 2.7367ms 23.5764μs 42.4153 KOps/s 42.0340 KOps/s $\color{#35bf28}+0.91\%$
test_step_mdp_speed[False-True-False-False-False] 47.8800μs 15.3872μs 64.9892 KOps/s 67.0335 KOps/s $\color{#d91a1a}-3.05\%$
test_step_mdp_speed[False-False-True-True-True] 80.6800μs 38.0139μs 26.3062 KOps/s 26.4225 KOps/s $\color{#d91a1a}-0.44\%$
test_step_mdp_speed[False-False-True-True-False] 61.8150μs 25.9207μs 38.5792 KOps/s 39.8139 KOps/s $\color{#d91a1a}-3.10\%$
test_step_mdp_speed[False-False-True-False-True] 56.8350μs 23.7251μs 42.1495 KOps/s 42.7547 KOps/s $\color{#d91a1a}-1.42\%$
test_step_mdp_speed[False-False-True-False-False] 54.2700μs 15.4278μs 64.8181 KOps/s 66.6882 KOps/s $\color{#d91a1a}-2.80\%$
test_step_mdp_speed[False-False-False-True-True] 0.7405ms 39.7369μs 25.1656 KOps/s 25.6175 KOps/s $\color{#d91a1a}-1.76\%$
test_step_mdp_speed[False-False-False-True-False] 66.7140μs 27.6073μs 36.2222 KOps/s 37.1752 KOps/s $\color{#d91a1a}-2.56\%$
test_step_mdp_speed[False-False-False-False-True] 66.6940μs 24.9702μs 40.0478 KOps/s 40.4836 KOps/s $\color{#d91a1a}-1.08\%$
test_step_mdp_speed[False-False-False-False-False] 53.1880μs 17.1479μs 58.3163 KOps/s 59.7556 KOps/s $\color{#d91a1a}-2.41\%$
test_values[generalized_advantage_estimate-True-True] 9.9396ms 9.4699ms 105.5976 Ops/s 101.5049 Ops/s $\color{#35bf28}+4.03\%$
test_values[vec_generalized_advantage_estimate-True-True] 26.1745ms 23.9719ms 41.7155 Ops/s 41.6040 Ops/s $\color{#35bf28}+0.27\%$
test_values[td0_return_estimate-False-False] 0.2541ms 0.1791ms 5.5819 KOps/s 5.6534 KOps/s $\color{#d91a1a}-1.26\%$
test_values[td1_return_estimate-False-False] 27.4136ms 24.0322ms 41.6109 Ops/s 41.3623 Ops/s $\color{#35bf28}+0.60\%$
test_values[vec_td1_return_estimate-False-False] 26.0405ms 24.1882ms 41.3425 Ops/s 41.5019 Ops/s $\color{#d91a1a}-0.38\%$
test_values[td_lambda_return_estimate-True-False] 37.0045ms 34.6490ms 28.8609 Ops/s 28.7225 Ops/s $\color{#35bf28}+0.48\%$
test_values[vec_td_lambda_return_estimate-True-False] 26.0265ms 24.1405ms 41.4241 Ops/s 41.1863 Ops/s $\color{#35bf28}+0.58\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.6371ms 8.4102ms 118.9029 Ops/s 116.5738 Ops/s $\color{#35bf28}+2.00\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.1080ms 1.8888ms 529.4447 Ops/s 511.1610 Ops/s $\color{#35bf28}+3.58\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.6009ms 0.3665ms 2.7284 KOps/s 2.6641 KOps/s $\color{#35bf28}+2.42\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 43.1304ms 41.1861ms 24.2800 Ops/s 23.8697 Ops/s $\color{#35bf28}+1.72\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.2290ms 3.4296ms 291.5816 Ops/s 292.1540 Ops/s $\color{#d91a1a}-0.20\%$
test_dqn_speed[False-None] 6.5984ms 1.4277ms 700.4297 Ops/s 692.1826 Ops/s $\color{#35bf28}+1.19\%$
test_dqn_speed[False-backward] 2.0856ms 1.9114ms 523.1762 Ops/s 520.8102 Ops/s $\color{#35bf28}+0.45\%$
test_dqn_speed[True-None] 2.2588ms 0.4982ms 2.0074 KOps/s 1.9874 KOps/s $\color{#35bf28}+1.00\%$
test_dqn_speed[True-backward] 0.9831ms 0.9132ms 1.0951 KOps/s 1.0499 KOps/s $\color{#35bf28}+4.30\%$
test_dqn_speed[reduce-overhead-None] 0.7380ms 0.4940ms 2.0245 KOps/s 1.9926 KOps/s $\color{#35bf28}+1.60\%$
test_dqn_speed[reduce-overhead-backward] 1.0293ms 0.9295ms 1.0759 KOps/s 1.0691 KOps/s $\color{#35bf28}+0.63\%$
test_ddpg_speed[False-None] 3.1786ms 2.9218ms 342.2576 Ops/s 338.7847 Ops/s $\color{#35bf28}+1.03\%$
test_ddpg_speed[False-backward] 5.2717ms 4.1141ms 243.0663 Ops/s 245.0608 Ops/s $\color{#d91a1a}-0.81\%$
test_ddpg_speed[True-None] 1.6902ms 1.2367ms 808.6176 Ops/s 772.2719 Ops/s $\color{#35bf28}+4.71\%$
test_ddpg_speed[True-backward] 2.2095ms 2.1268ms 470.1906 Ops/s 460.2803 Ops/s $\color{#35bf28}+2.15\%$
test_ddpg_speed[reduce-overhead-None] 1.7549ms 1.2456ms 802.8430 Ops/s 790.2900 Ops/s $\color{#35bf28}+1.59\%$
test_ddpg_speed[reduce-overhead-backward] 2.4588ms 2.1537ms 464.3077 Ops/s 460.6870 Ops/s $\color{#35bf28}+0.79\%$
test_sac_speed[False-None] 9.7601ms 8.0998ms 123.4600 Ops/s 121.8112 Ops/s $\color{#35bf28}+1.35\%$
test_sac_speed[False-backward] 11.2118ms 10.7936ms 92.6472 Ops/s 91.3379 Ops/s $\color{#35bf28}+1.43\%$
test_sac_speed[True-None] 2.7952ms 2.1103ms 473.8559 Ops/s 466.6513 Ops/s $\color{#35bf28}+1.54\%$
test_sac_speed[True-backward] 3.8610ms 3.7910ms 263.7803 Ops/s 260.8090 Ops/s $\color{#35bf28}+1.14\%$
test_sac_speed[reduce-overhead-None] 2.6868ms 2.1089ms 474.1750 Ops/s 465.6656 Ops/s $\color{#35bf28}+1.83\%$
test_sac_speed[reduce-overhead-backward] 3.8995ms 3.7988ms 263.2389 Ops/s 260.6843 Ops/s $\color{#35bf28}+0.98\%$
test_redq_speed[False-None] 14.4896ms 13.0809ms 76.4471 Ops/s 77.9982 Ops/s $\color{#d91a1a}-1.99\%$
test_redq_speed[False-backward] 24.2236ms 22.6475ms 44.1550 Ops/s 43.7700 Ops/s $\color{#35bf28}+0.88\%$
test_redq_speed[True-None] 6.6845ms 5.0424ms 198.3179 Ops/s 202.9437 Ops/s $\color{#d91a1a}-2.28\%$
test_redq_speed[True-backward] 13.0131ms 12.0418ms 83.0439 Ops/s 80.4563 Ops/s $\color{#35bf28}+3.22\%$
test_redq_speed[reduce-overhead-None] 5.3134ms 4.7878ms 208.8663 Ops/s 202.3134 Ops/s $\color{#35bf28}+3.24\%$
test_redq_speed[reduce-overhead-backward] 12.7413ms 12.1878ms 82.0489 Ops/s 77.3942 Ops/s $\textbf{\color{#35bf28}+6.01\%}$
test_redq_deprec_speed[False-None] 14.4042ms 12.5750ms 79.5227 Ops/s 77.1459 Ops/s $\color{#35bf28}+3.08\%$
test_redq_deprec_speed[False-backward] 20.7885ms 18.3497ms 54.4967 Ops/s 52.5976 Ops/s $\color{#35bf28}+3.61\%$
test_redq_deprec_speed[True-None] 4.5115ms 3.8370ms 260.6212 Ops/s 256.2772 Ops/s $\color{#35bf28}+1.70\%$
test_redq_deprec_speed[True-backward] 9.0621ms 8.2538ms 121.1557 Ops/s 119.1187 Ops/s $\color{#35bf28}+1.71\%$
test_redq_deprec_speed[reduce-overhead-None] 4.2132ms 3.8199ms 261.7854 Ops/s 255.2254 Ops/s $\color{#35bf28}+2.57\%$
test_redq_deprec_speed[reduce-overhead-backward] 8.3615ms 8.1955ms 122.0178 Ops/s 117.7818 Ops/s $\color{#35bf28}+3.60\%$
test_td3_speed[False-None] 8.4064ms 8.0505ms 124.2166 Ops/s 121.3962 Ops/s $\color{#35bf28}+2.32\%$
test_td3_speed[False-backward] 12.1219ms 10.4563ms 95.6358 Ops/s 93.7084 Ops/s $\color{#35bf28}+2.06\%$
test_td3_speed[True-None] 1.9594ms 1.8097ms 552.5746 Ops/s 536.7132 Ops/s $\color{#35bf28}+2.96\%$
test_td3_speed[True-backward] 3.7343ms 3.4137ms 292.9366 Ops/s 288.1978 Ops/s $\color{#35bf28}+1.64\%$
test_td3_speed[reduce-overhead-None] 1.9913ms 1.7959ms 556.8339 Ops/s 535.2429 Ops/s $\color{#35bf28}+4.03\%$
test_td3_speed[reduce-overhead-backward] 3.4724ms 3.4003ms 294.0908 Ops/s 282.2393 Ops/s $\color{#35bf28}+4.20\%$
test_cql_speed[False-None] 39.6218ms 36.8337ms 27.1491 Ops/s 26.9047 Ops/s $\color{#35bf28}+0.91\%$
test_cql_speed[False-backward] 51.5524ms 47.2537ms 21.1624 Ops/s 21.0193 Ops/s $\color{#35bf28}+0.68\%$
test_cql_speed[True-None] 17.0491ms 15.8917ms 62.9261 Ops/s 61.2712 Ops/s $\color{#35bf28}+2.70\%$
test_cql_speed[True-backward] 22.8482ms 22.0839ms 45.2819 Ops/s 42.6110 Ops/s $\textbf{\color{#35bf28}+6.27\%}$
test_cql_speed[reduce-overhead-None] 18.1979ms 16.0254ms 62.4009 Ops/s 61.3531 Ops/s $\color{#35bf28}+1.71\%$
test_cql_speed[reduce-overhead-backward] 24.3895ms 22.7470ms 43.9619 Ops/s 43.2085 Ops/s $\color{#35bf28}+1.74\%$
test_a2c_speed[False-None] 7.8430ms 7.0995ms 140.8559 Ops/s 136.8043 Ops/s $\color{#35bf28}+2.96\%$
test_a2c_speed[False-backward] 15.6735ms 14.1283ms 70.7797 Ops/s 69.7802 Ops/s $\color{#35bf28}+1.43\%$
test_a2c_speed[True-None] 4.0079ms 3.7014ms 270.1689 Ops/s 264.9449 Ops/s $\color{#35bf28}+1.97\%$
test_a2c_speed[True-backward] 10.7395ms 10.1849ms 98.1844 Ops/s 97.5440 Ops/s $\color{#35bf28}+0.66\%$
test_a2c_speed[reduce-overhead-None] 4.2897ms 3.7087ms 269.6385 Ops/s 267.1948 Ops/s $\color{#35bf28}+0.91\%$
test_a2c_speed[reduce-overhead-backward] 12.0644ms 10.2100ms 97.9436 Ops/s 97.3151 Ops/s $\color{#35bf28}+0.65\%$
test_ppo_speed[False-None] 8.1391ms 7.3882ms 135.3514 Ops/s 133.4298 Ops/s $\color{#35bf28}+1.44\%$
test_ppo_speed[False-backward] 14.7574ms 14.5200ms 68.8707 Ops/s 68.8048 Ops/s $\color{#35bf28}+0.10\%$
test_ppo_speed[True-None] 4.6524ms 4.0769ms 245.2825 Ops/s 238.2890 Ops/s $\color{#35bf28}+2.93\%$
test_ppo_speed[True-backward] 10.9987ms 10.0358ms 99.6435 Ops/s 94.7830 Ops/s $\textbf{\color{#35bf28}+5.13\%}$
test_ppo_speed[reduce-overhead-None] 5.0047ms 4.0829ms 244.9220 Ops/s 240.0186 Ops/s $\color{#35bf28}+2.04\%$
test_ppo_speed[reduce-overhead-backward] 11.5377ms 10.2128ms 97.9167 Ops/s 99.2208 Ops/s $\color{#d91a1a}-1.31\%$
test_reinforce_speed[False-None] 8.0387ms 6.4869ms 154.1574 Ops/s 151.5751 Ops/s $\color{#35bf28}+1.70\%$
test_reinforce_speed[False-backward] 9.9068ms 9.7186ms 102.8952 Ops/s 99.7267 Ops/s $\color{#35bf28}+3.18\%$
test_reinforce_speed[True-None] 3.4380ms 3.0520ms 327.6556 Ops/s 320.0284 Ops/s $\color{#35bf28}+2.38\%$
test_reinforce_speed[True-backward] 9.3817ms 9.0195ms 110.8706 Ops/s 110.3867 Ops/s $\color{#35bf28}+0.44\%$
test_reinforce_speed[reduce-overhead-None] 3.7017ms 3.0554ms 327.2855 Ops/s 317.5538 Ops/s $\color{#35bf28}+3.06\%$
test_reinforce_speed[reduce-overhead-backward] 9.3729ms 9.0147ms 110.9303 Ops/s 109.5556 Ops/s $\color{#35bf28}+1.25\%$
test_iql_speed[False-None] 33.3886ms 32.0365ms 31.2144 Ops/s 30.2226 Ops/s $\color{#35bf28}+3.28\%$
test_iql_speed[False-backward] 0.3601s 51.2477ms 19.5131 Ops/s 21.8093 Ops/s $\textbf{\color{#d91a1a}-10.53\%}$
test_iql_speed[True-None] 15.0986ms 11.3620ms 88.0130 Ops/s 88.0714 Ops/s $\color{#d91a1a}-0.07\%$
test_iql_speed[True-backward] 22.8931ms 22.0060ms 45.4421 Ops/s 45.1595 Ops/s $\color{#35bf28}+0.63\%$
test_iql_speed[reduce-overhead-None] 11.8211ms 11.3064ms 88.4454 Ops/s 88.8576 Ops/s $\color{#d91a1a}-0.46\%$
test_iql_speed[reduce-overhead-backward] 23.2041ms 22.0277ms 45.3975 Ops/s 44.9437 Ops/s $\color{#35bf28}+1.01\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 4.8650ms 4.7031ms 212.6254 Ops/s 207.3434 Ops/s $\color{#35bf28}+2.55\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.5194ms 0.5162ms 1.9374 KOps/s 1.9220 KOps/s $\color{#35bf28}+0.80\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8555ms 0.4881ms 2.0489 KOps/s 2.0515 KOps/s $\color{#d91a1a}-0.13\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.3021ms 4.4823ms 223.1010 Ops/s 219.9386 Ops/s $\color{#35bf28}+1.44\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.2299ms 0.4983ms 2.0070 KOps/s 1.9761 KOps/s $\color{#35bf28}+1.56\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.8122ms 0.4794ms 2.0859 KOps/s 2.0730 KOps/s $\color{#35bf28}+0.62\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.0983ms 1.6512ms 605.6315 Ops/s 599.3754 Ops/s $\color{#35bf28}+1.04\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.2533ms 1.5668ms 638.2331 Ops/s 630.1579 Ops/s $\color{#35bf28}+1.28\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.3540ms 4.6730ms 213.9965 Ops/s 206.8694 Ops/s $\color{#35bf28}+3.45\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.2694ms 0.6498ms 1.5390 KOps/s 1.5326 KOps/s $\color{#35bf28}+0.42\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8272ms 0.6171ms 1.6205 KOps/s 1.5995 KOps/s $\color{#35bf28}+1.32\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.0127ms 4.5130ms 221.5828 Ops/s 213.3531 Ops/s $\color{#35bf28}+3.86\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8157ms 0.5175ms 1.9322 KOps/s 1.9188 KOps/s $\color{#35bf28}+0.70\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7134ms 0.4883ms 2.0481 KOps/s 2.0535 KOps/s $\color{#d91a1a}-0.26\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.3182ms 4.5274ms 220.8783 Ops/s 218.3161 Ops/s $\color{#35bf28}+1.17\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9899ms 0.4993ms 2.0028 KOps/s 1.9814 KOps/s $\color{#35bf28}+1.08\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7399ms 0.4840ms 2.0660 KOps/s 2.0360 KOps/s $\color{#35bf28}+1.47\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.2902ms 4.6486ms 215.1193 Ops/s 210.5049 Ops/s $\color{#35bf28}+2.19\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.1067ms 0.6550ms 1.5268 KOps/s 1.5370 KOps/s $\color{#d91a1a}-0.66\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9609ms 0.6187ms 1.6162 KOps/s 1.5973 KOps/s $\color{#35bf28}+1.18\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.4920ms 4.1984ms 238.1858 Ops/s 224.3135 Ops/s $\textbf{\color{#35bf28}+6.18\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.7307ms 2.3568ms 424.3020 Ops/s 420.5103 Ops/s $\color{#35bf28}+0.90\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 4.0442ms 1.2882ms 776.2933 Ops/s 828.5744 Ops/s $\textbf{\color{#d91a1a}-6.31\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 5.5380ms 4.3022ms 232.4392 Ops/s 232.7844 Ops/s $\color{#d91a1a}-0.15\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 0.4335s 11.0467ms 90.5251 Ops/s 414.5154 Ops/s $\textbf{\color{#d91a1a}-78.16\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 2.0485ms 1.2992ms 769.6875 Ops/s 751.0498 Ops/s $\color{#35bf28}+2.48\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 6.1724ms 4.4534ms 224.5486 Ops/s 227.0811 Ops/s $\color{#d91a1a}-1.12\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 6.2482ms 2.5066ms 398.9404 Ops/s 413.0497 Ops/s $\color{#d91a1a}-3.42\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.3447ms 1.5594ms 641.2611 Ops/s 661.7343 Ops/s $\color{#d91a1a}-3.09\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 14.3244ms 11.6545ms 85.8041 Ops/s 80.8099 Ops/s $\textbf{\color{#35bf28}+6.18\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 16.0358ms 14.7432ms 67.8278 Ops/s 67.2710 Ops/s $\color{#35bf28}+0.83\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 22.4070ms 20.6851ms 48.3440 Ops/s 46.0971 Ops/s $\color{#35bf28}+4.87\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 16.3528ms 14.8173ms 67.4888 Ops/s 66.0134 Ops/s $\color{#35bf28}+2.23\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 21.5519ms 20.4706ms 48.8506 Ops/s 47.0686 Ops/s $\color{#35bf28}+3.79\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 18.0820ms 16.2475ms 61.5481 Ops/s 61.2225 Ops/s $\color{#35bf28}+0.53\%$

Copy link

github-actions bot commented Feb 12, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}13$. Worsened: $\large\color{#d91a1a}19$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.9157s 0.8295s 1.2056 Ops/s 1.1945 Ops/s $\color{#35bf28}+0.93\%$
test_transformed 1.5258s 1.4393s 0.6948 Ops/s 0.6938 Ops/s $\color{#35bf28}+0.14\%$
test_serial 2.4294s 2.3397s 0.4274 Ops/s 0.4275 Ops/s $\color{#d91a1a}-0.02\%$
test_parallel 1.9912s 1.8950s 0.5277 Ops/s 0.5345 Ops/s $\color{#d91a1a}-1.27\%$
test_step_mdp_speed[True-True-True-True-True] 0.2291ms 39.9809μs 25.0119 KOps/s 25.6432 KOps/s $\color{#d91a1a}-2.46\%$
test_step_mdp_speed[True-True-True-True-False] 0.4005ms 23.3730μs 42.7844 KOps/s 42.5766 KOps/s $\color{#35bf28}+0.49\%$
test_step_mdp_speed[True-True-True-False-True] 0.4163ms 22.8011μs 43.8575 KOps/s 44.9058 KOps/s $\color{#d91a1a}-2.33\%$
test_step_mdp_speed[True-True-True-False-False] 0.1334ms 13.2677μs 75.3713 KOps/s 77.0343 KOps/s $\color{#d91a1a}-2.16\%$
test_step_mdp_speed[True-True-False-True-True] 0.4362ms 43.3353μs 23.0759 KOps/s 23.1585 KOps/s $\color{#d91a1a}-0.36\%$
test_step_mdp_speed[True-True-False-True-False] 0.4146ms 25.8597μs 38.6702 KOps/s 38.9072 KOps/s $\color{#d91a1a}-0.61\%$
test_step_mdp_speed[True-True-False-False-True] 58.3540μs 25.3076μs 39.5138 KOps/s 40.5657 KOps/s $\color{#d91a1a}-2.59\%$
test_step_mdp_speed[True-True-False-False-False] 0.4166ms 15.7155μs 63.6313 KOps/s 64.8348 KOps/s $\color{#d91a1a}-1.86\%$
test_step_mdp_speed[True-False-True-True-True] 0.4369ms 45.2288μs 22.1098 KOps/s 22.0612 KOps/s $\color{#35bf28}+0.22\%$
test_step_mdp_speed[True-False-True-True-False] 0.4168ms 28.6250μs 34.9345 KOps/s 35.2013 KOps/s $\color{#d91a1a}-0.76\%$
test_step_mdp_speed[True-False-True-False-True] 0.1355ms 24.9398μs 40.0965 KOps/s 39.8190 KOps/s $\color{#35bf28}+0.70\%$
test_step_mdp_speed[True-False-True-False-False] 0.4024ms 15.7380μs 63.5404 KOps/s 64.9320 KOps/s $\color{#d91a1a}-2.14\%$
test_step_mdp_speed[True-False-False-True-True] 0.4311ms 47.8449μs 20.9009 KOps/s 20.9060 KOps/s $\color{#d91a1a}-0.02\%$
test_step_mdp_speed[True-False-False-True-False] 0.4190ms 30.9651μs 32.2944 KOps/s 32.6633 KOps/s $\color{#d91a1a}-1.13\%$
test_step_mdp_speed[True-False-False-False-True] 0.1164ms 27.7659μs 36.0154 KOps/s 37.0363 KOps/s $\color{#d91a1a}-2.76\%$
test_step_mdp_speed[True-False-False-False-False] 0.4124ms 17.9896μs 55.5876 KOps/s 57.2735 KOps/s $\color{#d91a1a}-2.94\%$
test_step_mdp_speed[False-True-True-True-True] 0.4270ms 45.7540μs 21.8560 KOps/s 21.9765 KOps/s $\color{#d91a1a}-0.55\%$
test_step_mdp_speed[False-True-True-True-False] 0.4199ms 28.8809μs 34.6249 KOps/s 35.8573 KOps/s $\color{#d91a1a}-3.44\%$
test_step_mdp_speed[False-True-True-False-True] 0.1543ms 29.5889μs 33.7964 KOps/s 34.9661 KOps/s $\color{#d91a1a}-3.35\%$
test_step_mdp_speed[False-True-True-False-False] 0.4275ms 17.5373μs 57.0212 KOps/s 58.1255 KOps/s $\color{#d91a1a}-1.90\%$
test_step_mdp_speed[False-True-False-True-True] 0.4541ms 47.8788μs 20.8861 KOps/s 20.9216 KOps/s $\color{#d91a1a}-0.17\%$
test_step_mdp_speed[False-True-False-True-False] 0.4340ms 30.8428μs 32.4225 KOps/s 33.1692 KOps/s $\color{#d91a1a}-2.25\%$
test_step_mdp_speed[False-True-False-False-True] 3.2495ms 31.7787μs 31.4676 KOps/s 32.1912 KOps/s $\color{#d91a1a}-2.25\%$
test_step_mdp_speed[False-True-False-False-False] 68.4840μs 19.3269μs 51.7414 KOps/s 51.7307 KOps/s $\color{#35bf28}+0.02\%$
test_step_mdp_speed[False-False-True-True-True] 0.4660ms 49.5250μs 20.1918 KOps/s 20.1467 KOps/s $\color{#35bf28}+0.22\%$
test_step_mdp_speed[False-False-True-True-False] 0.4206ms 33.3588μs 29.9771 KOps/s 30.2241 KOps/s $\color{#d91a1a}-0.82\%$
test_step_mdp_speed[False-False-True-False-True] 0.1432ms 30.6532μs 32.6231 KOps/s 32.1181 KOps/s $\color{#35bf28}+1.57\%$
test_step_mdp_speed[False-False-True-False-False] 0.4100ms 19.7904μs 50.5295 KOps/s 51.6115 KOps/s $\color{#d91a1a}-2.10\%$
test_step_mdp_speed[False-False-False-True-True] 0.4440ms 51.8384μs 19.2907 KOps/s 19.4500 KOps/s $\color{#d91a1a}-0.82\%$
test_step_mdp_speed[False-False-False-True-False] 0.4211ms 35.8608μs 27.8856 KOps/s 28.2028 KOps/s $\color{#d91a1a}-1.12\%$
test_step_mdp_speed[False-False-False-False-True] 0.4209ms 33.4509μs 29.8946 KOps/s 30.7092 KOps/s $\color{#d91a1a}-2.65\%$
test_step_mdp_speed[False-False-False-False-False] 52.4630μs 22.1268μs 45.1941 KOps/s 46.0910 KOps/s $\color{#d91a1a}-1.95\%$
test_values[generalized_advantage_estimate-True-True] 25.7653ms 25.0398ms 39.9365 Ops/s 37.7036 Ops/s $\textbf{\color{#35bf28}+5.92\%}$
test_values[vec_generalized_advantage_estimate-True-True] 0.1009s 2.9273ms 341.6098 Ops/s 303.1156 Ops/s $\textbf{\color{#35bf28}+12.70\%}$
test_values[td0_return_estimate-False-False] 0.1076ms 81.3414μs 12.2939 KOps/s 12.2768 KOps/s $\color{#35bf28}+0.14\%$
test_values[td1_return_estimate-False-False] 60.4711ms 57.2195ms 17.4766 Ops/s 16.9740 Ops/s $\color{#35bf28}+2.96\%$
test_values[vec_td1_return_estimate-False-False] 1.3341ms 1.0956ms 912.7006 Ops/s 905.3897 Ops/s $\color{#35bf28}+0.81\%$
test_values[td_lambda_return_estimate-True-False] 95.7042ms 90.8576ms 11.0062 Ops/s 11.1598 Ops/s $\color{#d91a1a}-1.38\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3186ms 1.0906ms 916.8956 Ops/s 919.1567 Ops/s $\color{#d91a1a}-0.25\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 25.4749ms 25.1508ms 39.7601 Ops/s 39.8203 Ops/s $\color{#d91a1a}-0.15\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0300ms 0.7640ms 1.3090 KOps/s 1.3058 KOps/s $\color{#35bf28}+0.24\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 1.0658ms 0.6844ms 1.4612 KOps/s 1.4716 KOps/s $\color{#d91a1a}-0.71\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.6658ms 1.4944ms 669.1715 Ops/s 669.7398 Ops/s $\color{#d91a1a}-0.08\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 1.1306ms 0.6991ms 1.4304 KOps/s 1.4352 KOps/s $\color{#d91a1a}-0.34\%$
test_dqn_speed[False-None] 6.8598ms 1.5406ms 649.0924 Ops/s 649.6637 Ops/s $\color{#d91a1a}-0.09\%$
test_dqn_speed[False-backward] 2.2988ms 2.1374ms 467.8544 Ops/s 462.9400 Ops/s $\color{#35bf28}+1.06\%$
test_dqn_speed[True-None] 0.9990ms 0.5712ms 1.7506 KOps/s 1.7200 KOps/s $\color{#35bf28}+1.78\%$
test_dqn_speed[True-backward] 1.4355ms 1.2704ms 787.1561 Ops/s 845.6361 Ops/s $\textbf{\color{#d91a1a}-6.92\%}$
test_dqn_speed[reduce-overhead-None] 1.0562ms 0.6306ms 1.5857 KOps/s 1.6736 KOps/s $\textbf{\color{#d91a1a}-5.25\%}$
test_dqn_speed[reduce-overhead-backward] 1.2320ms 1.0960ms 912.3961 Ops/s 1.0049 KOps/s $\textbf{\color{#d91a1a}-9.20\%}$
test_ddpg_speed[False-None] 3.3330ms 2.9643ms 337.3428 Ops/s 345.0480 Ops/s $\color{#d91a1a}-2.23\%$
test_ddpg_speed[False-backward] 4.9181ms 4.3197ms 231.4952 Ops/s 237.0164 Ops/s $\color{#d91a1a}-2.33\%$
test_ddpg_speed[True-None] 1.8831ms 1.4431ms 692.9349 Ops/s 721.2439 Ops/s $\color{#d91a1a}-3.93\%$
test_ddpg_speed[True-backward] 2.7822ms 2.6159ms 382.2826 Ops/s 379.6312 Ops/s $\color{#35bf28}+0.70\%$
test_ddpg_speed[reduce-overhead-None] 1.7895ms 1.3848ms 722.1371 Ops/s 712.2555 Ops/s $\color{#35bf28}+1.39\%$
test_ddpg_speed[reduce-overhead-backward] 2.2071ms 2.0726ms 482.4903 Ops/s 482.4604 Ops/s $+0.01\%$
test_sac_speed[False-None] 8.4718ms 8.0838ms 123.7037 Ops/s 122.9985 Ops/s $\color{#35bf28}+0.57\%$
test_sac_speed[False-backward] 12.1268ms 11.3604ms 88.0250 Ops/s 88.4524 Ops/s $\color{#d91a1a}-0.48\%$
test_sac_speed[True-None] 2.1543ms 1.8940ms 527.9864 Ops/s 520.8272 Ops/s $\color{#35bf28}+1.37\%$
test_sac_speed[True-backward] 4.0194ms 3.8092ms 262.5214 Ops/s 256.6788 Ops/s $\color{#35bf28}+2.28\%$
test_sac_speed[reduce-overhead-None] 20.9621ms 11.8782ms 84.1880 Ops/s 83.3210 Ops/s $\color{#35bf28}+1.04\%$
test_sac_speed[reduce-overhead-backward] 2.0619ms 1.8296ms 546.5813 Ops/s 583.5567 Ops/s $\textbf{\color{#d91a1a}-6.34\%}$
test_redq_speed[False-None] 7.9161ms 7.5426ms 132.5803 Ops/s 130.8297 Ops/s $\color{#35bf28}+1.34\%$
test_redq_speed[False-backward] 12.1839ms 11.7525ms 85.0885 Ops/s 86.6714 Ops/s $\color{#d91a1a}-1.83\%$
test_redq_speed[True-None] 2.5572ms 2.3823ms 419.7613 Ops/s 414.8144 Ops/s $\color{#35bf28}+1.19\%$
test_redq_speed[True-backward] 4.7435ms 4.3041ms 232.3342 Ops/s 231.0836 Ops/s $\color{#35bf28}+0.54\%$
test_redq_speed[reduce-overhead-None] 2.6078ms 2.3972ms 417.1498 Ops/s 410.4601 Ops/s $\color{#35bf28}+1.63\%$
test_redq_speed[reduce-overhead-backward] 4.7024ms 4.2776ms 233.7778 Ops/s 241.8356 Ops/s $\color{#d91a1a}-3.33\%$
test_redq_deprec_speed[False-None] 9.4126ms 9.0934ms 109.9699 Ops/s 109.8467 Ops/s $\color{#35bf28}+0.11\%$
test_redq_deprec_speed[False-backward] 13.0821ms 12.4678ms 80.2065 Ops/s 82.6306 Ops/s $\color{#d91a1a}-2.93\%$
test_redq_deprec_speed[True-None] 2.9856ms 2.7026ms 370.0093 Ops/s 366.3237 Ops/s $\color{#35bf28}+1.01\%$
test_redq_deprec_speed[True-backward] 5.0009ms 4.5754ms 218.5579 Ops/s 221.1437 Ops/s $\color{#d91a1a}-1.17\%$
test_redq_deprec_speed[reduce-overhead-None] 2.9996ms 2.7101ms 368.9965 Ops/s 364.8135 Ops/s $\color{#35bf28}+1.15\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.9592ms 4.5526ms 219.6550 Ops/s 224.8288 Ops/s $\color{#d91a1a}-2.30\%$
test_td3_speed[False-None] 8.0515ms 7.9798ms 125.3162 Ops/s 123.4836 Ops/s $\color{#35bf28}+1.48\%$
test_td3_speed[False-backward] 11.4242ms 10.6391ms 93.9926 Ops/s 96.7536 Ops/s $\color{#d91a1a}-2.85\%$
test_td3_speed[True-None] 1.8271ms 1.7089ms 585.1695 Ops/s 551.6476 Ops/s $\textbf{\color{#35bf28}+6.08\%}$
test_td3_speed[True-backward] 3.6006ms 3.4211ms 292.3035 Ops/s 284.0469 Ops/s $\color{#35bf28}+2.91\%$
test_td3_speed[reduce-overhead-None] 52.0046ms 26.6103ms 37.5794 Ops/s 37.9263 Ops/s $\color{#d91a1a}-0.91\%$
test_td3_speed[reduce-overhead-backward] 1.7347ms 1.5354ms 651.2761 Ops/s 625.6074 Ops/s $\color{#35bf28}+4.10\%$
test_cql_speed[False-None] 17.2664ms 16.8651ms 59.2940 Ops/s 55.9525 Ops/s $\textbf{\color{#35bf28}+5.97\%}$
test_cql_speed[False-backward] 22.8814ms 22.4633ms 44.5171 Ops/s 44.9738 Ops/s $\color{#d91a1a}-1.02\%$
test_cql_speed[True-None] 3.6091ms 3.3526ms 298.2721 Ops/s 290.2633 Ops/s $\color{#35bf28}+2.76\%$
test_cql_speed[True-backward] 6.1143ms 5.7619ms 173.5541 Ops/s 172.8010 Ops/s $\color{#35bf28}+0.44\%$
test_cql_speed[reduce-overhead-None] 20.5062ms 12.9532ms 77.2009 Ops/s 74.7994 Ops/s $\color{#35bf28}+3.21\%$
test_cql_speed[reduce-overhead-backward] 2.3678ms 2.0625ms 484.8552 Ops/s 479.2641 Ops/s $\color{#35bf28}+1.17\%$
test_a2c_speed[False-None] 3.6394ms 3.2253ms 310.0526 Ops/s 306.1929 Ops/s $\color{#35bf28}+1.26\%$
test_a2c_speed[False-backward] 7.1424ms 6.4041ms 156.1497 Ops/s 153.4725 Ops/s $\color{#35bf28}+1.74\%$
test_a2c_speed[True-None] 1.6223ms 1.3838ms 722.6345 Ops/s 709.9893 Ops/s $\color{#35bf28}+1.78\%$
test_a2c_speed[True-backward] 3.2739ms 3.1200ms 320.5138 Ops/s 317.5488 Ops/s $\color{#35bf28}+0.93\%$
test_a2c_speed[reduce-overhead-None] 15.7371ms 8.7979ms 113.6636 Ops/s 112.5553 Ops/s $\color{#35bf28}+0.98\%$
test_a2c_speed[reduce-overhead-backward] 1.7584ms 1.6375ms 610.6889 Ops/s 659.4412 Ops/s $\textbf{\color{#d91a1a}-7.39\%}$
test_ppo_speed[False-None] 4.0690ms 3.7773ms 264.7375 Ops/s 265.9641 Ops/s $\color{#d91a1a}-0.46\%$
test_ppo_speed[False-backward] 7.5011ms 7.1214ms 140.4217 Ops/s 142.2555 Ops/s $\color{#d91a1a}-1.29\%$
test_ppo_speed[True-None] 1.6503ms 1.4471ms 691.0536 Ops/s 679.3168 Ops/s $\color{#35bf28}+1.73\%$
test_ppo_speed[True-backward] 3.5350ms 3.2487ms 307.8171 Ops/s 317.0197 Ops/s $\color{#d91a1a}-2.90\%$
test_ppo_speed[reduce-overhead-None] 1.3777ms 0.9996ms 1.0004 KOps/s 988.8032 Ops/s $\color{#35bf28}+1.18\%$
test_ppo_speed[reduce-overhead-backward] 1.9048ms 1.6038ms 623.5364 Ops/s 673.0305 Ops/s $\textbf{\color{#d91a1a}-7.35\%}$
test_reinforce_speed[False-None] 2.6687ms 2.3199ms 431.0466 Ops/s 433.7598 Ops/s $\color{#d91a1a}-0.63\%$
test_reinforce_speed[False-backward] 3.9684ms 3.4253ms 291.9443 Ops/s 298.6606 Ops/s $\color{#d91a1a}-2.25\%$
test_reinforce_speed[True-None] 1.5527ms 1.3220ms 756.4325 Ops/s 709.2133 Ops/s $\textbf{\color{#35bf28}+6.66\%}$
test_reinforce_speed[True-backward] 3.3549ms 3.1259ms 319.9119 Ops/s 335.2922 Ops/s $\color{#d91a1a}-4.59\%$
test_reinforce_speed[reduce-overhead-None] 17.9757ms 9.9544ms 100.4580 Ops/s 98.6254 Ops/s $\color{#35bf28}+1.86\%$
test_reinforce_speed[reduce-overhead-backward] 1.8048ms 1.6805ms 595.0780 Ops/s 585.9766 Ops/s $\color{#35bf28}+1.55\%$
test_iql_speed[False-None] 9.6978ms 9.2519ms 108.0855 Ops/s 106.0951 Ops/s $\color{#35bf28}+1.88\%$
test_iql_speed[False-backward] 13.7883ms 13.2695ms 75.3610 Ops/s 74.1563 Ops/s $\color{#35bf28}+1.62\%$
test_iql_speed[True-None] 2.4397ms 2.2722ms 440.1093 Ops/s 422.4295 Ops/s $\color{#35bf28}+4.19\%$
test_iql_speed[True-backward] 5.4947ms 5.0149ms 199.4057 Ops/s 192.8812 Ops/s $\color{#35bf28}+3.38\%$
test_iql_speed[reduce-overhead-None] 0.4906s 12.5913ms 79.4200 Ops/s 89.6243 Ops/s $\textbf{\color{#d91a1a}-11.39\%}$
test_iql_speed[reduce-overhead-backward] 2.2953ms 2.1336ms 468.6870 Ops/s 456.8962 Ops/s $\color{#35bf28}+2.58\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.7674ms 6.2940ms 158.8822 Ops/s 155.0974 Ops/s $\color{#35bf28}+2.44\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.6989ms 0.3559ms 2.8096 KOps/s 3.6763 KOps/s $\textbf{\color{#d91a1a}-23.58\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4956ms 0.2537ms 3.9420 KOps/s 4.0586 KOps/s $\color{#d91a1a}-2.87\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.4722ms 6.0311ms 165.8079 Ops/s 163.7953 Ops/s $\color{#35bf28}+1.23\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.8546ms 0.2635ms 3.7952 KOps/s 3.2122 KOps/s $\textbf{\color{#35bf28}+18.15\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5243ms 0.2464ms 4.0580 KOps/s 4.2033 KOps/s $\color{#d91a1a}-3.46\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.5234ms 1.2651ms 790.4709 Ops/s 702.3276 Ops/s $\textbf{\color{#35bf28}+12.55\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6213ms 1.3202ms 757.4621 Ops/s 849.2648 Ops/s $\textbf{\color{#d91a1a}-10.81\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5743ms 6.2652ms 159.6130 Ops/s 158.6807 Ops/s $\color{#35bf28}+0.59\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.1161ms 0.4608ms 2.1700 KOps/s 2.4148 KOps/s $\textbf{\color{#d91a1a}-10.14\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7461ms 0.4233ms 2.3624 KOps/s 2.5617 KOps/s $\textbf{\color{#d91a1a}-7.78\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 9.9057ms 6.1563ms 162.4348 Ops/s 162.4731 Ops/s $\color{#d91a1a}-0.02\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.7987ms 0.3109ms 3.2166 KOps/s 3.7338 KOps/s $\textbf{\color{#d91a1a}-13.85\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 1.2426ms 0.3377ms 2.9615 KOps/s 4.1028 KOps/s $\textbf{\color{#d91a1a}-27.82\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.5575ms 6.0480ms 165.3447 Ops/s 162.7479 Ops/s $\color{#35bf28}+1.60\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.6470ms 0.2961ms 3.3773 KOps/s 2.9780 KOps/s $\textbf{\color{#35bf28}+13.41\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5206ms 0.2533ms 3.9477 KOps/s 4.1194 KOps/s $\color{#d91a1a}-4.17\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5554ms 6.2240ms 160.6692 Ops/s 158.1134 Ops/s $\color{#35bf28}+1.62\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.2098ms 0.4179ms 2.3927 KOps/s 2.0373 KOps/s $\textbf{\color{#35bf28}+17.45\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7835ms 0.3946ms 2.5343 KOps/s 2.2074 KOps/s $\textbf{\color{#35bf28}+14.81\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.2759ms 5.6359ms 177.4341 Ops/s 179.2187 Ops/s $\color{#d91a1a}-1.00\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 3.8995ms 1.8589ms 537.9577 Ops/s 436.7766 Ops/s $\textbf{\color{#35bf28}+23.17\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 8.7959ms 1.2766ms 783.3005 Ops/s 844.6234 Ops/s $\textbf{\color{#d91a1a}-7.26\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4516s 14.6833ms 68.1048 Ops/s 180.4403 Ops/s $\textbf{\color{#d91a1a}-62.26\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.4327ms 2.0804ms 480.6714 Ops/s 430.4584 Ops/s $\textbf{\color{#35bf28}+11.67\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.1211ms 1.2335ms 810.6817 Ops/s 865.9297 Ops/s $\textbf{\color{#d91a1a}-6.38\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.5879ms 5.9068ms 169.2968 Ops/s 31.0837 Ops/s $\textbf{\color{#35bf28}+444.65\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.8506ms 2.2041ms 453.6932 Ops/s 502.2579 Ops/s $\textbf{\color{#d91a1a}-9.67\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.8014ms 1.3911ms 718.8398 Ops/s 794.8948 Ops/s $\textbf{\color{#d91a1a}-9.57\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 14.3594ms 13.6459ms 73.2823 Ops/s 71.8036 Ops/s $\color{#35bf28}+2.06\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.2683ms 16.7909ms 59.5562 Ops/s 58.3196 Ops/s $\color{#35bf28}+2.12\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.2141ms 17.9823ms 55.6102 Ops/s 54.1727 Ops/s $\color{#35bf28}+2.65\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 19.1659ms 17.1256ms 58.3920 Ops/s 57.7126 Ops/s $\color{#35bf28}+1.18\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 0.3924s 25.4869ms 39.2359 Ops/s 54.3637 Ops/s $\textbf{\color{#d91a1a}-27.83\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 20.3477ms 18.5655ms 53.8634 Ops/s 53.8291 Ops/s $\color{#35bf28}+0.06\%$

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Feb 13, 2025
ghstack-source-id: 12d19f6
Pull Request resolved: #2786
@vmoens vmoens added the documentation Improvements or additions to documentation label Feb 13, 2025
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Feb 13, 2025
ghstack-source-id: 708ec96
Pull Request resolved: #2786
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Feb 17, 2025
ghstack-source-id: 1a83451
Pull Request resolved: #2786
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Feb 17, 2025
ghstack-source-id: ac1f3da
Pull Request resolved: #2786
@vmoens vmoens merged commit bd40596 into gh/vmoens/90/base Feb 17, 2025
9 of 20 checks passed
vmoens pushed a commit that referenced this pull request Feb 17, 2025
ghstack-source-id: ac1f3da
Pull Request resolved: #2786
@vmoens vmoens deleted the gh/vmoens/90/head branch February 17, 2025 20:19
vmoens pushed a commit that referenced this pull request Feb 17, 2025
ghstack-source-id: ac1f3da
Pull Request resolved: #2786

(cherry picked from commit 03d6586)
vmoens pushed a commit that referenced this pull request Feb 18, 2025
ghstack-source-id: ac1f3da
Pull Request resolved: #2786

(cherry picked from commit 03d6586)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载