Skip to content

[Versioning] Better usage of pyproject.toml #3089

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 4 commits into
base: main
Choose a base branch
from
Open

[Versioning] Better usage of pyproject.toml #3089

wants to merge 4 commits into from

Conversation

vmoens
Copy link
Collaborator

@vmoens vmoens commented Jul 23, 2025

No description provided.

Copy link

pytorch-bot bot commented Jul 23, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/3089

Note: Links to docs will display an error until the docs builds have been completed.

❌ 16 New Failures, 1 Cancelled Job, 2 Unrelated Failures

As of commit 53bd649 with merge base b4c7188 (image):

NEW FAILURES - The following jobs have failed:

CANCELLED JOB - The following job was cancelled. Please retry:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 23, 2025
@vmoens vmoens force-pushed the poetry branch 10 times, most recently from fb77ec9 to 1e7e6ea Compare July 24, 2025 02:30
@vmoens vmoens added CI Has to do with CI setup (e.g. wheels & builds, tests...) versioning Versioning change (version number etc) labels Jul 24, 2025
@vmoens vmoens force-pushed the poetry branch 7 times, most recently from 3415ae7 to 51e57a8 Compare July 25, 2025 10:35
Copy link

github-actions bot commented Jul 25, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 154. Improved: $\large\color{#35bf28}16$. Worsened: $\large\color{#d91a1a}14$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_tensor_to_bytestream_speed[pickle] 82.4173μs 81.2967μs 12.3006 KOps/s 11.9378 KOps/s $\color{#35bf28}+3.04\%$
test_tensor_to_bytestream_speed[torch.save] 0.1414ms 0.1399ms 7.1460 KOps/s 6.7251 KOps/s $\textbf{\color{#35bf28}+6.26\%}$
test_tensor_to_bytestream_speed[untyped_storage] 0.1089s 0.1087s 9.1974 Ops/s 8.7798 Ops/s $\color{#35bf28}+4.76\%$
test_tensor_to_bytestream_speed[numpy] 2.7764μs 2.7693μs 361.0991 KOps/s 359.8983 KOps/s $\color{#35bf28}+0.33\%$
test_tensor_to_bytestream_speed[safetensors] 40.4171μs 40.2245μs 24.8605 KOps/s 22.5638 KOps/s $\textbf{\color{#35bf28}+10.18\%}$
test_simple 0.5400s 0.5391s 1.8550 Ops/s 1.7736 Ops/s $\color{#35bf28}+4.59\%$
test_transformed 1.1067s 1.1051s 0.9049 Ops/s 0.8780 Ops/s $\color{#35bf28}+3.06\%$
test_serial 1.6550s 1.6512s 0.6056 Ops/s 0.5943 Ops/s $\color{#35bf28}+1.91\%$
test_parallel 1.1666s 1.0805s 0.9255 Ops/s 0.9067 Ops/s $\color{#35bf28}+2.07\%$
test_step_mdp_speed[True-True-True-True-True] 0.2029ms 45.2314μs 22.1085 KOps/s 22.2025 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[True-True-True-True-False] 0.1316ms 25.3360μs 39.4696 KOps/s 39.5593 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[True-True-True-False-True] 52.5020μs 25.3949μs 39.3780 KOps/s 38.5536 KOps/s $\color{#35bf28}+2.14\%$
test_step_mdp_speed[True-True-True-False-False] 41.6510μs 14.0273μs 71.2895 KOps/s 71.5917 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[True-True-False-True-True] 81.3830μs 48.4513μs 20.6393 KOps/s 20.6872 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[True-True-False-True-False] 63.3920μs 27.8461μs 35.9117 KOps/s 35.5326 KOps/s $\color{#35bf28}+1.07\%$
test_step_mdp_speed[True-True-False-False-True] 0.1663ms 28.9321μs 34.5637 KOps/s 34.8424 KOps/s $\color{#d91a1a}-0.80\%$
test_step_mdp_speed[True-True-False-False-False] 41.5410μs 16.8945μs 59.1907 KOps/s 59.7044 KOps/s $\color{#d91a1a}-0.86\%$
test_step_mdp_speed[True-False-True-True-True] 82.9620μs 51.6076μs 19.3770 KOps/s 19.6499 KOps/s $\color{#d91a1a}-1.39\%$
test_step_mdp_speed[True-False-True-True-False] 56.7720μs 30.5267μs 32.7582 KOps/s 32.6195 KOps/s $\color{#35bf28}+0.43\%$
test_step_mdp_speed[True-False-True-False-True] 59.2520μs 28.8074μs 34.7133 KOps/s 35.1937 KOps/s $\color{#d91a1a}-1.37\%$
test_step_mdp_speed[True-False-True-False-False] 42.5520μs 16.6913μs 59.9115 KOps/s 60.2447 KOps/s $\color{#d91a1a}-0.55\%$
test_step_mdp_speed[True-False-False-True-True] 82.7930μs 54.2768μs 18.4241 KOps/s 18.5777 KOps/s $\color{#d91a1a}-0.83\%$
test_step_mdp_speed[True-False-False-True-False] 77.9820μs 33.1204μs 30.1928 KOps/s 29.8326 KOps/s $\color{#35bf28}+1.21\%$
test_step_mdp_speed[True-False-False-False-True] 0.1022ms 31.2887μs 31.9604 KOps/s 32.1797 KOps/s $\color{#d91a1a}-0.68\%$
test_step_mdp_speed[True-False-False-False-False] 93.6020μs 19.4356μs 51.4520 KOps/s 51.7744 KOps/s $\color{#d91a1a}-0.62\%$
test_step_mdp_speed[False-True-True-True-True] 95.3230μs 51.6538μs 19.3596 KOps/s 19.8214 KOps/s $\color{#d91a1a}-2.33\%$
test_step_mdp_speed[False-True-True-True-False] 63.2710μs 30.9468μs 32.3136 KOps/s 32.5232 KOps/s $\color{#d91a1a}-0.64\%$
test_step_mdp_speed[False-True-True-False-True] 0.2202ms 32.6783μs 30.6014 KOps/s 30.8503 KOps/s $\color{#d91a1a}-0.81\%$
test_step_mdp_speed[False-True-True-False-False] 58.2520μs 18.5601μs 53.8790 KOps/s 52.4944 KOps/s $\color{#35bf28}+2.64\%$
test_step_mdp_speed[False-True-False-True-True] 2.6933ms 54.3832μs 18.3880 KOps/s 18.5300 KOps/s $\color{#d91a1a}-0.77\%$
test_step_mdp_speed[False-True-False-True-False] 65.5620μs 33.4128μs 29.9287 KOps/s 29.9744 KOps/s $\color{#d91a1a}-0.15\%$
test_step_mdp_speed[False-True-False-False-True] 68.1320μs 35.3705μs 28.2721 KOps/s 28.6889 KOps/s $\color{#d91a1a}-1.45\%$
test_step_mdp_speed[False-True-False-False-False] 48.8010μs 21.6051μs 46.2853 KOps/s 46.1954 KOps/s $\color{#35bf28}+0.19\%$
test_step_mdp_speed[False-False-True-True-True] 97.6030μs 57.1245μs 17.5056 KOps/s 17.7476 KOps/s $\color{#d91a1a}-1.36\%$
test_step_mdp_speed[False-False-True-True-False] 75.4220μs 36.1274μs 27.6798 KOps/s 27.8261 KOps/s $\color{#d91a1a}-0.53\%$
test_step_mdp_speed[False-False-True-False-True] 69.4020μs 35.0098μs 28.5634 KOps/s 28.7213 KOps/s $\color{#d91a1a}-0.55\%$
test_step_mdp_speed[False-False-True-False-False] 0.1560ms 21.5368μs 46.4321 KOps/s 46.4885 KOps/s $\color{#d91a1a}-0.12\%$
test_step_mdp_speed[False-False-False-True-True] 86.6720μs 59.4053μs 16.8335 KOps/s 17.0965 KOps/s $\color{#d91a1a}-1.54\%$
test_step_mdp_speed[False-False-False-True-False] 67.3120μs 38.9425μs 25.6789 KOps/s 26.0696 KOps/s $\color{#d91a1a}-1.50\%$
test_step_mdp_speed[False-False-False-False-True] 66.1220μs 37.4457μs 26.7053 KOps/s 26.6577 KOps/s $\color{#35bf28}+0.18\%$
test_step_mdp_speed[False-False-False-False-False] 46.5320μs 23.6830μs 42.2244 KOps/s 42.2097 KOps/s $\color{#35bf28}+0.03\%$
test_values[generalized_advantage_estimate-True-True] 10.5056ms 10.2072ms 97.9698 Ops/s 93.7961 Ops/s $\color{#35bf28}+4.45\%$
test_values[vec_generalized_advantage_estimate-True-True] 19.3567ms 17.5992ms 56.8208 Ops/s 88.9504 Ops/s $\textbf{\color{#d91a1a}-36.12\%}$
test_values[td0_return_estimate-False-False] 0.1994ms 0.1234ms 8.1050 KOps/s 7.4988 KOps/s $\textbf{\color{#35bf28}+8.08\%}$
test_values[td1_return_estimate-False-False] 27.4736ms 27.1140ms 36.8813 Ops/s 36.3601 Ops/s $\color{#35bf28}+1.43\%$
test_values[vec_td1_return_estimate-False-False] 18.4052ms 17.7030ms 56.4875 Ops/s 88.0591 Ops/s $\textbf{\color{#d91a1a}-35.85\%}$
test_values[td_lambda_return_estimate-True-False] 40.5058ms 39.8718ms 25.0804 Ops/s 24.6820 Ops/s $\color{#35bf28}+1.61\%$
test_values[vec_td_lambda_return_estimate-True-False] 18.1611ms 17.7402ms 56.3690 Ops/s 88.3886 Ops/s $\textbf{\color{#d91a1a}-36.23\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 9.3127ms 9.1990ms 108.7074 Ops/s 106.9624 Ops/s $\color{#35bf28}+1.63\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.7647ms 1.5154ms 659.8989 Ops/s 663.8681 Ops/s $\color{#d91a1a}-0.60\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5583ms 0.4066ms 2.4594 KOps/s 2.4533 KOps/s $\color{#35bf28}+0.25\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 36.5086ms 34.3072ms 29.1484 Ops/s 32.7986 Ops/s $\textbf{\color{#d91a1a}-11.13\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 1.9091ms 1.7443ms 573.3085 Ops/s 571.9070 Ops/s $\color{#35bf28}+0.25\%$
test_dqn_speed[False-None] 6.2021ms 1.3740ms 727.8259 Ops/s 721.2410 Ops/s $\color{#35bf28}+0.91\%$
test_dqn_speed[False-backward] 1.9892ms 1.8855ms 530.3535 Ops/s 536.1259 Ops/s $\color{#d91a1a}-1.08\%$
test_dqn_speed[True-None] 0.6812ms 0.5213ms 1.9181 KOps/s 1.8230 KOps/s $\textbf{\color{#35bf28}+5.22\%}$
test_dqn_speed[True-backward] 1.0002ms 0.9665ms 1.0347 KOps/s 851.3031 Ops/s $\textbf{\color{#35bf28}+21.54\%}$
test_dqn_speed[reduce-overhead-None] 0.6888ms 0.5213ms 1.9183 KOps/s 1.8028 KOps/s $\textbf{\color{#35bf28}+6.41\%}$
test_dqn_speed[reduce-overhead-backward] 0.9910ms 0.9610ms 1.0406 KOps/s 1.0221 KOps/s $\color{#35bf28}+1.81\%$
test_ddpg_speed[False-None] 3.1110ms 2.7856ms 358.9955 Ops/s 354.4838 Ops/s $\color{#35bf28}+1.27\%$
test_ddpg_speed[False-backward] 4.2090ms 3.9653ms 252.1876 Ops/s 250.2501 Ops/s $\color{#35bf28}+0.77\%$
test_ddpg_speed[True-None] 1.5426ms 1.3748ms 727.3924 Ops/s 714.8572 Ops/s $\color{#35bf28}+1.75\%$
test_ddpg_speed[True-backward] 2.5326ms 2.3663ms 422.5990 Ops/s 345.7137 Ops/s $\textbf{\color{#35bf28}+22.24\%}$
test_ddpg_speed[reduce-overhead-None] 1.5672ms 1.3809ms 724.1812 Ops/s 694.5455 Ops/s $\color{#35bf28}+4.27\%$
test_ddpg_speed[reduce-overhead-backward] 2.3911ms 2.3511ms 425.3249 Ops/s 421.1655 Ops/s $\color{#35bf28}+0.99\%$
test_sac_speed[False-None] 7.9426ms 7.5549ms 132.3639 Ops/s 131.2036 Ops/s $\color{#35bf28}+0.88\%$
test_sac_speed[False-backward] 10.9517ms 10.6978ms 93.4768 Ops/s 92.5268 Ops/s $\color{#35bf28}+1.03\%$
test_sac_speed[True-None] 2.3423ms 2.1400ms 467.2921 Ops/s 460.7425 Ops/s $\color{#35bf28}+1.42\%$
test_sac_speed[True-backward] 4.2189ms 4.0643ms 246.0451 Ops/s 253.6214 Ops/s $\color{#d91a1a}-2.99\%$
test_sac_speed[reduce-overhead-None] 2.3914ms 2.1740ms 459.9858 Ops/s 469.4645 Ops/s $\color{#d91a1a}-2.02\%$
test_sac_speed[reduce-overhead-backward] 4.3675ms 4.1200ms 242.7181 Ops/s 257.3980 Ops/s $\textbf{\color{#d91a1a}-5.70\%}$
test_redq_speed[False-None] 13.3964ms 10.2244ms 97.8057 Ops/s 99.7328 Ops/s $\color{#d91a1a}-1.93\%$
test_redq_speed[False-backward] 18.3572ms 17.6753ms 56.5761 Ops/s 59.6893 Ops/s $\textbf{\color{#d91a1a}-5.22\%}$
test_redq_speed[True-None] 4.8184ms 4.5201ms 221.2355 Ops/s 219.9164 Ops/s $\color{#35bf28}+0.60\%$
test_redq_speed[True-backward] 10.3534ms 9.9488ms 100.5150 Ops/s 98.0917 Ops/s $\color{#35bf28}+2.47\%$
test_redq_speed[reduce-overhead-None] 4.6726ms 4.4638ms 224.0246 Ops/s 232.1357 Ops/s $\color{#d91a1a}-3.49\%$
test_redq_speed[reduce-overhead-backward] 10.2583ms 9.9160ms 100.8474 Ops/s 105.2803 Ops/s $\color{#d91a1a}-4.21\%$
test_redq_deprec_speed[False-None] 12.4976ms 10.6796ms 93.6365 Ops/s 95.6222 Ops/s $\color{#d91a1a}-2.08\%$
test_redq_deprec_speed[False-backward] 16.1428ms 15.5264ms 64.4065 Ops/s 66.3873 Ops/s $\color{#d91a1a}-2.98\%$
test_redq_deprec_speed[True-None] 3.8735ms 3.6120ms 276.8537 Ops/s 259.7125 Ops/s $\textbf{\color{#35bf28}+6.60\%}$
test_redq_deprec_speed[True-backward] 7.9958ms 7.5833ms 131.8683 Ops/s 127.3597 Ops/s $\color{#35bf28}+3.54\%$
test_redq_deprec_speed[reduce-overhead-None] 3.7970ms 3.5738ms 279.8136 Ops/s 266.6222 Ops/s $\color{#35bf28}+4.95\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.7990ms 7.4969ms 133.3878 Ops/s 126.4393 Ops/s $\textbf{\color{#35bf28}+5.50\%}$
test_td3_speed[False-None] 7.8858ms 7.6704ms 130.3720 Ops/s 130.8191 Ops/s $\color{#d91a1a}-0.34\%$
test_td3_speed[False-backward] 10.8966ms 10.4345ms 95.8357 Ops/s 96.4906 Ops/s $\color{#d91a1a}-0.68\%$
test_td3_speed[True-None] 1.8488ms 1.7954ms 556.9643 Ops/s 565.6180 Ops/s $\color{#d91a1a}-1.53\%$
test_td3_speed[True-backward] 3.8755ms 3.6819ms 271.6023 Ops/s 289.5590 Ops/s $\textbf{\color{#d91a1a}-6.20\%}$
test_td3_speed[reduce-overhead-None] 1.8589ms 1.8095ms 552.6241 Ops/s 565.1310 Ops/s $\color{#d91a1a}-2.21\%$
test_td3_speed[reduce-overhead-backward] 3.8141ms 3.6214ms 276.1387 Ops/s 296.8089 Ops/s $\textbf{\color{#d91a1a}-6.96\%}$
test_cql_speed[False-None] 0.2831s 32.1703ms 31.0845 Ops/s 39.6026 Ops/s $\textbf{\color{#d91a1a}-21.51\%}$
test_cql_speed[False-backward] 40.0190ms 35.3797ms 28.2648 Ops/s 29.2044 Ops/s $\color{#d91a1a}-3.22\%$
test_cql_speed[True-None] 12.7128ms 12.2696ms 81.5020 Ops/s 81.3716 Ops/s $\color{#35bf28}+0.16\%$
test_cql_speed[True-backward] 18.7609ms 18.3373ms 54.5338 Ops/s 57.0713 Ops/s $\color{#d91a1a}-4.45\%$
test_cql_speed[reduce-overhead-None] 12.6663ms 12.3353ms 81.0679 Ops/s 81.5459 Ops/s $\color{#d91a1a}-0.59\%$
test_cql_speed[reduce-overhead-backward] 20.5454ms 18.0674ms 55.3484 Ops/s 57.0410 Ops/s $\color{#d91a1a}-2.97\%$
test_a2c_speed[False-None] 5.3612ms 5.1369ms 194.6691 Ops/s 186.1983 Ops/s $\color{#35bf28}+4.55\%$
test_a2c_speed[False-backward] 11.9439ms 11.6557ms 85.7947 Ops/s 83.9366 Ops/s $\color{#35bf28}+2.21\%$
test_a2c_speed[True-None] 3.9023ms 3.6721ms 272.3219 Ops/s 262.9510 Ops/s $\color{#35bf28}+3.56\%$
test_a2c_speed[True-backward] 8.8922ms 8.6791ms 115.2192 Ops/s 113.5635 Ops/s $\color{#35bf28}+1.46\%$
test_a2c_speed[reduce-overhead-None] 3.8712ms 3.6940ms 270.7121 Ops/s 263.6614 Ops/s $\color{#35bf28}+2.67\%$
test_a2c_speed[reduce-overhead-backward] 8.7490ms 8.5111ms 117.4932 Ops/s 113.9742 Ops/s $\color{#35bf28}+3.09\%$
test_ppo_speed[False-None] 5.9490ms 5.6460ms 177.1161 Ops/s 174.3374 Ops/s $\color{#35bf28}+1.59\%$
test_ppo_speed[False-backward] 12.5284ms 12.2235ms 81.8095 Ops/s 81.5320 Ops/s $\color{#35bf28}+0.34\%$
test_ppo_speed[True-None] 4.1620ms 3.6654ms 272.8251 Ops/s 264.1781 Ops/s $\color{#35bf28}+3.27\%$
test_ppo_speed[True-backward] 8.8376ms 8.4859ms 117.8420 Ops/s 116.4291 Ops/s $\color{#35bf28}+1.21\%$
test_ppo_speed[reduce-overhead-None] 4.8647ms 3.7328ms 267.8943 Ops/s 265.6762 Ops/s $\color{#35bf28}+0.83\%$
test_ppo_speed[reduce-overhead-backward] 8.8243ms 8.4945ms 117.7235 Ops/s 117.4625 Ops/s $\color{#35bf28}+0.22\%$
test_reinforce_speed[False-None] 4.7300ms 4.4426ms 225.0937 Ops/s 214.4561 Ops/s $\color{#35bf28}+4.96\%$
test_reinforce_speed[False-backward] 7.5705ms 7.3079ms 136.8382 Ops/s 133.3757 Ops/s $\color{#35bf28}+2.60\%$
test_reinforce_speed[True-None] 3.2497ms 2.8545ms 350.3287 Ops/s 334.9562 Ops/s $\color{#35bf28}+4.59\%$
test_reinforce_speed[True-backward] 7.8938ms 7.6324ms 131.0198 Ops/s 129.2008 Ops/s $\color{#35bf28}+1.41\%$
test_reinforce_speed[reduce-overhead-None] 3.0905ms 2.8655ms 348.9801 Ops/s 344.0253 Ops/s $\color{#35bf28}+1.44\%$
test_reinforce_speed[reduce-overhead-backward] 7.9343ms 7.6741ms 130.3089 Ops/s 129.0604 Ops/s $\color{#35bf28}+0.97\%$
test_iql_speed[False-None] 24.3073ms 19.7069ms 50.7437 Ops/s 50.3604 Ops/s $\color{#35bf28}+0.76\%$
test_iql_speed[False-backward] 30.8078ms 29.9128ms 33.4306 Ops/s 33.3526 Ops/s $\color{#35bf28}+0.23\%$
test_iql_speed[True-None] 8.7480ms 8.5125ms 117.4745 Ops/s 116.4419 Ops/s $\color{#35bf28}+0.89\%$
test_iql_speed[True-backward] 16.9397ms 16.5043ms 60.5904 Ops/s 59.3296 Ops/s $\color{#35bf28}+2.13\%$
test_iql_speed[reduce-overhead-None] 8.8894ms 8.5700ms 116.6864 Ops/s 115.8224 Ops/s $\color{#35bf28}+0.75\%$
test_iql_speed[reduce-overhead-backward] 17.0213ms 16.5831ms 60.3022 Ops/s 59.7279 Ops/s $\color{#35bf28}+0.96\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.4525ms 6.0977ms 163.9969 Ops/s 160.9527 Ops/s $\color{#35bf28}+1.89\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.5958ms 0.3175ms 3.1498 KOps/s 3.3766 KOps/s $\textbf{\color{#d91a1a}-6.72\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5396ms 0.2761ms 3.6214 KOps/s 3.3167 KOps/s $\textbf{\color{#35bf28}+9.19\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.1543ms 5.8472ms 171.0213 Ops/s 169.5747 Ops/s $\color{#35bf28}+0.85\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9422ms 0.2587ms 3.8652 KOps/s 3.8837 KOps/s $\color{#d91a1a}-0.48\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4865ms 0.2403ms 4.1615 KOps/s 4.1912 KOps/s $\color{#d91a1a}-0.71\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.4910ms 1.2109ms 825.8261 Ops/s 740.8267 Ops/s $\textbf{\color{#35bf28}+11.47\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.3517ms 1.1282ms 886.3356 Ops/s 886.5063 Ops/s $\color{#d91a1a}-0.02\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.1810ms 5.9968ms 166.7565 Ops/s 163.5725 Ops/s $\color{#35bf28}+1.95\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.1300ms 0.5007ms 1.9971 KOps/s 2.1887 KOps/s $\textbf{\color{#d91a1a}-8.76\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7066ms 0.4627ms 2.1614 KOps/s 2.1920 KOps/s $\color{#d91a1a}-1.40\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.3206ms 5.9028ms 169.4103 Ops/s 165.6111 Ops/s $\color{#35bf28}+2.29\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.5882ms 0.3399ms 2.9416 KOps/s 3.7745 KOps/s $\textbf{\color{#d91a1a}-22.06\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5192ms 0.3095ms 3.2306 KOps/s 3.2498 KOps/s $\color{#d91a1a}-0.59\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.1563ms 5.8515ms 170.8960 Ops/s 168.3755 Ops/s $\color{#35bf28}+1.50\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.7322ms 0.3198ms 3.1274 KOps/s 3.0069 KOps/s $\color{#35bf28}+4.01\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5567ms 0.3033ms 3.2971 KOps/s 4.1088 KOps/s $\textbf{\color{#d91a1a}-19.76\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.1964ms 6.0253ms 165.9677 Ops/s 163.0213 Ops/s $\color{#35bf28}+1.81\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.9381ms 0.4051ms 2.4687 KOps/s 2.4830 KOps/s $\color{#d91a1a}-0.57\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.5773ms 0.3839ms 2.6050 KOps/s 2.6073 KOps/s $\color{#d91a1a}-0.09\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.9888ms 5.4380ms 183.8903 Ops/s 178.7332 Ops/s $\color{#35bf28}+2.89\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 8.5172ms 2.2879ms 437.0808 Ops/s 438.7555 Ops/s $\color{#d91a1a}-0.38\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.4928ms 1.2322ms 811.5265 Ops/s 795.0897 Ops/s $\color{#35bf28}+2.07\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4505s 14.4595ms 69.1588 Ops/s 59.4251 Ops/s $\textbf{\color{#35bf28}+16.38\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.3159ms 2.0466ms 488.6066 Ops/s 564.0807 Ops/s $\textbf{\color{#d91a1a}-13.38\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.1245ms 1.1397ms 877.3948 Ops/s 800.5867 Ops/s $\textbf{\color{#35bf28}+9.59\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.2235ms 5.6715ms 176.3187 Ops/s 171.7392 Ops/s $\color{#35bf28}+2.67\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.6020ms 2.2411ms 446.2102 Ops/s 451.4515 Ops/s $\color{#d91a1a}-1.16\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 3.6754ms 1.3152ms 760.3168 Ops/s 712.9689 Ops/s $\textbf{\color{#35bf28}+6.64\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 62.9478ms 59.7741ms 16.7297 Ops/s 16.8301 Ops/s $\color{#d91a1a}-0.60\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 17.8309ms 16.2693ms 61.4655 Ops/s 58.4512 Ops/s $\textbf{\color{#35bf28}+5.16\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 59.4388ms 57.7516ms 17.3155 Ops/s 16.8906 Ops/s $\color{#35bf28}+2.52\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.1347ms 16.6214ms 60.1635 Ops/s 59.5551 Ops/s $\color{#35bf28}+1.02\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 59.8677ms 57.9116ms 17.2677 Ops/s 17.0818 Ops/s $\color{#35bf28}+1.09\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 19.3048ms 17.7977ms 56.1869 Ops/s 53.3780 Ops/s $\textbf{\color{#35bf28}+5.26\%}$

Copy link

github-actions bot commented Jul 25, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 148. Improved: $\large\color{#35bf28}29$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_tensor_to_bytestream_speed[pickle] 80.2784μs 79.3963μs 12.5950 KOps/s 12.5284 KOps/s $\color{#35bf28}+0.53\%$
test_tensor_to_bytestream_speed[torch.save] 0.1385ms 0.1375ms 7.2728 KOps/s 7.2981 KOps/s $\color{#d91a1a}-0.35\%$
test_tensor_to_bytestream_speed[untyped_storage] 0.1114s 0.1107s 9.0298 Ops/s 8.7165 Ops/s $\color{#35bf28}+3.59\%$
test_tensor_to_bytestream_speed[numpy] 2.7489μs 2.7448μs 364.3252 KOps/s 361.4657 KOps/s $\color{#35bf28}+0.79\%$
test_tensor_to_bytestream_speed[safetensors] 41.3400μs 39.9170μs 25.0520 KOps/s 25.2022 KOps/s $\color{#d91a1a}-0.60\%$
test_simple 0.7635s 0.7623s 1.3118 Ops/s 1.2621 Ops/s $\color{#35bf28}+3.94\%$
test_transformed 1.4652s 1.3858s 0.7216 Ops/s 0.7111 Ops/s $\color{#35bf28}+1.47\%$
test_serial 2.3197s 2.2448s 0.4455 Ops/s 0.4388 Ops/s $\color{#35bf28}+1.51\%$
test_parallel 1.9082s 1.8620s 0.5371 Ops/s 0.5245 Ops/s $\color{#35bf28}+2.39\%$
test_step_mdp_speed[True-True-True-True-True] 0.1477ms 42.9595μs 23.2777 KOps/s 23.3343 KOps/s $\color{#d91a1a}-0.24\%$
test_step_mdp_speed[True-True-True-True-False] 56.0610μs 24.2400μs 41.2541 KOps/s 40.6283 KOps/s $\color{#35bf28}+1.54\%$
test_step_mdp_speed[True-True-True-False-True] 51.4110μs 23.9689μs 41.7207 KOps/s 41.4232 KOps/s $\color{#35bf28}+0.72\%$
test_step_mdp_speed[True-True-True-False-False] 41.3110μs 13.5626μs 73.7322 KOps/s 74.0440 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[True-True-False-True-True] 74.2210μs 45.8446μs 21.8128 KOps/s 22.1488 KOps/s $\color{#d91a1a}-1.52\%$
test_step_mdp_speed[True-True-False-True-False] 67.1410μs 26.8134μs 37.2948 KOps/s 36.8312 KOps/s $\color{#35bf28}+1.26\%$
test_step_mdp_speed[True-True-False-False-True] 75.6010μs 26.7668μs 37.3597 KOps/s 37.3177 KOps/s $\color{#35bf28}+0.11\%$
test_step_mdp_speed[True-True-False-False-False] 46.6610μs 16.1440μs 61.9425 KOps/s 62.0361 KOps/s $\color{#d91a1a}-0.15\%$
test_step_mdp_speed[True-False-True-True-True] 91.3110μs 47.7065μs 20.9615 KOps/s 20.7320 KOps/s $\color{#35bf28}+1.11\%$
test_step_mdp_speed[True-False-True-True-False] 59.9310μs 29.3793μs 34.0375 KOps/s 33.8702 KOps/s $\color{#35bf28}+0.49\%$
test_step_mdp_speed[True-False-True-False-True] 58.7710μs 26.6021μs 37.5911 KOps/s 37.4000 KOps/s $\color{#35bf28}+0.51\%$
test_step_mdp_speed[True-False-True-False-False] 42.4410μs 15.7471μs 63.5039 KOps/s 62.2814 KOps/s $\color{#35bf28}+1.96\%$
test_step_mdp_speed[True-False-False-True-True] 80.9420μs 50.3042μs 19.8790 KOps/s 19.7278 KOps/s $\color{#35bf28}+0.77\%$
test_step_mdp_speed[True-False-False-True-False] 67.6610μs 32.1219μs 31.1314 KOps/s 30.8449 KOps/s $\color{#35bf28}+0.93\%$
test_step_mdp_speed[True-False-False-False-True] 6.0024ms 29.9041μs 33.4402 KOps/s 33.8794 KOps/s $\color{#d91a1a}-1.30\%$
test_step_mdp_speed[True-False-False-False-False] 49.6010μs 18.7802μs 53.2474 KOps/s 53.2127 KOps/s $\color{#35bf28}+0.07\%$
test_step_mdp_speed[False-True-True-True-True] 81.4010μs 47.9221μs 20.8672 KOps/s 20.3971 KOps/s $\color{#35bf28}+2.30\%$
test_step_mdp_speed[False-True-True-True-False] 0.1097ms 29.7403μs 33.6245 KOps/s 33.1699 KOps/s $\color{#35bf28}+1.37\%$
test_step_mdp_speed[False-True-True-False-True] 72.8410μs 30.6455μs 32.6312 KOps/s 32.5424 KOps/s $\color{#35bf28}+0.27\%$
test_step_mdp_speed[False-True-True-False-False] 41.3210μs 18.2123μs 54.9078 KOps/s 54.5032 KOps/s $\color{#35bf28}+0.74\%$
test_step_mdp_speed[False-True-False-True-True] 2.7546ms 51.8442μs 19.2886 KOps/s 19.2289 KOps/s $\color{#35bf28}+0.31\%$
test_step_mdp_speed[False-True-False-True-False] 68.9410μs 32.0519μs 31.1994 KOps/s 30.5612 KOps/s $\color{#35bf28}+2.09\%$
test_step_mdp_speed[False-True-False-False-True] 67.5110μs 32.4894μs 30.7793 KOps/s 30.2876 KOps/s $\color{#35bf28}+1.62\%$
test_step_mdp_speed[False-True-False-False-False] 70.2210μs 20.5445μs 48.6748 KOps/s 47.9209 KOps/s $\color{#35bf28}+1.57\%$
test_step_mdp_speed[False-False-True-True-True] 83.0610μs 53.5970μs 18.6578 KOps/s 18.6350 KOps/s $\color{#35bf28}+0.12\%$
test_step_mdp_speed[False-False-True-True-False] 67.9810μs 34.9859μs 28.5829 KOps/s 28.9887 KOps/s $\color{#d91a1a}-1.40\%$
test_step_mdp_speed[False-False-True-False-True] 63.5610μs 33.5316μs 29.8226 KOps/s 29.9511 KOps/s $\color{#d91a1a}-0.43\%$
test_step_mdp_speed[False-False-True-False-False] 54.1410μs 21.0100μs 47.5965 KOps/s 47.8179 KOps/s $\color{#d91a1a}-0.46\%$
test_step_mdp_speed[False-False-False-True-True] 92.9910μs 56.9669μs 17.5541 KOps/s 17.6752 KOps/s $\color{#d91a1a}-0.69\%$
test_step_mdp_speed[False-False-False-True-False] 71.0620μs 37.7735μs 26.4736 KOps/s 26.6853 KOps/s $\color{#d91a1a}-0.79\%$
test_step_mdp_speed[False-False-False-False-True] 73.1210μs 35.2756μs 28.3482 KOps/s 27.9719 KOps/s $\color{#35bf28}+1.35\%$
test_step_mdp_speed[False-False-False-False-False] 54.9710μs 23.1568μs 43.1838 KOps/s 43.2801 KOps/s $\color{#d91a1a}-0.22\%$
test_values[generalized_advantage_estimate-True-True] 20.8159ms 20.2845ms 49.2987 Ops/s 46.8936 Ops/s $\textbf{\color{#35bf28}+5.13\%}$
test_values[vec_generalized_advantage_estimate-True-True] 0.1213s 3.3318ms 300.1341 Ops/s 285.3080 Ops/s $\textbf{\color{#35bf28}+5.20\%}$
test_values[td0_return_estimate-False-False] 0.1007ms 78.5005μs 12.7388 KOps/s 12.4921 KOps/s $\color{#35bf28}+1.97\%$
test_values[td1_return_estimate-False-False] 48.3353ms 47.9446ms 20.8574 Ops/s 20.0506 Ops/s $\color{#35bf28}+4.02\%$
test_values[vec_td1_return_estimate-False-False] 1.3280ms 1.0916ms 916.1043 Ops/s 902.1853 Ops/s $\color{#35bf28}+1.54\%$
test_values[td_lambda_return_estimate-True-False] 78.7599ms 78.2850ms 12.7738 Ops/s 12.0284 Ops/s $\textbf{\color{#35bf28}+6.20\%}$
test_values[vec_td_lambda_return_estimate-True-False] 1.3222ms 1.0853ms 921.3962 Ops/s 903.9764 Ops/s $\color{#35bf28}+1.93\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 21.8886ms 21.2211ms 47.1229 Ops/s 46.2660 Ops/s $\color{#35bf28}+1.85\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0456ms 0.7539ms 1.3264 KOps/s 1.3085 KOps/s $\color{#35bf28}+1.36\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7536ms 0.6930ms 1.4430 KOps/s 1.4725 KOps/s $\color{#d91a1a}-2.01\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5446ms 1.4826ms 674.5032 Ops/s 669.6474 Ops/s $\color{#35bf28}+0.73\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7443ms 0.6894ms 1.4505 KOps/s 1.4290 KOps/s $\color{#35bf28}+1.50\%$
test_dqn_speed[False-None] 1.6146ms 1.4987ms 667.2309 Ops/s 657.8667 Ops/s $\color{#35bf28}+1.42\%$
test_dqn_speed[False-backward] 2.5257ms 2.1635ms 462.2117 Ops/s 460.3738 Ops/s $\color{#35bf28}+0.40\%$
test_dqn_speed[True-None] 1.1256ms 0.5866ms 1.7047 KOps/s 1.7333 KOps/s $\color{#d91a1a}-1.65\%$
test_dqn_speed[True-backward] 1.2002ms 1.1659ms 857.6917 Ops/s 773.8475 Ops/s $\textbf{\color{#35bf28}+10.83\%}$
test_dqn_speed[reduce-overhead-None] 0.6614ms 0.5937ms 1.6843 KOps/s 1.6362 KOps/s $\color{#35bf28}+2.94\%$
test_dqn_speed[reduce-overhead-backward] 1.0681ms 0.9941ms 1.0060 KOps/s 889.1625 Ops/s $\textbf{\color{#35bf28}+13.14\%}$
test_ddpg_speed[False-None] 3.1751ms 2.8928ms 345.6891 Ops/s 349.9325 Ops/s $\color{#d91a1a}-1.21\%$
test_ddpg_speed[False-backward] 4.6815ms 4.2198ms 236.9795 Ops/s 228.8700 Ops/s $\color{#35bf28}+3.54\%$
test_ddpg_speed[True-None] 1.4921ms 1.3794ms 724.9378 Ops/s 720.2147 Ops/s $\color{#35bf28}+0.66\%$
test_ddpg_speed[True-backward] 2.7667ms 2.5859ms 386.7100 Ops/s 365.5021 Ops/s $\textbf{\color{#35bf28}+5.80\%}$
test_ddpg_speed[reduce-overhead-None] 1.6233ms 1.4199ms 704.2698 Ops/s 705.9806 Ops/s $\color{#d91a1a}-0.24\%$
test_ddpg_speed[reduce-overhead-backward] 0.1836s 0.1816s 5.5060 Ops/s 4.4327 Ops/s $\textbf{\color{#35bf28}+24.21\%}$
test_sac_speed[False-None] 8.5118ms 7.9988ms 125.0192 Ops/s 121.9932 Ops/s $\color{#35bf28}+2.48\%$
test_sac_speed[False-backward] 11.7542ms 11.1134ms 89.9811 Ops/s 86.6988 Ops/s $\color{#35bf28}+3.79\%$
test_sac_speed[True-None] 2.1975ms 1.9588ms 510.5278 Ops/s 516.5225 Ops/s $\color{#d91a1a}-1.16\%$
test_sac_speed[True-backward] 4.3700ms 3.9008ms 256.3578 Ops/s 253.1766 Ops/s $\color{#35bf28}+1.26\%$
test_sac_speed[reduce-overhead-None] 20.0732ms 11.5139ms 86.8518 Ops/s 86.4822 Ops/s $\color{#35bf28}+0.43\%$
test_sac_speed[reduce-overhead-backward] 1.8711ms 1.7892ms 558.9240 Ops/s 546.9567 Ops/s $\color{#35bf28}+2.19\%$
test_redq_deprec_speed[False-None] 9.4155ms 8.9407ms 111.8482 Ops/s 108.6808 Ops/s $\color{#35bf28}+2.91\%$
test_redq_deprec_speed[False-backward] 12.8920ms 12.4506ms 80.3174 Ops/s 78.3582 Ops/s $\color{#35bf28}+2.50\%$
test_redq_deprec_speed[True-None] 2.6891ms 2.5497ms 392.2107 Ops/s 385.0941 Ops/s $\color{#35bf28}+1.85\%$
test_redq_deprec_speed[True-backward] 4.8311ms 4.5556ms 219.5081 Ops/s 214.9974 Ops/s $\color{#35bf28}+2.10\%$
test_redq_deprec_speed[reduce-overhead-None] 2.6536ms 2.5448ms 392.9506 Ops/s 381.8679 Ops/s $\color{#35bf28}+2.90\%$
test_redq_deprec_speed[reduce-overhead-backward] 5.0273ms 4.5712ms 218.7587 Ops/s 213.2945 Ops/s $\color{#35bf28}+2.56\%$
test_td3_speed[False-None] 7.9190ms 7.8638ms 127.1646 Ops/s 112.9125 Ops/s $\textbf{\color{#35bf28}+12.62\%}$
test_td3_speed[False-backward] 11.0980ms 10.6043ms 94.3011 Ops/s 90.0922 Ops/s $\color{#35bf28}+4.67\%$
test_td3_speed[True-None] 1.7320ms 1.7094ms 584.9979 Ops/s 573.1484 Ops/s $\color{#35bf28}+2.07\%$
test_td3_speed[True-backward] 3.5591ms 3.5061ms 285.2152 Ops/s 276.5782 Ops/s $\color{#35bf28}+3.12\%$
test_td3_speed[reduce-overhead-None] 49.6115ms 25.3702ms 39.4163 Ops/s 39.0912 Ops/s $\color{#35bf28}+0.83\%$
test_td3_speed[reduce-overhead-backward] 1.5429ms 1.4686ms 680.8984 Ops/s 665.5997 Ops/s $\color{#35bf28}+2.30\%$
test_cql_speed[False-None] 17.2780ms 16.6290ms 60.1359 Ops/s 58.8230 Ops/s $\color{#35bf28}+2.23\%$
test_cql_speed[False-backward] 23.2055ms 22.3228ms 44.7973 Ops/s 43.8205 Ops/s $\color{#35bf28}+2.23\%$
test_cql_speed[True-None] 3.7695ms 3.4302ms 291.5251 Ops/s 288.4697 Ops/s $\color{#35bf28}+1.06\%$
test_cql_speed[True-backward] 6.3233ms 5.9389ms 168.3805 Ops/s 164.9249 Ops/s $\color{#35bf28}+2.10\%$
test_cql_speed[reduce-overhead-None] 19.5801ms 12.4123ms 80.5654 Ops/s 79.1824 Ops/s $\color{#35bf28}+1.75\%$
test_cql_speed[reduce-overhead-backward] 1.8694ms 1.7718ms 564.4028 Ops/s 498.8728 Ops/s $\textbf{\color{#35bf28}+13.14\%}$
test_a2c_speed[False-None] 3.2652ms 3.1451ms 317.9526 Ops/s 309.1704 Ops/s $\color{#35bf28}+2.84\%$
test_a2c_speed[False-backward] 6.8601ms 6.2303ms 160.5047 Ops/s 151.8828 Ops/s $\textbf{\color{#35bf28}+5.68\%}$
test_a2c_speed[True-None] 1.4433ms 1.3506ms 740.4177 Ops/s 744.5496 Ops/s $\color{#d91a1a}-0.55\%$
test_a2c_speed[True-backward] 3.1214ms 3.0416ms 328.7721 Ops/s 302.8548 Ops/s $\textbf{\color{#35bf28}+8.56\%}$
test_a2c_speed[reduce-overhead-None] 15.4717ms 8.7029ms 114.9039 Ops/s 115.7394 Ops/s $\color{#d91a1a}-0.72\%$
test_a2c_speed[reduce-overhead-backward] 1.4796ms 1.4345ms 697.1249 Ops/s 619.7407 Ops/s $\textbf{\color{#35bf28}+12.49\%}$
test_ppo_speed[False-None] 3.9686ms 3.7428ms 267.1766 Ops/s 254.1609 Ops/s $\textbf{\color{#35bf28}+5.12\%}$
test_ppo_speed[False-backward] 7.3461ms 6.9025ms 144.8748 Ops/s 135.4139 Ops/s $\textbf{\color{#35bf28}+6.99\%}$
test_ppo_speed[True-None] 1.6779ms 1.4452ms 691.9273 Ops/s 675.9459 Ops/s $\color{#35bf28}+2.36\%$
test_ppo_speed[True-backward] 3.3493ms 3.2302ms 309.5744 Ops/s 291.8312 Ops/s $\textbf{\color{#35bf28}+6.08\%}$
test_ppo_speed[reduce-overhead-None] 1.5184ms 1.4227ms 702.8690 Ops/s 677.1112 Ops/s $\color{#35bf28}+3.80\%$
test_ppo_speed[reduce-overhead-backward] 3.3208ms 3.2174ms 310.8052 Ops/s 289.6321 Ops/s $\textbf{\color{#35bf28}+7.31\%}$
test_reinforce_speed[False-None] 2.3759ms 2.2471ms 445.0259 Ops/s 431.9217 Ops/s $\color{#35bf28}+3.03\%$
test_reinforce_speed[False-backward] 3.6720ms 3.2997ms 303.0582 Ops/s 288.9161 Ops/s $\color{#35bf28}+4.89\%$
test_reinforce_speed[True-None] 1.3891ms 1.2994ms 769.5599 Ops/s 753.1475 Ops/s $\color{#35bf28}+2.18\%$
test_reinforce_speed[True-backward] 3.5945ms 3.0558ms 327.2495 Ops/s 306.3360 Ops/s $\textbf{\color{#35bf28}+6.83\%}$
test_reinforce_speed[reduce-overhead-None] 18.7481ms 10.2460ms 97.5987 Ops/s 98.9401 Ops/s $\color{#d91a1a}-1.36\%$
test_reinforce_speed[reduce-overhead-backward] 1.5272ms 1.4829ms 674.3544 Ops/s 601.1309 Ops/s $\textbf{\color{#35bf28}+12.18\%}$
test_iql_speed[False-None] 9.8227ms 9.2020ms 108.6724 Ops/s 106.4367 Ops/s $\color{#35bf28}+2.10\%$
test_iql_speed[False-backward] 13.6100ms 13.1521ms 76.0335 Ops/s 73.1591 Ops/s $\color{#35bf28}+3.93\%$
test_iql_speed[True-None] 2.3851ms 2.2837ms 437.8894 Ops/s 431.8734 Ops/s $\color{#35bf28}+1.39\%$
test_iql_speed[True-backward] 5.4388ms 4.9867ms 200.5333 Ops/s 188.5995 Ops/s $\textbf{\color{#35bf28}+6.33\%}$
test_iql_speed[reduce-overhead-None] 18.0089ms 10.7001ms 93.4574 Ops/s 92.8917 Ops/s $\color{#35bf28}+0.61\%$
test_iql_speed[reduce-overhead-backward] 1.9839ms 1.8869ms 529.9828 Ops/s 504.7353 Ops/s $\textbf{\color{#35bf28}+5.00\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.4643ms 6.0624ms 164.9518 Ops/s 164.6583 Ops/s $\color{#35bf28}+0.18\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.7620ms 0.3601ms 2.7769 KOps/s 3.7670 KOps/s $\textbf{\color{#d91a1a}-26.28\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5645ms 0.2467ms 4.0528 KOps/s 4.0165 KOps/s $\color{#35bf28}+0.91\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.1182ms 5.8458ms 171.0635 Ops/s 171.9526 Ops/s $\color{#d91a1a}-0.52\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.8509ms 0.3183ms 3.1420 KOps/s 3.2749 KOps/s $\color{#d91a1a}-4.06\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5706ms 0.2925ms 3.4192 KOps/s 3.2370 KOps/s $\textbf{\color{#35bf28}+5.63\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.5064ms 1.2797ms 781.4425 Ops/s 739.7405 Ops/s $\textbf{\color{#35bf28}+5.64\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.4218ms 1.1772ms 849.4415 Ops/s 802.9278 Ops/s $\textbf{\color{#35bf28}+5.79\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.0371ms 5.9520ms 168.0114 Ops/s 165.7554 Ops/s $\color{#35bf28}+1.36\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.8302ms 0.4136ms 2.4180 KOps/s 2.2571 KOps/s $\textbf{\color{#35bf28}+7.13\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7844ms 0.4038ms 2.4762 KOps/s 2.1299 KOps/s $\textbf{\color{#35bf28}+16.26\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.8588ms 5.7514ms 173.8703 Ops/s 173.4900 Ops/s $\color{#35bf28}+0.22\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8585ms 0.2711ms 3.6886 KOps/s 3.7776 KOps/s $\color{#d91a1a}-2.36\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4433ms 0.2456ms 4.0723 KOps/s 4.0736 KOps/s $\color{#d91a1a}-0.03\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.0664ms 5.6938ms 175.6295 Ops/s 173.9885 Ops/s $\color{#35bf28}+0.94\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9272ms 0.2897ms 3.4518 KOps/s 3.2941 KOps/s $\color{#35bf28}+4.79\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4630ms 0.2812ms 3.5566 KOps/s 3.8110 KOps/s $\textbf{\color{#d91a1a}-6.68\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.0896ms 5.8774ms 170.1419 Ops/s 168.0285 Ops/s $\color{#35bf28}+1.26\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.3648ms 0.4370ms 2.2884 KOps/s 1.9681 KOps/s $\textbf{\color{#35bf28}+16.27\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6428ms 0.4348ms 2.3001 KOps/s 2.5411 KOps/s $\textbf{\color{#d91a1a}-9.49\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.1439ms 5.5298ms 180.8374 Ops/s 51.2350 Ops/s $\textbf{\color{#35bf28}+252.96\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.2023ms 2.1249ms 470.6005 Ops/s 464.7495 Ops/s $\color{#35bf28}+1.26\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.8430ms 1.2717ms 786.3777 Ops/s 785.7819 Ops/s $\color{#35bf28}+0.08\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.5395ms 5.6471ms 177.0830 Ops/s 179.4150 Ops/s $\color{#d91a1a}-1.30\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.1765ms 2.1142ms 472.9874 Ops/s 467.0813 Ops/s $\color{#35bf28}+1.26\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 2.0083ms 0.9970ms 1.0030 KOps/s 836.4391 Ops/s $\textbf{\color{#35bf28}+19.91\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.5053s 15.8322ms 63.1623 Ops/s 170.3241 Ops/s $\textbf{\color{#d91a1a}-62.92\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 11.2164ms 2.3026ms 434.2918 Ops/s 441.3880 Ops/s $\color{#d91a1a}-1.61\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.2963ms 1.3138ms 761.1620 Ops/s 677.4053 Ops/s $\textbf{\color{#35bf28}+12.36\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 60.1936ms 58.2059ms 17.1804 Ops/s 17.2393 Ops/s $\color{#d91a1a}-0.34\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.6998ms 16.9871ms 58.8682 Ops/s 59.0651 Ops/s $\color{#d91a1a}-0.33\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 60.6147ms 57.9844ms 17.2460 Ops/s 17.2381 Ops/s $\color{#35bf28}+0.05\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.3629ms 16.8767ms 59.2532 Ops/s 58.0446 Ops/s $\color{#35bf28}+2.08\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 62.8576ms 59.7534ms 16.7354 Ops/s 16.7031 Ops/s $\color{#35bf28}+0.19\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 19.3530ms 18.0736ms 55.3292 Ops/s 54.3288 Ops/s $\color{#35bf28}+1.84\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. versioning Versioning change (version number etc)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants