[Feature] param_count #1046
Merged
vmoens added a commit that referenced this pull request on Oct 17, 2024:
ghstack-source-id: af2b55a96550ef9f42dcf14fa5dbf4b62873f85c
Pull Request resolved: #1046
facebook-github-bot added the "CLA Signed" label on Oct 17, 2024. This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
vmoens added a commit that referenced this pull request on Oct 17, 2024:
ghstack-source-id: af38095667859b9bea8d2e1bbd9d482b88db8c62
Pull Request resolved: #1046
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 48.8220μs | 25.6414μs | 38.9995 KOps/s | 41.6930 KOps/s | |
test_plain_set_stack_nested | 61.4360μs | 25.7997μs | 38.7601 KOps/s | 41.5250 KOps/s | |
test_plain_set_nested_inplace | 80.3810μs | 27.6109μs | 36.2176 KOps/s | 37.9270 KOps/s | |
test_plain_set_stack_nested_inplace | 83.6770μs | 27.7725μs | 36.0068 KOps/s | 38.0282 KOps/s | |
test_items | 22.8730μs | 4.1404μs | 241.5240 KOps/s | 237.4600 KOps/s | |
test_items_nested | 0.6233ms | 0.3822ms | 2.6161 KOps/s | 2.6349 KOps/s | |
test_items_nested_locked | 0.8096ms | 0.3816ms | 2.6206 KOps/s | 2.6227 KOps/s | |
test_items_nested_leaf | 0.1494ms | 79.9031μs | 12.5152 KOps/s | 12.4742 KOps/s | |
test_items_stack_nested | 0.5518ms | 0.3857ms | 2.5927 KOps/s | 2.6005 KOps/s | |
test_items_stack_nested_leaf | 0.1492ms | 83.3744μs | 11.9941 KOps/s | 12.0642 KOps/s | |
test_items_stack_nested_locked | 0.5793ms | 0.3849ms | 2.5978 KOps/s | 2.6180 KOps/s | |
test_keys | 24.9570μs | 3.5265μs | 283.5687 KOps/s | 281.4258 KOps/s | |
test_keys_nested | 0.2233ms | 0.1317ms | 7.5915 KOps/s | 7.5839 KOps/s | |
test_keys_nested_locked | 0.7608ms | 0.1383ms | 7.2292 KOps/s | 7.2587 KOps/s | |
test_keys_nested_leaf | 0.1960ms | 0.1158ms | 8.6335 KOps/s | 8.6713 KOps/s | |
test_keys_stack_nested | 0.2216ms | 0.1338ms | 7.4730 KOps/s | 7.6776 KOps/s | |
test_keys_stack_nested_leaf | 0.1956ms | 0.1154ms | 8.6656 KOps/s | 8.8020 KOps/s | |
test_keys_stack_nested_locked | 0.2310ms | 0.1377ms | 7.2640 KOps/s | 7.3809 KOps/s | |
test_values | 7.5262μs | 1.0441μs | 957.7630 KOps/s | 933.3039 KOps/s | |
test_values_nested | 0.1571ms | 93.8316μs | 10.6574 KOps/s | 10.4633 KOps/s | |
test_values_nested_locked | 0.1509ms | 94.0605μs | 10.6315 KOps/s | 10.5585 KOps/s | |
test_values_nested_leaf | 0.1419ms | 80.2186μs | 12.4659 KOps/s | 12.3712 KOps/s | |
test_values_stack_nested | 0.1525ms | 94.6456μs | 10.5657 KOps/s | 10.5919 KOps/s | |
test_values_stack_nested_leaf | 0.1385ms | 80.0150μs | 12.4977 KOps/s | 12.2763 KOps/s | |
test_values_stack_nested_locked | 0.1540ms | 94.6535μs | 10.5648 KOps/s | 10.6281 KOps/s | |
test_membership | 6.8940μs | 0.7242μs | 1.3809 MOps/s | 1.0627 MOps/s | |
test_membership_nested | 46.0460μs | 2.7956μs | 357.7065 KOps/s | 339.7938 KOps/s | |
test_membership_nested_leaf | 38.2420μs | 2.8191μs | 354.7250 KOps/s | 360.0475 KOps/s | |
test_membership_stacked_nested | 51.0360μs | 2.8478μs | 351.1448 KOps/s | 358.6675 KOps/s | |
test_membership_stacked_nested_leaf | 29.6050μs | 2.8261μs | 353.8409 KOps/s | 359.2637 KOps/s | |
test_membership_nested_last | 24.6060μs | 4.3187μs | 231.5524 KOps/s | 238.9637 KOps/s | |
test_membership_nested_leaf_last | 54.4920μs | 4.2584μs | 234.8292 KOps/s | 237.8064 KOps/s | |
test_membership_stacked_nested_last | 27.5120μs | 4.2576μs | 234.8748 KOps/s | 237.6538 KOps/s | |
test_membership_stacked_nested_leaf_last | 45.7360μs | 4.2606μs | 234.7104 KOps/s | 236.3491 KOps/s | |
test_nested_getleaf | 54.4220μs | 10.5931μs | 94.4012 KOps/s | 88.6652 KOps/s | |
test_nested_get | 35.8880μs | 10.0452μs | 99.5505 KOps/s | 97.3936 KOps/s | |
test_stacked_getleaf | 55.0330μs | 10.3045μs | 97.0447 KOps/s | 93.3717 KOps/s | |
test_stacked_get | 39.7480μs | 10.0124μs | 99.8759 KOps/s | 98.6953 KOps/s | |
test_nested_getitemleaf | 34.6950μs | 11.0080μs | 90.8427 KOps/s | 90.8711 KOps/s | |
test_nested_getitem | 57.1470μs | 10.3721μs | 96.4125 KOps/s | 97.7874 KOps/s | |
test_stacked_getitemleaf | 55.9750μs | 10.9900μs | 90.9920 KOps/s | 92.8658 KOps/s | |
test_stacked_getitem | 33.1920μs | 10.2844μs | 97.2347 KOps/s | 96.6230 KOps/s | |
test_lock_nested | 91.5322ms | 0.6148ms | 1.6264 KOps/s | 1.9492 KOps/s | |
test_lock_stack_nested | 0.7163ms | 0.4812ms | 2.0783 KOps/s | 2.1357 KOps/s | |
test_unlock_nested | 95.5112ms | 0.5300ms | 1.8870 KOps/s | 2.3767 KOps/s | |
test_unlock_stack_nested | 0.7427ms | 0.3943ms | 2.5364 KOps/s | 2.6040 KOps/s | |
test_flatten_speed | 0.1814ms | 0.1016ms | 9.8468 KOps/s | 10.0935 KOps/s | |
test_unflatten_speed | 0.7456ms | 0.5207ms | 1.9206 KOps/s | 1.9376 KOps/s | |
test_common_ops | 7.4143ms | 1.1891ms | 840.9914 Ops/s | 867.7400 Ops/s | |
test_creation | 51.5770μs | 2.0943μs | 477.4804 KOps/s | 470.4581 KOps/s | |
test_creation_empty | 62.7380μs | 20.6125μs | 48.5144 KOps/s | 56.1023 KOps/s | |
test_creation_nested_1 | 51.6370μs | 23.1717μs | 43.1561 KOps/s | 45.7678 KOps/s | |
test_creation_nested_2 | 1.2797ms | 27.3596μs | 36.5503 KOps/s | 39.3143 KOps/s | |
test_clone | 63.7200μs | 17.8387μs | 56.0578 KOps/s | 57.0088 KOps/s | |
test_getitem[int] | 0.9205ms | 16.8327μs | 59.4082 KOps/s | 61.8655 KOps/s | |
test_getitem[slice_int] | 0.1580ms | 30.4573μs | 32.8328 KOps/s | 33.9266 KOps/s | |
test_getitem[range] | 0.1772ms | 58.0292μs | 17.2327 KOps/s | 17.4659 KOps/s | |
test_getitem[tuple] | 0.1399ms | 25.2296μs | 39.6359 KOps/s | 40.1450 KOps/s | |
test_getitem[list] | 0.1818ms | 53.6871μs | 18.6264 KOps/s | 18.8931 KOps/s | |
test_setitem_dim[int] | 58.3000μs | 33.6064μs | 29.7562 KOps/s | 31.0129 KOps/s | |
test_setitem_dim[slice_int] | 0.1301ms | 61.6267μs | 16.2267 KOps/s | 16.7028 KOps/s | |
test_setitem_dim[range] | 0.1994ms | 86.6965μs | 11.5345 KOps/s | 11.9056 KOps/s | |
test_setitem_dim[tuple] | 90.5000μs | 49.3771μs | 20.2523 KOps/s | 20.6041 KOps/s | |
test_setitem | 0.4265ms | 32.1245μs | 31.1289 KOps/s | 33.2234 KOps/s | |
test_set | 0.1314ms | 31.8692μs | 31.3782 KOps/s | 33.5665 KOps/s | |
test_set_shared | 1.0793ms | 0.2173ms | 4.6023 KOps/s | 4.6592 KOps/s | |
test_update | 0.1537ms | 40.2807μs | 24.8258 KOps/s | 25.8646 KOps/s | |
test_update_nested | 0.3091ms | 51.2739μs | 19.5031 KOps/s | 20.1074 KOps/s | |
test_update__nested | 1.0437ms | 46.0027μs | 21.7379 KOps/s | 22.4158 KOps/s | |
test_set_nested | 0.1151ms | 34.3694μs | 29.0957 KOps/s | 30.3790 KOps/s | |
test_set_nested_new | 81.5430μs | 39.1046μs | 25.5725 KOps/s | 25.8954 KOps/s | |
test_select | 0.1138ms | 57.0400μs | 17.5315 KOps/s | 17.6533 KOps/s | |
test_select_nested | 0.1354ms | 59.1329μs | 16.9111 KOps/s | 16.7028 KOps/s | |
test_exclude_nested | 0.3522ms | 75.7235μs | 13.2059 KOps/s | 13.4284 KOps/s | |
test_empty[True] | 0.4835ms | 0.3529ms | 2.8333 KOps/s | 2.8966 KOps/s | |
test_empty[False] | 11.5340μs | 1.2349μs | 809.7969 KOps/s | 800.1657 KOps/s | |
test_unbind_speed | 0.5116ms | 0.3076ms | 3.2510 KOps/s | 3.3284 KOps/s | |
test_unbind_speed_stack0 | 0.5958ms | 0.3039ms | 3.2907 KOps/s | 3.4621 KOps/s | |
test_unbind_speed_stack1 | 91.7951ms | 0.8748ms | 1.1432 KOps/s | 1.3809 KOps/s | |
test_split | 90.4751ms | 2.1536ms | 464.3459 Ops/s | 470.2255 Ops/s | |
test_chunk | 3.1992ms | 1.9993ms | 500.1745 Ops/s | 463.3342 Ops/s | |
test_creation[device0] | 3.7000ms | 0.1196ms | 8.3580 KOps/s | 8.3425 KOps/s | |
test_creation_from_tensor | 0.2304ms | 0.1177ms | 8.4933 KOps/s | 8.6605 KOps/s | |
test_add_one[memmap_tensor0] | 78.5870μs | 7.5596μs | 132.2819 KOps/s | 139.1965 KOps/s | |
test_contiguous[memmap_tensor0] | 33.1730μs | 2.0013μs | 499.6646 KOps/s | 507.9896 KOps/s | |
test_stack[memmap_tensor0] | 89.9490μs | 5.6879μs | 175.8122 KOps/s | 180.0352 KOps/s | |
test_memmaptd_index | 1.1491ms | 0.4157ms | 2.4053 KOps/s | 2.4648 KOps/s | |
test_memmaptd_index_astensor | 0.8895ms | 0.5158ms | 1.9388 KOps/s | 1.9777 KOps/s | |
test_memmaptd_index_op | 94.8327ms | 1.2292ms | 813.5219 Ops/s | 946.9290 Ops/s | |
test_serialize_model | 0.1278s | 0.1162s | 8.6027 Ops/s | 8.4626 Ops/s | |
test_serialize_model_pickle | 0.4568s | 0.3930s | 2.5445 Ops/s | 2.5565 Ops/s | |
test_serialize_weights | 0.1252s | 0.1166s | 8.5769 Ops/s | 7.6621 Ops/s | |
test_serialize_weights_returnearly | 0.1796s | 0.1641s | 6.0931 Ops/s | 6.3332 Ops/s | |
test_serialize_weights_pickle | 0.4994s | 0.4148s | 2.4109 Ops/s | 2.5101 Ops/s | |
test_serialize_weights_filesystem | 0.1438s | 0.1399s | 7.1492 Ops/s | 7.1790 Ops/s | |
test_serialize_model_filesystem | 0.2400s | 0.1630s | 6.1340 Ops/s | 6.5694 Ops/s | |
test_reshape_pytree | 99.2760μs | 39.1445μs | 25.5464 KOps/s | 25.7030 KOps/s | |
test_reshape_td | 0.1295ms | 46.9616μs | 21.2940 KOps/s | 21.9121 KOps/s | |
test_view_pytree | 0.1004ms | 38.3741μs | 26.0592 KOps/s | 25.9832 KOps/s | |
test_view_td | 0.1059ms | 52.4748μs | 19.0568 KOps/s | 18.8961 KOps/s | |
test_unbind_pytree | 98.1640μs | 36.3421μs | 27.5163 KOps/s | 27.7474 KOps/s | |
test_unbind_td | 0.3275ms | 45.6265μs | 21.9171 KOps/s | 21.7871 KOps/s | |
test_split_pytree | 0.1021ms | 38.1412μs | 26.2184 KOps/s | 26.4677 KOps/s | |
test_split_td | 0.4879ms | 56.9537μs | 17.5581 KOps/s | 17.7762 KOps/s | |
test_add_pytree | 92.3440μs | 44.9218μs | 22.2609 KOps/s | 21.8743 KOps/s | |
test_add_td | 0.1743ms | 88.9120μs | 11.2471 KOps/s | 11.2788 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1637ms | 57.9388μs | 17.2596 KOps/s | 16.9675 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 1.1279ms | 0.1971ms | 5.0746 KOps/s | 4.8985 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1455ms | 56.6316μs | 17.6580 KOps/s | 17.5509 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3211ms | 0.1435ms | 6.9699 KOps/s | 7.0368 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 62.3870μs | 23.1911μs | 43.1199 KOps/s | 42.4397 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1530ms | 74.9466μs | 13.3428 KOps/s | 13.0445 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1760ms | 74.8558μs | 13.3590 KOps/s | 13.0444 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1336ms | 66.9213μs | 14.9429 KOps/s | 14.5038 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3822ms | 0.1850ms | 5.4051 KOps/s | 5.4461 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.5367ms | 0.2397ms | 4.1716 KOps/s | 4.1473 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1063ms | 46.5015μs | 21.5047 KOps/s | 19.8067 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4632ms | 77.2799μs | 12.9400 KOps/s | 12.5349 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3524ms | 0.1765ms | 5.6657 KOps/s | 5.7047 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4285ms | 0.2920ms | 3.4247 KOps/s | 3.5102 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.5852ms | 0.2770ms | 3.6100 KOps/s | 3.6370 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4356ms | 0.1862ms | 5.3696 KOps/s | 5.5248 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2507ms | 75.7289μs | 13.2050 KOps/s | 13.6535 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1148ms | 48.2796μs | 20.7127 KOps/s | 20.5782 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3284ms | 0.2357ms | 4.2430 KOps/s | 4.3194 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3271ms | 0.1780ms | 5.6179 KOps/s | 5.7312 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2463ms | 0.1143ms | 8.7504 KOps/s | 8.8014 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1518ms | 78.0637μs | 12.8101 KOps/s | 12.7078 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1658ms | 77.6223μs | 12.8829 KOps/s | 12.8330 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.5710ms | 73.6960μs | 13.5693 KOps/s | 14.3019 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3353ms | 0.1956ms | 5.1117 KOps/s | 5.1106 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.0397ms | 1.7546ms | 569.9343 Ops/s | 565.5969 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2835ms | 0.1909ms | 5.2376 KOps/s | 5.2506 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.3177ms | 1.1190ms | 893.6700 Ops/s | 906.7981 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.7393ms | 0.4197ms | 2.3824 KOps/s | 2.4059 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.3862ms | 4.1757ms | 239.4819 Ops/s | 247.4070 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 91.2920μs | 34.4504μs | 29.0273 KOps/s | 29.2963 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.0432ms | 49.2592μs | 20.3008 KOps/s | 20.6709 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 75.7320μs | 29.9050μs | 33.4392 KOps/s | 33.3741 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 94.6880μs | 29.5423μs | 33.8498 KOps/s | 34.4854 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 76.0530μs | 29.4761μs | 33.9257 KOps/s | 33.5468 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 85.1200μs | 29.6533μs | 33.7231 KOps/s | 34.3782 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1967ms | 74.3289μs | 13.4537 KOps/s | 13.7469 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5702ms | 27.5923μs | 36.2420 KOps/s | 35.2391 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1556ms | 69.1617μs | 14.4589 KOps/s | 14.7253 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 96.0830μs | 23.8819μs | 41.8728 KOps/s | 43.6333 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1306ms | 68.6728μs | 14.5618 KOps/s | 14.6932 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 68.3280μs | 23.2339μs | 43.0406 KOps/s | 43.0904 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1716ms | 73.5617μs | 13.5940 KOps/s | 13.7719 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0101ms | 27.3723μs | 36.5333 KOps/s | 36.6035 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1341ms | 69.0374μs | 14.4849 KOps/s | 14.7601 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 77.7160μs | 23.2458μs | 43.0186 KOps/s | 43.6697 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1458ms | 68.4976μs | 14.5991 KOps/s | 14.8734 KOps/s | |
test_compile_indexing[int-pytree-eager] | 77.0050μs | 23.1633μs | 43.1717 KOps/s | 43.2627 KOps/s | |
test_mod_add[eager] | 76.8840μs | 25.9941μs | 38.4703 KOps/s | 39.9396 KOps/s | |
test_mod_add[compile] | 86.3020μs | 37.3143μs | 26.7994 KOps/s | 25.4535 KOps/s | |
test_mod_add[compile-overhead] | 81.9540μs | 37.6139μs | 26.5859 KOps/s | 26.1235 KOps/s | |
test_mod_wrap[eager] | 0.3485ms | 0.2124ms | 4.7081 KOps/s | 4.6707 KOps/s | |
test_mod_wrap[compile] | 0.3382ms | 0.2296ms | 4.3554 KOps/s | 4.2184 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3581ms | 0.2287ms | 4.3721 KOps/s | 4.2829 KOps/s | |
test_mod_wrap_and_backward[eager] | 11.9249ms | 10.7052ms | 93.4126 Ops/s | 83.6627 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.3332ms | 10.8645ms | 92.0432 Ops/s | 82.0007 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 14.5373ms | 12.2347ms | 81.7350 Ops/s | 83.3061 Ops/s | |
test_seq_add[eager] | 0.1720ms | 96.0594μs | 10.4102 KOps/s | 10.4859 KOps/s | |
test_seq_add[compile] | 0.1363ms | 63.7042μs | 15.6975 KOps/s | 14.7456 KOps/s | |
test_seq_add[compile-overhead] | 0.1816ms | 63.5048μs | 15.7468 KOps/s | 15.5296 KOps/s | |
test_seq_wrap[eager] | 0.5693ms | 0.3925ms | 2.5479 KOps/s | 2.5221 KOps/s | |
test_seq_wrap[compile] | 0.4329ms | 0.2665ms | 3.7527 KOps/s | 3.6850 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4435ms | 0.2694ms | 3.7118 KOps/s | 3.6912 KOps/s | |
test_func_call_runtime[False-eager] | 0.8408ms | 0.5301ms | 1.8866 KOps/s | 1.8881 KOps/s | |
test_func_call_runtime[False-compile] | 0.9320ms | 0.5058ms | 1.9771 KOps/s | 1.9945 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.9113ms | 0.5052ms | 1.9795 KOps/s | 1.9746 KOps/s | |
test_func_call_runtime[True-eager] | 0.9422ms | 0.7494ms | 1.3345 KOps/s | 1.3179 KOps/s | |
test_func_call_runtime[True-compile] | 0.6223ms | 0.5162ms | 1.9374 KOps/s | 1.9441 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8274ms | 0.5192ms | 1.9259 KOps/s | 1.9385 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8166ms | 0.5325ms | 1.8779 KOps/s | 1.8322 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.7421ms | 0.5047ms | 1.9814 KOps/s | 1.9795 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6341ms | 0.5027ms | 1.9892 KOps/s | 1.9847 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0945ms | 0.9038ms | 1.1065 KOps/s | 1.0868 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0177ms | 0.7496ms | 1.3341 KOps/s | 1.3103 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9750ms | 0.7522ms | 1.3294 KOps/s | 1.2970 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5854ms | 1.9232ms | 519.9594 Ops/s | 511.2667 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 3.7487ms | 1.9997ms | 500.0766 Ops/s | 497.3465 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 3.0798ms | 1.9680ms | 508.1326 Ops/s | 499.7846 Ops/s | |
test_distributed | 0.3129ms | 0.1274ms | 7.8508 KOps/s | 7.6343 KOps/s | |
test_tdmodule | 34.7250μs | 18.4364μs | 54.2405 KOps/s | 57.0201 KOps/s | |
test_tdmodule_dispatch | 77.3950μs | 38.4327μs | 26.0195 KOps/s | 28.0658 KOps/s | |
test_tdseq | 52.9400μs | 22.4271μs | 44.5888 KOps/s | 49.1079 KOps/s | |
test_tdseq_dispatch | 76.6640μs | 43.6624μs | 22.9030 KOps/s | 24.2607 KOps/s | |
test_instantiation_functorch | 1.8532ms | 1.5650ms | 638.9902 Ops/s | 626.1310 Ops/s | |
test_exec_functorch | 0.3344ms | 0.1840ms | 5.4342 KOps/s | 5.4197 KOps/s | |
test_exec_functional_call | 0.4123ms | 0.1798ms | 5.5630 KOps/s | 5.7086 KOps/s | |
test_exec_td_decorator | 0.5203ms | 0.2337ms | 4.2792 KOps/s | 4.2493 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8857ms | 0.6472ms | 1.5452 KOps/s | 1.5384 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9690ms | 0.6486ms | 1.5418 KOps/s | 1.5177 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8376ms | 0.5379ms | 1.8591 KOps/s | 1.8694 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8255ms | 0.5368ms | 1.8629 KOps/s | 1.8723 KOps/s | |
test_to_module_speed[True] | 1.7350ms | 1.4008ms | 713.8698 Ops/s | 701.0404 Ops/s | |
test_to_module_speed[False] | 1.9547ms | 1.3872ms | 720.8668 Ops/s | 719.6153 Ops/s | |
test_tc_init | 0.1014ms | 48.4176μs | 20.6537 KOps/s | 21.4388 KOps/s | |
test_tc_init_nested | 0.1454ms | 94.7364μs | 10.5556 KOps/s | 10.7645 KOps/s | |
test_tc_first_layer_tensor | 17.7230μs | 1.5105μs | 662.0532 KOps/s | 650.5185 KOps/s | |
test_tc_first_layer_nontensor | 34.3640μs | 4.7247μs | 211.6543 KOps/s | 214.5850 KOps/s | |
test_tc_second_layer_tensor | 36.5380μs | 2.7926μs | 358.0913 KOps/s | 353.5425 KOps/s | |
test_tc_second_layer_nontensor | 40.8770μs | 6.0733μs | 164.6560 KOps/s | 166.0406 KOps/s | |
test_unbind | 0.4773s | 13.6631ms | 73.1898 Ops/s | 75.1053 Ops/s | |
test_full_like | 20.8382ms | 12.5291ms | 79.8140 Ops/s | 126.5416 Ops/s | |
test_zeros_like | 13.2621ms | 7.6261ms | 131.1292 Ops/s | 330.3176 Ops/s | |
test_ones_like | 13.8942ms | 7.5503ms | 132.4453 Ops/s | 292.0269 Ops/s | |
test_clone | 13.4456ms | 9.1544ms | 109.2373 Ops/s | 188.7171 Ops/s | |
test_squeeze | 72.2950μs | 12.4450μs | 80.3536 KOps/s | 79.0768 KOps/s | |
test_unsqueeze | 0.3326ms | 93.4042μs | 10.7062 KOps/s | 10.8748 KOps/s | |
test_split | 0.3330ms | 0.1917ms | 5.2160 KOps/s | 5.1213 KOps/s | |
test_permute | 0.3719ms | 0.2193ms | 4.5591 KOps/s | 4.5931 KOps/s | |
test_stack | 28.6057ms | 25.0127ms | 39.9797 Ops/s | 38.9500 Ops/s | |
test_cat | 30.5426ms | 25.1647ms | 39.7382 Ops/s | 39.3135 Ops/s |
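
The Change column is empty in this capture, so any regression or speed-up has to be read off the Ops and Ops on Repo HEAD columns by hand. The sketch below shows one way to derive a relative change from two such cells. The helper names `parse_ops` and `relative_change` are hypothetical, and the sign convention (positive means the PR is faster than HEAD) is an assumption, not necessarily what the benchmark bot reports.

```python
# Scale factors for the throughput units that appear in the table.
_UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '38.9995 KOps/s' to operations per second."""
    value, unit = cell.strip().split()
    return float(value) * _UNIT_SCALE[unit]


def relative_change(ops_pr: str, ops_head: str) -> float:
    """Signed fractional change of the PR throughput relative to repo HEAD.

    Assumed convention: positive means the PR is faster than HEAD.
    """
    pr, head = parse_ops(ops_pr), parse_ops(ops_head)
    return (pr - head) / head


if __name__ == "__main__":
    # Values copied from the test_plain_set_nested row above.
    change = relative_change("38.9995 KOps/s", "41.6930 KOps/s")
    print(f"test_plain_set_nested: {change:+.1%}")
```

Converting both cells to a common unit first matters because the two columns can differ in unit within the table (Ops/s, KOps/s, MOps/s). Single-run differences of a few percent may well be noise on shared CI hardware.
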
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 51.6210μs | 17.3381μs | 57.6763 KOps/s | 62.4774 KOps/s | |
test_plain_set_stack_nested | 56.0510μs | 17.2691μs | 57.9070 KOps/s | 61.9188 KOps/s | |
test_plain_set_nested_inplace | 47.5700μs | 18.4344μs | 54.2464 KOps/s | 57.8379 KOps/s | |
test_plain_set_stack_nested_inplace | 44.5700μs | 18.3691μs | 54.4392 KOps/s | 57.8805 KOps/s | |
test_items | 43.4410μs | 2.8833μs | 346.8207 KOps/s | 346.6854 KOps/s | |
test_items_nested | 0.4123ms | 0.3421ms | 2.9228 KOps/s | 2.9592 KOps/s | |
test_items_nested_locked | 0.3860ms | 0.3414ms | 2.9294 KOps/s | 2.9640 KOps/s | |
test_items_nested_leaf | 94.9210μs | 62.5497μs | 15.9873 KOps/s | 16.0616 KOps/s | |
test_items_stack_nested | 0.5322ms | 0.3419ms | 2.9246 KOps/s | 2.9549 KOps/s | |
test_items_stack_nested_leaf | 91.5310μs | 63.0526μs | 15.8598 KOps/s | 15.9794 KOps/s | |
test_items_stack_nested_locked | 0.3922ms | 0.3392ms | 2.9482 KOps/s | 2.9376 KOps/s | |
test_keys | 29.7800μs | 3.4525μs | 289.6420 KOps/s | 290.8378 KOps/s | |
test_keys_nested | 0.1085ms | 71.0155μs | 14.0814 KOps/s | 13.7272 KOps/s | |
test_keys_nested_locked | 0.7614ms | 79.0278μs | 12.6538 KOps/s | 12.6914 KOps/s | |
test_keys_nested_leaf | 98.7410μs | 63.4825μs | 15.7524 KOps/s | 15.7987 KOps/s | |
test_keys_stack_nested | 0.1038ms | 72.8577μs | 13.7254 KOps/s | 13.7285 KOps/s | |
test_keys_stack_nested_leaf | 0.1100ms | 63.9020μs | 15.6490 KOps/s | 15.6770 KOps/s | |
test_keys_stack_nested_locked | 0.1140ms | 78.6829μs | 12.7092 KOps/s | 12.7435 KOps/s | |
test_values | 6.3083μs | 0.8421μs | 1.1876 MOps/s | 1.2005 MOps/s | |
test_values_nested | 83.9110μs | 48.8652μs | 20.4645 KOps/s | 20.4135 KOps/s | |
test_values_nested_locked | 76.3710μs | 50.8713μs | 19.6575 KOps/s | 19.9241 KOps/s | |
test_values_nested_leaf | 66.3410μs | 42.6882μs | 23.4257 KOps/s | 23.5901 KOps/s | |
test_values_stack_nested | 96.7110μs | 49.1366μs | 20.3514 KOps/s | 20.3296 KOps/s | |
test_values_stack_nested_leaf | 84.4610μs | 43.1487μs | 23.1757 KOps/s | 22.9926 KOps/s | |
test_values_stack_nested_locked | 89.7810μs | 50.8784μs | 19.6547 KOps/s | 19.7799 KOps/s | |
test_membership | 1.8241μs | 0.5038μs | 1.9849 MOps/s | 1.9952 MOps/s | |
test_membership_nested | 13.2905μs | 1.8867μs | 530.0228 KOps/s | 530.9112 KOps/s | |
test_membership_nested_leaf | 10.7503μs | 1.8470μs | 541.4129 KOps/s | 533.1457 KOps/s | |
test_membership_stacked_nested | 78.9410μs | 1.9059μs | 524.6896 KOps/s | 504.0141 KOps/s | |
test_membership_stacked_nested_leaf | 0.1076ms | 1.9134μs | 522.6230 KOps/s | 510.5743 KOps/s | |
test_membership_nested_last | 41.2700μs | 2.9629μs | 337.5110 KOps/s | 329.5645 KOps/s | |
test_membership_nested_leaf_last | 30.5700μs | 3.0007μs | 333.2522 KOps/s | 334.2300 KOps/s | |
test_membership_stacked_nested_last | 29.6110μs | 2.9466μs | 339.3748 KOps/s | 335.2004 KOps/s | |
test_membership_stacked_nested_leaf_last | 25.5700μs | 2.9602μs | 337.8194 KOps/s | 338.7315 KOps/s | |
test_nested_getleaf | 31.1700μs | 6.0853μs | 164.3303 KOps/s | 166.0193 KOps/s | |
test_nested_get | 33.5800μs | 5.7668μs | 173.4073 KOps/s | 175.0551 KOps/s | |
test_stacked_getleaf | 25.7000μs | 6.0777μs | 164.5362 KOps/s | 165.4091 KOps/s | |
test_stacked_get | 26.8500μs | 5.6531μs | 176.8932 KOps/s | 175.9218 KOps/s | |
test_nested_getitemleaf | 58.3300μs | 6.1392μs | 162.8887 KOps/s | 161.2747 KOps/s | |
test_nested_getitem | 29.5400μs | 5.7425μs | 174.1391 KOps/s | 173.4080 KOps/s | |
test_stacked_getitemleaf | 38.2500μs | 6.0730μs | 164.6627 KOps/s | 163.2837 KOps/s | |
test_stacked_getitem | 33.0010μs | 5.6618μs | 176.6214 KOps/s | 176.7077 KOps/s | |
test_lock_nested | 0.8380ms | 0.4218ms | 2.3709 KOps/s | 2.3342 KOps/s | |
test_lock_stack_nested | 0.4283ms | 0.3903ms | 2.5620 KOps/s | 2.5439 KOps/s | |
test_unlock_nested | 0.7804ms | 0.3616ms | 2.7654 KOps/s | 2.7273 KOps/s | |
test_unlock_stack_nested | 0.3611ms | 0.3290ms | 3.0392 KOps/s | 3.0169 KOps/s | |
test_flatten_speed | 0.1653ms | 75.6984μs | 13.2103 KOps/s | 13.0061 KOps/s | |
test_unflatten_speed | 0.3719ms | 0.3234ms | 3.0925 KOps/s | 3.0990 KOps/s | |
test_common_ops | 1.6708ms | 1.2719ms | 786.2114 Ops/s | 825.6015 Ops/s | |
test_creation | 29.6100μs | 1.4905μs | 670.9299 KOps/s | 662.1473 KOps/s | |
test_creation_empty | 54.5210μs | 16.7207μs | 59.8063 KOps/s | 70.7725 KOps/s | |
test_creation_nested_1 | 65.7310μs | 18.5789μs | 53.8246 KOps/s | 62.7254 KOps/s | |
test_creation_nested_2 | 52.9900μs | 21.0279μs | 47.5559 KOps/s | 54.2284 KOps/s | |
test_clone | 57.5800μs | 28.4411μs | 35.1604 KOps/s | 35.9389 KOps/s | |
test_getitem[int] | 1.1201ms | 15.4981μs | 64.5240 KOps/s | 64.4009 KOps/s | |
test_getitem[slice_int] | 0.1154ms | 27.3088μs | 36.6182 KOps/s | 36.5462 KOps/s | |
test_getitem[range] | 0.2472ms | 0.1073ms | 9.3201 KOps/s | 9.2622 KOps/s | |
test_getitem[tuple] | 0.1178ms | 23.1516μs | 43.1935 KOps/s | 42.5267 KOps/s | |
test_getitem[list] | 0.2544ms | 95.3623μs | 10.4863 KOps/s | 10.4198 KOps/s | |
test_setitem_dim[int] | 65.9810μs | 44.0541μs | 22.6994 KOps/s | 22.7487 KOps/s | |
test_setitem_dim[slice_int] | 99.2810μs | 66.1185μs | 15.1244 KOps/s | 15.2427 KOps/s | |
test_setitem_dim[range] | 0.1538ms | 0.1245ms | 8.0307 KOps/s | 8.0536 KOps/s | |
test_setitem_dim[tuple] | 0.1018ms | 59.7058μs | 16.7488 KOps/s | 16.7142 KOps/s | |
test_setitem | 78.1710μs | 41.4152μs | 24.1457 KOps/s | 25.3695 KOps/s | |
test_set | 0.1850ms | 40.4449μs | 24.7250 KOps/s | 26.1183 KOps/s | |
test_set_shared | 0.3634ms | 52.8025μs | 18.9385 KOps/s | 18.8807 KOps/s | |
test_update | 0.1820ms | 51.0245μs | 19.5984 KOps/s | 21.2908 KOps/s | |
test_update_nested | 0.2340ms | 62.1524μs | 16.0895 KOps/s | 18.6210 KOps/s | |
test_update__nested | 0.1711ms | 61.0011μs | 16.3932 KOps/s | 16.5259 KOps/s | |
test_set_nested | 0.1054ms | 43.9009μs | 22.7786 KOps/s | 24.0599 KOps/s | |
test_set_nested_new | 0.1951ms | 47.2911μs | 21.1456 KOps/s | 22.3926 KOps/s | |
test_select | 0.2186ms | 61.8435μs | 16.1698 KOps/s | 17.1457 KOps/s | |
test_select_nested | 78.1510μs | 41.5301μs | 24.0789 KOps/s | 23.6274 KOps/s | |
test_exclude_nested | 94.6610μs | 58.8418μs | 16.9947 KOps/s | 16.7402 KOps/s | |
test_empty[True] | 0.3139ms | 0.2630ms | 3.8019 KOps/s | 3.8144 KOps/s | |
test_empty[False] | 3.8570μs | 0.7423μs | 1.3472 MOps/s | 1.3577 MOps/s | |
test_to | 67.3600μs | 26.5465μs | 37.6697 KOps/s | 36.8031 KOps/s | |
test_to_nonblocking | 55.1610μs | 25.4854μs | 39.2382 KOps/s | 39.2729 KOps/s | |
test_unbind_speed | 1.0118ms | 0.2729ms | 3.6646 KOps/s | 3.6121 KOps/s | |
test_unbind_speed_stack0 | 0.3459ms | 0.2730ms | 3.6624 KOps/s | 3.5965 KOps/s | |
test_unbind_speed_stack1 | 93.5209ms | 0.7005ms | 1.4275 KOps/s | 1.4117 KOps/s | |
test_split | 94.5152ms | 2.2048ms | 453.5608 Ops/s | 449.7565 Ops/s | |
test_chunk | 95.4846ms | 2.2050ms | 453.5137 Ops/s | 449.5424 Ops/s | |
test_to[False] | 3.5118ms | 3.3353ms | 299.8188 Ops/s | 306.5166 Ops/s | |
test_to[True] | 4.7495ms | 4.4517ms | 224.6315 Ops/s | 225.7125 Ops/s | |
test_to_njt[False] | 0.2276s | 0.2273s | 4.4004 Ops/s | 4.0172 Ops/s | |
test_to_njt[True] | 0.3609s | 0.2788s | 3.5874 Ops/s | 3.8167 Ops/s | |
test_creation[device0] | 0.2595ms | 0.1275ms | 7.8432 KOps/s | 7.8707 KOps/s | |
test_creation_from_tensor | 0.3461ms | 0.1284ms | 7.7889 KOps/s | 7.7576 KOps/s | |
test_add_one[memmap_tensor0] | 0.2442ms | 8.3242μs | 120.1320 KOps/s | 119.6937 KOps/s | |
test_contiguous[memmap_tensor0] | 31.0500μs | 2.1938μs | 455.8244 KOps/s | 455.8531 KOps/s | |
test_stack[memmap_tensor0] | 48.4810μs | 6.5702μs | 152.2030 KOps/s | 150.5917 KOps/s | |
test_memmaptd_index | 1.1093ms | 0.4310ms | 2.3202 KOps/s | 2.3140 KOps/s | |
test_memmaptd_index_astensor | 1.0354ms | 0.5055ms | 1.9783 KOps/s | 1.9671 KOps/s | |
test_memmaptd_index_op | 1.4314ms | 1.0493ms | 953.0236 Ops/s | 996.8816 Ops/s | |
test_serialize_model | 0.1310s | 0.1299s | 7.6980 Ops/s | 7.6830 Ops/s | |
test_serialize_model_pickle | 1.3480s | 1.1899s | 0.8404 Ops/s | 0.8401 Ops/s | |
test_serialize_weights | 0.1304s | 0.1300s | 7.6935 Ops/s | 7.7163 Ops/s | |
test_serialize_weights_returnearly | 0.2405s | 62.7617ms | 15.9333 Ops/s | 15.7638 Ops/s | |
test_serialize_weights_pickle | 1.3459s | 1.1861s | 0.8431 Ops/s | 0.8354 Ops/s | |
test_reshape_pytree | 71.6110μs | 35.7649μs | 27.9604 KOps/s | 27.8550 KOps/s | |
test_reshape_td | 0.1608ms | 42.4348μs | 23.5656 KOps/s | 23.1268 KOps/s | |
test_view_pytree | 0.1204ms | 35.1652μs | 28.4372 KOps/s | 27.9589 KOps/s | |
test_view_td | 0.1388ms | 46.4181μs | 21.5433 KOps/s | 21.1393 KOps/s | |
test_unbind_pytree | 0.1477ms | 34.1820μs | 29.2552 KOps/s | 29.3741 KOps/s | |
test_unbind_td | 0.6942ms | 42.2271μs | 23.6814 KOps/s | 23.5533 KOps/s | |
test_split_pytree | 0.1477ms | 46.0820μs | 21.7005 KOps/s | 21.6518 KOps/s | |
test_split_td | 0.1865ms | 56.6355μs | 17.6568 KOps/s | 17.7078 KOps/s | |
test_add_pytree | 0.2055ms | 56.4085μs | 17.7278 KOps/s | 18.1081 KOps/s | |
test_add_td | 0.1364ms | 96.3847μs | 10.3751 KOps/s | 11.1018 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2852ms | 0.1611ms | 6.2078 KOps/s | 6.1440 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.5462ms | 0.1628ms | 6.1411 KOps/s | 6.1857 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.5426ms | 0.1552ms | 6.4414 KOps/s | 6.3060 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.5723ms | 0.1843ms | 5.4254 KOps/s | 5.5500 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1561ms | 21.7115μs | 46.0585 KOps/s | 45.6237 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 95.8610μs | 49.7457μs | 20.1022 KOps/s | 20.2549 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3183ms | 64.9256μs | 15.4022 KOps/s | 15.3665 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1025ms | 50.1998μs | 19.9204 KOps/s | 20.1277 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4829ms | 0.3181ms | 3.1440 KOps/s | 3.1099 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.6262ms | 0.2330ms | 4.2927 KOps/s | 4.3567 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2013ms | 0.1289ms | 7.7563 KOps/s | 7.7379 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1321ms | 66.3559μs | 15.0702 KOps/s | 15.1394 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4627ms | 0.3273ms | 3.0554 KOps/s | 3.0346 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.8251ms | 0.6529ms | 1.5316 KOps/s | 1.6428 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4221ms | 0.2839ms | 3.5228 KOps/s | 3.5412 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4716ms | 0.3211ms | 3.1144 KOps/s | 3.0977 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2142ms | 78.4721μs | 12.7434 KOps/s | 12.5308 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2225ms | 0.1355ms | 7.3794 KOps/s | 7.6983 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.9070ms | 0.5330ms | 1.8763 KOps/s | 1.9401 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.4781ms | 0.3297ms | 3.0334 KOps/s | 3.0476 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 79.6510μs | 20.2525μs | 49.3767 KOps/s | 51.3483 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.4258ms | 38.1073μs | 26.2417 KOps/s | 25.5158 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.4529ms | 69.0016μs | 14.4924 KOps/s | 14.3913 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.4514ms | 51.0950μs | 19.5714 KOps/s | 19.4915 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.4558ms | 0.8551ms | 1.1694 KOps/s | 1.1073 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.6347ms | 3.2253ms | 310.0530 Ops/s | 317.6531 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.3924ms | 0.8397ms | 1.1909 KOps/s | 1.1039 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.3381ms | 3.1384ms | 318.6301 Ops/s | 321.0946 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.2722ms | 0.1202ms | 8.3189 KOps/s | 8.3996 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.2078ms | 59.5061μs | 16.8050 KOps/s | 16.7764 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2664ms | 0.1142ms | 8.7550 KOps/s | 8.7883 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1926ms | 42.1021μs | 23.7518 KOps/s | 24.0106 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1605ms | 0.1145ms | 8.7363 KOps/s | 8.7147 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1924ms | 42.0243μs | 23.7957 KOps/s | 23.7374 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1823ms | 0.1481ms | 6.7538 KOps/s | 6.8011 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1569ms | 24.8712μs | 40.2071 KOps/s | 39.4446 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1959ms | 0.1400ms | 7.1406 KOps/s | 7.0687 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 84.2210μs | 20.9722μs | 47.6822 KOps/s | 48.6751 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2164ms | 0.1418ms | 7.0512 KOps/s | 7.0521 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 48.2810μs | 20.7709μs | 48.1443 KOps/s | 48.1344 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2960ms | 0.1485ms | 6.7348 KOps/s | 6.7843 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4948ms | 25.1434μs | 39.7719 KOps/s | 39.3973 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2902ms | 0.1420ms | 7.0436 KOps/s | 7.0694 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1004ms | 20.7832μs | 48.1159 KOps/s | 48.9375 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2838ms | 0.1416ms | 7.0605 KOps/s | 7.0593 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.3411ms | 20.7419μs | 48.2115 KOps/s | 47.9806 KOps/s | |
test_mod_add[eager] | 0.1821ms | 32.9387μs | 30.3594 KOps/s | 32.5045 KOps/s | |
test_mod_add[compile] | 0.2295ms | 82.2468μs | 12.1585 KOps/s | 12.1806 KOps/s | |
test_mod_add[compile-overhead] | 0.3192ms | 0.1548ms | 6.4603 KOps/s | 5.8738 KOps/s | |
test_mod_wrap[eager] | 0.3757ms | 0.2382ms | 4.1989 KOps/s | 4.0621 KOps/s | |
test_mod_wrap[compile] | 1.4869ms | 0.2965ms | 3.3721 KOps/s | 3.3406 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5135ms | 3.9631ms | 252.3258 Ops/s | 370.1358 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4780ms | 1.3080ms | 764.5257 Ops/s | 715.2463 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5718ms | 1.3221ms | 756.3885 Ops/s | 704.5232 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3544ms | 0.8918ms | 1.1214 KOps/s | 993.4597 Ops/s | |
test_seq_add[eager] | 0.2691ms | 0.1004ms | 9.9621 KOps/s | 10.3249 KOps/s | |
test_seq_add[compile] | 0.1420ms | 94.8182μs | 10.5465 KOps/s | 10.9299 KOps/s | |
test_seq_add[compile-overhead] | 0.2847ms | 0.1292ms | 7.7404 KOps/s | 7.9596 KOps/s | |
test_seq_wrap[eager] | 0.6126ms | 0.3908ms | 2.5588 KOps/s | 2.6816 KOps/s | |
test_seq_wrap[compile] | 0.4574ms | 0.3161ms | 3.1635 KOps/s | 3.1788 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3916ms | 0.2217ms | 4.5113 KOps/s | 4.4852 KOps/s | |
test_func_call_runtime[False-eager] | 0.8852ms | 0.7185ms | 1.3917 KOps/s | 1.3161 KOps/s | |
test_func_call_runtime[False-compile] | 1.0355ms | 0.7964ms | 1.2556 KOps/s | 1.2699 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4962ms | 0.3610ms | 2.7702 KOps/s | 2.7602 KOps/s | |
test_func_call_runtime[True-eager] | 0.9669ms | 0.8847ms | 1.1303 KOps/s | 1.1233 KOps/s | |
test_func_call_runtime[True-compile] | 0.9907ms | 0.8186ms | 1.2215 KOps/s | 1.2392 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4970ms | 0.3824ms | 2.6150 KOps/s | 2.6245 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9397ms | 0.7384ms | 1.3543 KOps/s | 1.3892 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9408ms | 0.7990ms | 1.2516 KOps/s | 1.2673 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4869ms | 0.3662ms | 2.7307 KOps/s | 2.7598 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1199ms | 0.9937ms | 1.0063 KOps/s | 1.0022 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9902ms | 0.8493ms | 1.1775 KOps/s | 1.1937 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5509ms | 0.4062ms | 2.4621 KOps/s | 2.4535 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4533ms | 2.0065ms | 498.3904 Ops/s | 493.9058 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9949ms | 0.8613ms | 1.1610 KOps/s | 1.1739 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4812ms | 0.4123ms | 2.4257 KOps/s | 2.4339 KOps/s | |
test_distributed | 4.9436ms | 0.1866ms | 5.3597 KOps/s | 8.5599 KOps/s | |
test_tdmodule | 0.1573ms | 16.0654μs | 62.2455 KOps/s | 69.9904 KOps/s | |
test_tdmodule_dispatch | 52.9710μs | 31.5067μs | 31.7393 KOps/s | 36.8741 KOps/s | |
test_tdseq | 37.5500μs | 16.8843μs | 59.2266 KOps/s | 66.3591 KOps/s | |
test_tdseq_dispatch | 55.8610μs | 33.2584μs | 30.0676 KOps/s | 33.4575 KOps/s | |
test_instantiation_functorch | 1.9986ms | 1.8542ms | 539.3237 Ops/s | 537.1741 Ops/s | |
test_exec_functorch | 0.3130ms | 0.2053ms | 4.8710 KOps/s | 4.8701 KOps/s | |
test_exec_functional_call | 0.3791ms | 0.2047ms | 4.8861 KOps/s | 4.9179 KOps/s | |
test_exec_td_decorator | 0.4473ms | 0.2556ms | 3.9116 KOps/s | 3.8627 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8423ms | 0.6744ms | 1.4828 KOps/s | 1.5227 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8449ms | 0.6794ms | 1.4719 KOps/s | 1.5234 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7943ms | 0.5935ms | 1.6849 KOps/s | 1.7322 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7680ms | 0.5819ms | 1.7186 KOps/s | 1.7281 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 18.9508ms | 18.8316ms | 53.1024 Ops/s | 53.3025 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.5349ms | 18.8432ms | 53.0695 Ops/s | 53.1980 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 18.8723ms | 18.6901ms | 53.5042 Ops/s | 53.7761 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.9179ms | 18.8013ms | 53.1879 Ops/s | 53.5404 Ops/s | |
test_to_module_speed[True] | 1.4539ms | 0.9936ms | 1.0064 KOps/s | 988.9079 Ops/s | |
test_to_module_speed[False] | 1.4100ms | 0.9688ms | 1.0322 KOps/s | 1.0159 KOps/s | |
test_tc_init | 60.6010μs | 37.0220μs | 27.0109 KOps/s | 30.7574 KOps/s | |
test_tc_init_nested | 0.2024ms | 74.5248μs | 13.4184 KOps/s | 15.2979 KOps/s | |
test_tc_first_layer_tensor | 4.0486μs | 0.6739μs | 1.4839 MOps/s | 1.4640 MOps/s | |
test_tc_first_layer_nontensor | 23.9410μs | 2.2412μs | 446.1969 KOps/s | 448.5194 KOps/s | |
test_tc_second_layer_tensor | 7.4927μs | 1.3660μs | 732.0730 KOps/s | 727.9802 KOps/s | |
test_tc_second_layer_nontensor | 23.9000μs | 2.9419μs | 339.9153 KOps/s | 337.0489 KOps/s | |
test_unbind | 0.1955s | 9.4583ms | 105.7276 Ops/s | 91.8243 Ops/s | |
test_full_like | 0.7878ms | 0.5734ms | 1.7440 KOps/s | 1.7444 KOps/s | |
test_zeros_like | 0.2861ms | 0.1979ms | 5.0536 KOps/s | 5.0483 KOps/s | |
test_ones_like | 0.3719ms | 0.1979ms | 5.0535 KOps/s | 5.0513 KOps/s | |
test_clone | 0.5391ms | 0.4148ms | 2.4108 KOps/s | 2.4099 KOps/s | |
test_squeeze | 43.9810μs | 9.8596μs | 101.4236 KOps/s | 103.9085 KOps/s | |
test_unsqueeze | 0.2865ms | 75.4560μs | 13.2528 KOps/s | 13.2557 KOps/s | |
test_split | 0.1808s | 0.2071ms | 4.8287 KOps/s | 6.2038 KOps/s | |
test_permute | 0.2758ms | 0.1827ms | 5.4738 KOps/s | 5.6008 KOps/s | |
test_stack | 1.2734ms | 0.8325ms | 1.2012 KOps/s | 1.1738 KOps/s | |
test_cat | 1.3376ms | 1.2313ms | 812.1484 Ops/s | 812.1478 Ops/s |
vmoens
added a commit
that referenced
this pull request
Oct 17, 2024
ghstack-source-id: a54705edffddaa7cdafd037d327a133834d7e669 Pull Request resolved: #1046
vmoens
added a commit
that referenced
this pull request
Oct 17, 2024
ghstack-source-id: 69b76ae5dfd1ee743b0592fc2608a2bbafc945d6 Pull Request resolved: #1046
Labels: CLA Signed, enhancement (New feature or request)
Stack from ghstack (oldest at bottom):