[DO NOT REVIEW YET] Extend paged attention #8237
base: master
Commits on Oct 2, 2024
- b1a26e7
- f712b34
- cf6dcf5
Commits on Oct 3, 2024
- 43453da
- bf71c8c: Create a new extended_paged_attention API with a flag controlling whether we call the kernel or the non-kernel implementation.
Commits on Oct 4, 2024
- da7150b: Create a test that calls both the non-kernel extended_paged_attention and the kernel version and compares the results.
- f485878
- 7df596e: Add the original paged_attention to torch_xla and make sure torch_xla can call into the local Pallas kernel.
Commits on Oct 7, 2024
- 3eb8e33: Modified the hardcoded number in the test test_extended_paged_attention, and the original paged_attention finishes successfully.
- 830388d
Commits on Oct 9, 2024
- 83528de
- fbca5cf
- ba30b9b
- 43c2bf0: Added a reference extended paged attention implementation and the test for the original paged attention.
- 58fe257
Commits on Oct 10, 2024
- 669d598: Finished implementing v0. Also added a test that uses 1 query token and verifies that extend_paged_attention generates the same result as the original paged_attention.
- 290ab57: Something was wrong with the test. Now the test test_extended_paged_attention_single_query succeeds.
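The single-query check described in the commit messages above can be illustrated with a small pure-Python sketch. It is not the PR's code: the function names `paged_attention_ref` and `extended_paged_attention_ref`, the page-table layout, and the causal rule for multiple query tokens are all assumptions made for illustration. The point is only that, with one query token, an extended variant that masks causally should see every cached token and therefore match plain paged attention.

```python
import math

def gather_kv(pages, page_indices, length, page_size):
    """Collect the first `length` token vectors by following the page table."""
    tokens = []
    for page_id in page_indices:
        for slot in range(page_size):
            if len(tokens) == length:
                return tokens
            tokens.append(pages[page_id][slot])
    return tokens

def attend(q, keys, values):
    """Softmax attention for a single query vector."""
    scores = [sum(a * b for a, b in zip(q, k)) for k in keys]
    m = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) / total
            for i in range(dim)]

def paged_attention_ref(q, k_pages, v_pages, page_indices, length, page_size):
    """Hypothetical reference: one query token attends to all cached tokens."""
    keys = gather_kv(k_pages, page_indices, length, page_size)
    values = gather_kv(v_pages, page_indices, length, page_size)
    return attend(q, keys, values)

def extended_paged_attention_ref(qs, k_pages, v_pages, page_indices, length,
                                 page_size):
    """Hypothetical reference: several query tokens, causally masked.

    Assumption: the queries are the last len(qs) tokens of the sequence, so
    query i may see the first (length - len(qs) + i + 1) cached tokens.
    """
    keys = gather_kv(k_pages, page_indices, length, page_size)
    values = gather_kv(v_pages, page_indices, length, page_size)
    outputs = []
    for i, q in enumerate(qs):
        visible = length - len(qs) + i + 1
        outputs.append(attend(q, keys[:visible], values[:visible]))
    return outputs

# Tiny made-up cache: 3 pages of 2 tokens each, head dimension 2.
page_size = 2
k_pages = [[[1.0, 0.0], [0.0, 1.0]],
           [[1.0, 1.0], [0.5, -0.5]],
           [[2.0, 0.0], [0.0, 2.0]]]
v_pages = [[[0.1, 0.2], [0.3, 0.4]],
           [[0.5, 0.6], [0.7, 0.8]],
           [[0.9, 1.0], [1.1, 1.2]]]
page_indices = [2, 0]  # this sequence uses page 2, then page 0
length = 3             # only 3 of the 4 gathered slots are valid
q = [0.3, -0.2]

single = paged_attention_ref(q, k_pages, v_pages, page_indices, length, page_size)
extended = extended_paged_attention_ref([q], k_pages, v_pages, page_indices,
                                        length, page_size)
assert all(math.isclose(a, b) for a, b in zip(extended[0], single))
```

With more than one query token, the last query under this assumed masking rule still matches the single-query result, while earlier queries see a shorter prefix of the cache.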
Commits on Oct 11, 2024
- 118fba5
- 54f0af1
Commits on Oct 14, 2024
- 3d9e359: Revised the v0 implementation. Added a partly finished v1 implementation. Also added more tests and experiments, which may be cleaned up later.
Commits on Oct 16, 2024
- d282b2e
Commits on Oct 17, 2024
- 069ca31
Commits on Oct 18, 2024
- 8ce1bb3
- 2e839cb
- 6645e7b
- fc0b345
Commits on Oct 20, 2024
- afb97ae
Commits on Oct 21, 2024
- 5a6ff8f
- d6b994a
- 0cca110: Fixed the blocker issue with pltpu.repeat(acc_scale, acc_scale_repeat), caused by the 2nd-to-last dimension being 4 instead of a multiple of 8.
- c33da03
- e8ccd04
- 92672e4
Commits on Oct 22, 2024
- 9834f06
Commits on Oct 24, 2024
- 35a3c55
Commits on Oct 25, 2024
- bb79ead
- 8305949: When we write to o_ref, don't check @pl.when(kv_blk_idx == num_kv_blks - 1). Sometimes lengths[b] is very small, so kv_blk_idx may never reach (num_kv_blks - 1) because of the check @pl.when(kv_blk_idx * compute_blk_size_kv < kv_len).
- 5d28a68
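The o_ref fix described in the commit message above comes down to control flow, which a pure-Python stand-in can show without a TPU. This is a sketch, not the kernel: `simulate_kv_loop` is a hypothetical helper, the `if` statements stand in for the `@pl.when` guards, and it assumes the output write is nested under the length guard, as the commit message suggests. Only the names kv_blk_idx, num_kv_blks, compute_blk_size_kv, kv_len, and o_ref come from the message.

```python
def simulate_kv_loop(kv_len, num_kv_blks, compute_blk_size_kv,
                     write_only_on_last_block):
    """Walk the KV-block axis of the grid for one sequence.

    Returns the value left in the stand-in output buffer, or None if the
    buffer was never written.
    """
    acc = 0       # stand-in for the running accumulator (counts tokens seen)
    o_ref = None  # stand-in for the output buffer
    for kv_blk_idx in range(num_kv_blks):
        # Mirrors @pl.when(kv_blk_idx * compute_blk_size_kv < kv_len):
        # blocks that start past the end of the sequence are skipped.
        if kv_blk_idx * compute_blk_size_kv < kv_len:
            start = kv_blk_idx * compute_blk_size_kv
            end = min(start + compute_blk_size_kv, kv_len)
            acc += end - start
            if write_only_on_last_block:
                # Old behavior: mirrors @pl.when(kv_blk_idx == num_kv_blks - 1).
                if kv_blk_idx == num_kv_blks - 1:
                    o_ref = acc
            else:
                # New behavior: write after every processed block, so the
                # last block that actually ran leaves the final result.
                o_ref = acc
    return o_ref

# A short sequence: only block 0 passes the length guard.
print(simulate_kv_loop(3, 4, 8, write_only_on_last_block=True))    # None
print(simulate_kv_loop(3, 4, 8, write_only_on_last_block=False))   # 3

# A full-length sequence: both variants agree.
print(simulate_kv_loop(32, 4, 8, write_only_on_last_block=True))   # 32
print(simulate_kv_loop(32, 4, 8, write_only_on_last_block=False))  # 32
```

In this model the unconditional write costs extra stores for long sequences, but it is the simplest way to guarantee that short sequences produce an output.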
Commits on Oct 26, 2024
- 0c49d5a