Alternate MultiHeadedSelfAttention implementation #51
rayed-therap
started this conversation in
Show and tell
-
Thanks for this! I do like your implementation. My original implementation also used some similarly compact approaches, but after trying to explain it in the text, and to make it a bit more accessible for beginners, I evolved it into the more verbose version I currently have. However, perhaps this could be an optional alternative implementation. Perhaps this could be inside a |
-
Hi @rasbt,
Please check out this MHSA implementation. If you like it, I can add it to the repo. If you love it and decide to include it in the book, I am willing to make as many changes as necessary to get contributor credits 😎