Alternate MultiHeadedSelfAttention implementation #51
rayed-therap started this conversation in Show and tell
Thanks for this! I do like your implementation. My original implementation also used some similarly compact approaches, but after trying to explain it in the text and make it more accessible for beginners, I evolved it into the more verbose version I currently have. However, perhaps this could be an optional alternative implementation. Perhaps this could be inside a |
-
Hi @rasbt,
Please check out this MHSA implementation. If you like it, I can add it to the repo. If you love it and decide to include it in the book, I am willing to make as many changes as necessary to get contributor credits 😎