1 parent 6a65742 commit 24cfc76
attn_gym/mods/latent_attention.py
@@ -1,4 +1,4 @@
-"""Implementation of Multi-head Level Attention (MLA) RoPE score modification from DeepSeek-V2.
+"""Implementation of Multi-head Latent Attention (MLA) RoPE score modification from DeepSeek-V2.
 
 Reference: https://arxiv.org/pdf/2405.04434 - DeepSeek-V2: A Strong, Economical, and
 Efficient Mixture-of-Experts Language Model
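The docstring being corrected refers to DeepSeek-V2's decoupled-RoPE scoring, where the attention score is the sum of a content ("no-position") term and a separate rotary term. A minimal NumPy sketch of that score decomposition is below; all names and dimensions here are illustrative assumptions, not taken from the repository's actual implementation.

```python
import numpy as np

# Hypothetical dimensions for illustration only (not from the commit).
seq_len, d_nope, d_rope = 4, 8, 2

rng = np.random.default_rng(0)
q_nope = rng.standard_normal((seq_len, d_nope))  # content query part (no positional encoding)
k_nope = rng.standard_normal((seq_len, d_nope))  # content key part
q_rope = rng.standard_normal((seq_len, d_rope))  # decoupled rotary query part
k_rope = rng.standard_normal((seq_len, d_rope))  # decoupled rotary key part

# MLA-style decoupled score: a content term plus a separate RoPE term,
# scaled by the combined head dimension.
scores = (q_nope @ k_nope.T + q_rope @ k_rope.T) / np.sqrt(d_nope + d_rope)
print(scores.shape)
```

The point of the decomposition is that the rotary part can be kept small and handled separately from the compressed latent content path, which is what the "RoPE score modification" in the docstring refers to.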