|
num_q_tokens_per_head_k: int, |
Hi, your updated metadata api change the variable name from num_heads_per_head_k to num_q_token_per_head_k, which will lead to compatibility issue, could you change the var name back or define a new python api for get_mla_metadata?