Skip to content

What is the benefit to store hidden states and attention weights? #101

Discussion options

You must be logged in to vote

Thank you for your question.

In our classification of memory types, we divide them into three categories: plaintext memory, activation memory, and parameter memory. Among these, activation memory can include forms such as KV-caches, hidden states, and attention weights.

In the current version of MemOS, KV-cache is the primary form of activation memory used. This is due to its stable performance, interpretability, and the maturity of research enabling its effective and fast integration. Other forms of activation memory, such as hidden states and attention weights, are still under active research and are not yet mature enough for stable engineering use.

That said, recent studies have demons…

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by Ki-Seki
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants
Converted from issue

This discussion was converted from issue #100 on July 16, 2025 07:57.