Why session_tokens and KV_cache are not matched when max context is reached? #9792
Unanswered
Zapotecatl
asked this question in
Q&A
Replies: 1 comment
-
Seems like it has been overlooked and a fix is necessary. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi, I understand that by setting: "--prompt-cache = C:\MyConversation.bin...", "--prompt-cache-all" and "--keep = -1", I could recover the context and previous conversations.
However, I can't understand when the maximum context is reached and it is necessary to make room for new conversations, the session tokens stop saving instead of matching with the kv cache.
I understand this:
Why are session tokens not updated as expected (session_tokens.erase(first, last))?
Thanks in advance!
Beta Was this translation helpful? Give feedback.
All reactions