Skip to content

Caching #4341

Caching #4341
Dec 5, 2023 · 2 comments · 1 reply
Discussion options

You must be logged in to vote

@ggerganov has fixed it in branch gg/server-oai-cache-prompt. Works very well now. See #4329. Makes it feasible to work on large-ish docs and chats interactively with 7B models running on my Mac mini.

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@Michael-F-Ellis
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by Michael-F-Ellis
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants