Can I use CUSTOM buffer type to optimize the KVCache? #13670

Closed Answered by slaren
Zijie-Tian asked this question in Q&A

It is feasible, but it may require significant changes that are hard to make without prior knowledge of the ggml code. Mainly, you would need to implement the missing operations and ensure that they are properly routed to the extra buffer type's compute functions.
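
As a rough illustration of what "routing" means here, the sketch below is a toy model, not ggml code. Every name in it (`Op`, `Tensor`, `BufferType`, `Route`, `route`) is a hypothetical stand-in, and it assumes routing is decided only by which buffer type holds the node's data. It shows the three cases that matter: an op that stays on the default path, an op the custom buffer type handles itself, and an op that touches custom-layout data but has no custom implementation yet.

```cpp
#include <cassert>
#include <string>
#include <vector>

// NOTE: all types and functions below are hypothetical stand-ins used for
// illustration only. They are not ggml's real API.

enum class Op { MulMat, SoftMax, Cpy };

struct BufferType;  // forward declaration

struct Tensor {
    Op                 op;    // operation that produces this node
    const BufferType * buft;  // buffer type holding the node's data
};

// A buffer type that stores data in its own layout and therefore has to
// provide its own compute functions for the ops that read that data.
struct BufferType {
    std::string     name;
    std::vector<Op> supported;  // ops this buffer type implements itself

    bool supports_op(Op op) const {
        for (Op s : supported) {
            if (s == op) {
                return true;
            }
        }
        return false;
    }
};

enum class Route {
    Default,  // data is in an ordinary buffer, generic kernel applies
    Custom,   // data is in the custom buffer and a custom kernel exists
    Missing,  // data is in the custom buffer but no kernel exists yet
};

// Decide which path would handle a node, given one custom buffer type.
Route route(const Tensor & node, const BufferType & custom) {
    assert(node.buft != nullptr);
    if (node.buft != &custom) {
        return Route::Default;
    }
    return custom.supports_op(node.op) ? Route::Custom : Route::Missing;
}
```

In this model, every node that lands on `Route::Missing` corresponds to an operation that would have to be implemented before a KV cache stored in the custom layout could be used.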

Answer selected by Zijie-Tian