server
1 parent 0e9c4bf commit d07387c
examples/server/tests/features/cancel.feature
@@ -12,7 +12,7 @@ Feature: Cancellation of llama.cpp server requests
     # KV Cache corresponds to the total amount of tokens
     # that can be stored across all independent sequences: #4130
     # see --ctx-size and #5568
-    And 512 KV cache size
+    And 256 KV cache size
     And 32 as batch size
     And 2 slots
     And 64 server max tokens to predict
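
The comment in the hunk says the KV cache size is a total shared by all sequences, so the per-slot budget depends on the slot count. Below is a minimal sketch of that arithmetic in Python. It assumes the cache is split evenly across slots, which the feature file does not state. The helper name `tokens_per_slot` is hypothetical and is not part of llama.cpp or its test suite.

```python
def tokens_per_slot(kv_cache_size: int, n_slots: int) -> int:
    """Tokens available to one slot, ASSUMING an even split of the KV cache."""
    if n_slots <= 0:
        raise ValueError("n_slots must be positive")
    return kv_cache_size // n_slots


# Values taken from the scenario background in the diff.
N_SLOTS = 2
MAX_PREDICT = 64

before = tokens_per_slot(512, N_SLOTS)  # old KV cache size
after = tokens_per_slot(256, N_SLOTS)   # new KV cache size

# Room left for the prompt if a request generates the full 64 tokens
# (again under the even-split assumption).
prompt_room_before = before - MAX_PREDICT
prompt_room_after = after - MAX_PREDICT

print(before, after, prompt_room_before, prompt_room_after)
```

Under that assumption, halving the KV cache size halves each slot's budget, while the batch size, slot count, and prediction limit in the scenario stay unchanged.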