-
I'm reading the code of
However, it won't call Every time It this a corner case bug? Did I lose any details? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 2 replies
-
Thanks for the analysis. Ideally, if you can open an issue with the step to reproduce using the server test framework, this would be awesome. |
Beta Was this translation helpful? Give feedback.
-
I don't think this can ever happen since we always process the entire batch (in chunks/views of ... Each token with |
Beta Was this translation helpful? Give feedback.
I don't think this can ever happen since we always process the entire batch (in chunks/views of
n_batch
):https://github.com/ggerganov/llama.cpp/blob/4399f13fb9462cd06f3f154d0aee738425000fea/examples/server/server.cpp#L2033-L2037
...
https://github.com/ggerganov/llama.cpp/blob/4399f13fb9462cd06f3f154d0aee738425000fea/examples/server/server.cpp#L2066-L2079
Each token with
logits == true
should fall in one of the batch views and will be processed.