How is tokenizer's add_prefix_space handled in llama.cpp? #6295
-
I'm debugging certain performance issue and I found that HF tokenizer adds prefix space by default. I'm wondering if llama.cpp does the same since it's not obvious in the code. |
Beta Was this translation helpful? Give feedback.
Answered by
dranger003
Mar 25, 2024
Replies: 1 comment 1 reply
-
I think what you are looking for is here? |
Beta Was this translation helpful? Give feedback.
1 reply
Answer selected by
anhnami
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I think what you are looking for is here?
https://github.com/ggerganov/llama.cpp/blob/ad3a0505e3b6cd777259ee35e61d428357ffc565/llama.cpp#L10912-L10923