I wonder what causes the difference in Llama3-8b answers for the same prompt when I use curl against the llama.cpp HTTP server endpoint versus when I query the same llama.cpp server through chat-ui.
The curl request comes back with messy output, as if the model ignores the rules in the system_prompt parameter, while the chat-ui output looks good to me.
My guess is that the reason is the chatPromptTemplate I set in the chat-ui config versus the raw string I pass as prompt in the curl request. Can this cause the difference? If so, how can I set a chat template when using the /completion endpoint? (A rough sketch of both requests is below.)
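For context, this is roughly the difference I mean. The first request is what I send now (just raw text); the second is what I think the /completion request would look like with the Llama 3 chat template written out by hand. The URL, port, sampling parameters, and example messages are placeholders, and I am not sure whether `<|begin_of_text|>` should be included manually or is added by the server's tokenizer.

```sh
# What I send now: plain text, no chat structure around the system rules.
curl -s http://localhost:8080/completion \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "You are a helpful assistant. Answer briefly.\nWhat is the capital of France?",
    "n_predict": 128
  }'

# What I assume the templated version would look like: the Llama 3 special
# tokens wrapped around the system and user messages, stopping on <|eot_id|>.
curl -s http://localhost:8080/completion \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\nYou are a helpful assistant. Answer briefly.<|eot_id|><|start_header_id|>user<|end_header_id|>\n\nWhat is the capital of France?<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n",
    "n_predict": 128,
    "stop": ["<|eot_id|>"]
  }'
```

Or is the intended way to use the OpenAI-compatible /v1/chat/completions endpoint, which, as far as I understand, applies the model's chat template on the server side?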
Thank you for any suggestions.