I started a llama.cpp server for chatting, but its response to a single prompt contained a whole conversation.
I expected:
1. Send a question to the server through HTTP POST
2. The server responds with an answer
3. Repeat 1 and 2
The procedure should look like this:
[send request]User: Hello
[response]AI: Hi, can I help you?
[send request]User: Yeah, I met a problem that...
[response]AI: I understood that...
However, it worked like this:
1. Send a question to the server through HTTP POST
2. The server returns a series of answers and questions that it generated automatically, until the end of the conversation.
That is to say:
[send request]User: Hello
[response]AI: Hi, can I help you? User: Yeah, I met a problem that... AI: I understood that... User: Thank you. Bye. AI: Welcome, Bye.
When I used "main" with the -i parameter, it worked exactly as I expected. However, I also noticed that if I pressed Enter, it kept generating the conversation continuously, including both questions and answers.
What is the underlying issue, and can it be addressed? Thanks a lot!
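For reference, here is a rough sketch of the one-question, one-answer loop I have in mind. It is only an illustration under assumptions: the URL, the `/completion` path, the `prompt`, `n_predict` and `stop` request fields, and the `content` response field are what I believe the server accepts and returns, and they may differ in your build. The helper names (`build_prompt`, `build_payload`, `first_turn_only`, `ask`) are made up for this example. The idea is to ask the server to stop as soon as the next "User:" turn begins, and to cut the text at that marker on the client side as a fallback.

```python
import json
import urllib.request

# Assumptions: the server listens on this URL and accepts a JSON body with
# "prompt", "n_predict" and "stop" fields; adjust to your own setup.
SERVER_URL = "http://127.0.0.1:8080/completion"
USER_TAG = "User:"
AI_TAG = "AI:"


def build_prompt(history, user_message):
    """Flatten past (speaker, text) turns plus the new message into one prompt."""
    lines = [f"{speaker} {text}" for speaker, text in history]
    lines.append(f"{USER_TAG} {user_message}")
    lines.append(AI_TAG)  # leave the AI turn open for the model to complete
    return "\n".join(lines)


def build_payload(prompt, max_tokens=128):
    """Request body asking the server to stop when the next user turn starts."""
    return {"prompt": prompt, "n_predict": max_tokens, "stop": [USER_TAG]}


def first_turn_only(generated):
    """Client-side fallback: keep only the text before the next 'User:' marker."""
    return generated.split(USER_TAG, 1)[0].strip()


def ask(history, user_message):
    """Send one turn and return a single AI reply (needs a running server)."""
    payload = build_payload(build_prompt(history, user_message))
    request = urllib.request.Request(
        SERVER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        body = json.loads(response.read().decode("utf-8"))
    # Assumption: the generated text is returned under the "content" key.
    reply = first_turn_only(body.get("content", ""))
    history.append((USER_TAG, user_message))
    history.append((AI_TAG, reply))
    return reply
```

With a server running, calling `ask(history, "Hello")` with `history = []` and then `ask(history, "Yeah, I met a problem that...")` would correspond to the two request/response pairs in the expected procedure above.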