-
I'm trying to interface https://github.com/paul-gauthier/aider.git with llama's stream (instead of GPT).
stream is working, and from a web browser it offers some options.
Suppose I wanted the prompt to be "you are a 1960's hippie that likes to use flowery language".
At first it's ignored;
then it works, but it keeps on conversing with itself. My 2nd question is that stream seems slow compared to textgen-web-ui running the same model. I think the difference is that the GPU memory can be set in textgen-web-ui; how do I set that in stream?
-
If you want to use curl directly, or something like that, for streaming. As for 2.:
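A minimal sketch of what such a curl call could look like, assuming a llama.cpp `server` instance listening on localhost port 8080 (the port, prompt text, and parameter values here are placeholders, not from the original thread):

```shell
# Request a streamed completion from a running llama.cpp server
# (assumes ./server was started separately and listens on :8080).
curl --no-buffer http://localhost:8080/completion \
  -H "Content-Type: application/json" \
  -d '{
        "prompt": "You are a 1960s hippie that likes to use flowery language.\nUser: hello\nAssistant:",
        "n_predict": 128,
        "stream": true
      }'
# With "stream": true the server replies with server-sent events,
# one "data: {...}" line per generated token chunk.
```

Note that `--no-buffer` matters for streaming: without it, curl may buffer the output and you won't see tokens arrive incrementally.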
`-ngl` or `--n-gpu-layers` is for offloading to GPU. It must be in the output of `main -h`; if it isn't, you compiled it without GPU support.
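Concretely, the check and the flag usage could look like this (the model path and layer count below are illustrative placeholders, not values from this thread):

```shell
# Verify this build of llama.cpp has GPU offload support:
./main -h | grep -- "--n-gpu-layers" || echo "no GPU offload in this build"

# Offload some layers to the GPU when running a model
# (model path and layer count are example values; tune -ngl
# to fit your model within available GPU memory):
./main -m ./models/model.gguf -ngl 35 -p "Hello"
```

Raising `-ngl` until you run out of VRAM is the usual way to close the speed gap with textgen-web-ui, which exposes the same layer-offload setting in its UI.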