
llama-parallel is processing only one request when run with various parameters #8388

Answered by skprasadu
skprasadu asked this question in Q&A

I think I figured it out. This is the right command:

./llama-parallel --prompt "where is bangalore\nwho is lord krishna" --parallel 2 --cont-batching --sequences 2 -ntg 1024 -npp 1024,1024 -ngl 35

The key point is that the number of sequences has to equal the number of prompts that need to be processed.

Hope it will be useful to others.
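For anyone who wants to generalize this, here is a minimal shell sketch (my own, not from the thread) that keeps --parallel and --sequences equal to the number of prompts in the --prompt string. It assumes the prompts are separated by a literal "\n", as in the command above; the other flags from the original command (such as -ntg and -npp) can be appended as needed.

```bash
#!/usr/bin/env bash
# Sketch only, not from the original post: derive the sequence count from the
# prompt string so --parallel and --sequences always match the prompt count.
# Assumes prompts are joined with a literal "\n" inside the --prompt value.

PROMPTS="where is bangalore\nwho is lord krishna"

# Count the literal "\n" separators and add one to get the number of prompts.
NUM_SEPARATORS=$(printf '%s' "$PROMPTS" | grep -oF '\n' | wc -l)
NUM_PROMPTS=$((NUM_SEPARATORS + 1))

./llama-parallel \
  --prompt "$PROMPTS" \
  --parallel "$NUM_PROMPTS" \
  --cont-batching \
  --sequences "$NUM_PROMPTS" \
  -ngl 35
```

This just keeps the sequence count and the prompt count in lockstep, which is the point made above.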

Replies: 1 comment 1 reply

Answer selected by ggerganov