What’s the recommended way to use vLLM openAI server for batch processing? #7639

ktrapeznikov · 2024-08-18T12:05:59Z

ktrapeznikov
Aug 18, 2024

I want to process a batch of requests. What is the recommended way?
I typically use multiple workers with ThreadpoolExectuor. I am wondering if there is a better way?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

What’s the recommended way to use vLLM openAI server for batch processing? #7639

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Uh oh!

What’s the recommended way to use vLLM openAI server for batch processing? #7639

Uh oh!

ktrapeznikov Aug 18, 2024

Replies: 0 comments

ktrapeznikov
Aug 18, 2024