Is there a way to terminate requests on the model server? #5020
gabohouhou
announced in
Q&A
Replies: 0 comments
For example, 10 people call the model API, so there are 10 requests running. 3 of them wait too long and cancel their requests at the front-end. Is there a way to terminate the model server requests for those 3 people while the other 7 keep waiting? Can this be done with vLLM?
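One common pattern is to treat each API call as an independent cancellable task, so aborting one request never disturbs the others. (vLLM's async engine exposes a per-request abort by request ID for this purpose; the exact call below is only referenced in a comment and should be checked against the vLLM version in use.) Here is a minimal, framework-free asyncio sketch of the idea, with `handle_request` standing in for a model inference call:

```python
import asyncio


async def handle_request(request_id: str, duration: float) -> str:
    # Stand-in for model inference. With vLLM you would instead consume
    # the async generate stream and, on client disconnect, abort just
    # this request by its request_id (API name assumed; verify against
    # your vLLM version).
    await asyncio.sleep(duration)
    return f"{request_id}: done"


async def main() -> list[str]:
    # 10 concurrent requests, each wrapped in its own asyncio task.
    tasks = {
        f"req-{i}": asyncio.create_task(handle_request(f"req-{i}", 0.05))
        for i in range(10)
    }
    # 3 clients cancel at the front-end: cancel only their tasks.
    for rid in ("req-0", "req-1", "req-2"):
        tasks[rid].cancel()
    # The remaining 7 tasks run to completion untouched.
    results = []
    for rid, task in tasks.items():
        try:
            results.append(await task)
        except asyncio.CancelledError:
            results.append(f"{rid}: aborted")
    return results


if __name__ == "__main__":
    for line in asyncio.run(main()):
        print(line)
```

In a real server the cancellation trigger would be the front-end closing the HTTP connection (most async web frameworks surface this as a disconnect event), at which point the handler cancels or aborts only that request's task.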