Timeout parameter #81

greg-gav · 2025-10-29T09:46:35Z

greg-gav
Oct 29, 2025

Question is for 'queue-timeout' parameter: as I understand that is for requests that are queued waiting and waiting in queue, doesn't work for requests that are stuck in loop of reasoning which never finishes (happens for some Qwen models). For such cases where generating the response exceeds some timeout, can we have another parameter, say 'request-timeout' so it force stops inference after some set max time?

cubist38 · 2025-10-30T04:41:40Z

cubist38
Oct 30, 2025
Maintainer

Oh, that’s a great point @greg-gav — thank you for your recommendation! Would you be able to help me implement it? I’d really appreciate your support, as I’m currently quite busy with work, and it might take me a few days or even weeks to get to it on my own.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Timeout parameter #81

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Timeout parameter #81

Uh oh!

greg-gav Oct 29, 2025

Replies: 1 comment

Uh oh!

cubist38 Oct 30, 2025 Maintainer

greg-gav
Oct 29, 2025

cubist38
Oct 30, 2025
Maintainer