
FEAT: Adapt to Rate Limit instead of Failure #37

@MiNeves00

Description

Feature Request

Providers like OpenAI enforce rate limits (for example, a cap on requests per minute).
This feature would let llm studio wait out the limit (or keep retrying) when necessary, so that the call eventually returns a response instead of erroring, even if it takes longer.
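As a minimal sketch of the basic behavior, assuming the OpenAI Python SDK (>= 1.0) and its `RateLimitError`; the `complete_with_backoff` helper and its retry parameters are illustrative, not existing llm studio API:

```python
import random
import time

import openai

client = openai.OpenAI()

def complete_with_backoff(max_retries: int = 6, **create_kwargs):
    """Call chat completions, retrying on rate-limit errors with
    exponential backoff plus jitter instead of failing outright."""
    delay = 1.0
    for attempt in range(max_retries):
        try:
            return client.chat.completions.create(**create_kwargs)
        except openai.RateLimitError:
            if attempt == max_retries - 1:
                raise  # retries exhausted; surface the error
            # Sleep before retrying; jitter keeps parallel callers
            # from retrying in lockstep.
            time.sleep(delay + random.uniform(0, delay))
            delay *= 2

response = complete_with_backoff(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Hello!"}],
)
```

Exponential backoff with jitter is the standard pattern here (it is what the OpenAI docs recommend for rate-limit errors), and it spreads retries out instead of hammering the API.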

Advanced feature:
If it were aware of the user's exact rate limit (which depends on their tier in OpenAI, for example), it could also decide which prompts to send at what time, maximizing use of the rate limit without overstepping it in cases where the prompts run in parallel.
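One way this could work is a shared pacer that every parallel task passes through before calling the provider. `RequestPacer` below is a hypothetical sketch, with `rpm` set from the user's known tier:

```python
import asyncio
import time

class RequestPacer:
    """Admits at most `rpm` requests in any rolling 60-second
    window, delaying callers instead of letting them hit the limit."""

    def __init__(self, rpm: int):
        self.rpm = rpm
        self._starts: list[float] = []  # start times of recent requests
        self._lock = asyncio.Lock()

    async def acquire(self) -> None:
        while True:
            async with self._lock:
                now = time.monotonic()
                # Forget requests that have left the 60 s window.
                self._starts = [t for t in self._starts if now - t < 60]
                if len(self._starts) < self.rpm:
                    self._starts.append(now)
                    return  # under the limit: go ahead
                # Window is full: wait until the oldest entry expires.
                wait = 60 - (now - self._starts[0])
            await asyncio.sleep(wait)

async def send(pacer: RequestPacer, prompt: str) -> None:
    await pacer.acquire()  # blocks while the window is full
    ...                    # issue the actual LLM call here
```

Parallel tasks would share one pacer instance and `await pacer.acquire()` before each call, so bursts get spread across the window rather than rejected.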

Motivation

Gives more robustness to LLM calls. The user does not need to worry about their application breaking when it makes too many requests per minute.

Your contribution

Labels

enhancement (New feature or request), priority:low (Low priority. Could take 1+ month to resolve)
