Azure Embeddings Quota Limit #854

danieldekay · 2024-09-18T10:45:58Z

danieldekay
Sep 18, 2024

I am running GPTR and experience a quota limit in my subscription. While I have just asked for an extended quota, I am puzzled that I could also "just have waited for 1s".

Does anyone know if there would be a right point in GPTR to implement a retry with backoff around API calls that could be quota-limited? for example using -- https://pypi.org/project/backoff/

Error running job: Error code: 429 - {'error': {'code': '429', 'message': 'Requests to the Embeddings_Create Operation under Azure OpenAI API version 2024-02-15-preview have exceeded call rate limit of your current OpenAI S0 pricing tier. Please retry after 1 second. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit.'}}

Since Embeddings_Create is not used in GPTR's code, I suspect this being in langchain's code somewhere -- which is called by GPTR. But where?

Langchain recommends setting the maxConcurrency option. https://js.langchain.com/v0.1/docs/modules/data_connection/text_embedding/rate_limits/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Azure Embeddings Quota Limit #854

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Azure Embeddings Quota Limit #854

Uh oh!

Uh oh!

danieldekay Sep 18, 2024

Replies: 0 comments

danieldekay
Sep 18, 2024