Add Millisecond or Decimal Second Options for API Request Timing #2428
neno-is-ooo
started this conversation in
Feature Requests
Feature Request: Add Millisecond or Decimal Second Options for API Request Timing
Current Limitation
The UI currently allows the minimum time between API requests to be set only in whole seconds (1, 2, 3, and so on). This is unnecessarily restrictive when working with high-performance AI models such as Claude 3.7 Sonnet, which support much faster request rates.
Proposed Enhancement
Add support for millisecond or decimal-second values for the minimum time between API requests.
Benefits
Improved Responsiveness: Models like Claude 3.7 Sonnet support 1,000 RPM (one request every 60 ms), but the current UI forces a minimum 1-second delay. That caps throughput at 60 RPM, a reduction of ~94%.
Better Rate Limit Optimization: Different AI models have different rate limits. Finer-grained control lets developers tune request timing precisely to each model's capabilities.
Enhanced Developer Experience: Smoother, more responsive coding assistance without unnecessary waiting periods.
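The arithmetic behind the 60 ms and ~94% figures can be checked with a short sketch. The function names here are hypothetical and exist only for illustration; the 1,000 RPM limit is the figure quoted above, and actual limits may vary by account and model.

```python
def min_interval_ms(requests_per_minute: float) -> float:
    """Smallest spacing between requests that stays within an RPM limit."""
    return 60_000 / requests_per_minute


def throughput_loss(requests_per_minute: float, forced_delay_s: float) -> float:
    """Fraction of the allowed request rate lost to a forced minimum delay."""
    achievable_rpm = 60 / forced_delay_s
    return max(0.0, 1 - achievable_rpm / requests_per_minute)


print(min_interval_ms(1000))                 # 60.0
print(round(throughput_loss(1000, 1.0), 2))  # 0.94
```

With a 0.06-second delay the loss drops to zero, which is the case a decimal or millisecond setting would make reachable.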
Use Cases
Implementation Suggestion
A simple input field that accepts decimal values in seconds (0.2, 0.3, etc.) or values with an "ms" suffix, with validation that rejects values below a model-specific minimum.
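As a rough illustration of the suggested input handling, the sketch below parses either form and normalizes it to milliseconds. It is not taken from the project's codebase: `parse_delay_ms` and its `min_ms` parameter are hypothetical names, and treating a bare number as seconds is an assumption chosen to match the existing whole-second setting.

```python
import re

# Accepts "0.2", ".5", "2", "0.3s", "250ms", "250 ms" (case-insensitive unit).
_DELAY_PATTERN = re.compile(
    r"^\s*(\d+(?:\.\d+)?|\.\d+)\s*(ms|s)?\s*$", re.IGNORECASE
)


def parse_delay_ms(text: str, min_ms: float = 0.0) -> float:
    """Parse a delay string and return it in milliseconds.

    A bare number is treated as seconds (an assumption). min_ms stands in
    for a model-specific minimum and is a hypothetical parameter.
    """
    match = _DELAY_PATTERN.match(text)
    if match is None:
        raise ValueError(f"Unrecognized delay value: {text!r}")

    value = float(match.group(1))
    unit = (match.group(2) or "s").lower()
    delay_ms = value if unit == "ms" else value * 1000

    if delay_ms < min_ms:
        raise ValueError(
            f"Delay of {delay_ms:g} ms is below the minimum of {min_ms:g} ms"
        )
    return delay_ms


print(parse_delay_ms("250ms"))            # 250.0
print(parse_delay_ms("0.2", min_ms=60))   # 0.2 seconds, expressed in ms
```

Storing the result in a single unit (milliseconds here) would keep existing whole-second settings valid, since "1" still parses as one second.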
Thank you for considering this enhancement to improve the developer experience!