[Feature Request] Max error rate parameter #105

markVaykhansky · 2025-04-08T12:25:20Z

Feature Description
Please add a --max-error-rate parameter which would stop the execution of a single benchmark.

Why is this needed?
When running benchmarks we want to fail as early as possible in order to save costs of either GPU machine time or remote API calls s.a ChatGPT API.

Further Description
From benchmarks that we've been running a reasonable default value would be 0.05 i.e 5% max error rate.
Also, if a benchmark fails due to reaching the max error rate it should be reflected in the report generated by GuideLLM.

The text was updated successfully, but these errors were encountered:

rgreenberg1 added this to GuideLLM Kanban Board May 8, 2025

rgreenberg1 added good first issue Good for newcomers priority-low and removed good first issue Good for newcomers labels May 8, 2025

rgreenberg1 moved this to Ready in GuideLLM Kanban Board May 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Max error rate parameter #105

[Feature Request] Max error rate parameter #105

markVaykhansky commented Apr 8, 2025

[Feature Request] Max error rate parameter #105

[Feature Request] Max error rate parameter #105

Comments

markVaykhansky commented Apr 8, 2025