Add dataset-from-file command to extract datasets from benchmark reports #235

Harshith-umesh · 2025-07-21T21:00:04Z

This PR adds a new preprocessing command that enables users to extract datasets from saved benchmark reports, facilitating "apples-to-apples" model comparisons and reproducible benchmarking workflows.

Adds a new guidellm preprocess dataset-from-file command that converts benchmark report files into reusable datasets.

Users often want to compare different models using identical prompts to eliminate variability. This command extracts successful request-response pairs from benchmark results, creating standardized datasets with known prompt and output token counts.

The command validates the benchmark report, extracts the successful requests, and writes them out as a structured JSON dataset.
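To make the extraction step concrete, here is a minimal sketch of the idea rather than the code in this PR. It assumes the report is a JSON object with a "benchmarks" list whose entries keep successful requests under requests.successful, and that each request carries prompt, prompt_tokens, and output_tokens fields. Those key names, the output row layout, and the extract_dataset helper are all illustrative assumptions.

```python
import json
from typing import Any


def extract_dataset(report: dict[str, Any]) -> list[dict[str, Any]]:
    """Collect one row per successful request (hypothetical report layout)."""
    rows: list[dict[str, Any]] = []
    for benchmark in report.get("benchmarks", []):
        # Assumed layout: requests are grouped by status; only "successful" is used.
        successful = benchmark.get("requests", {}).get("successful", [])
        for request in successful:
            prompt = request.get("prompt")
            if not prompt:
                continue  # skip entries without a usable prompt
            rows.append(
                {
                    "prompt": prompt,
                    "prompt_tokens": request.get("prompt_tokens"),
                    "output_tokens": request.get("output_tokens"),
                }
            )
    return rows


# Tiny in-memory report used only to exercise the sketch.
sample_report = {
    "benchmarks": [
        {
            "requests": {
                "successful": [
                    {"prompt": "Hello", "prompt_tokens": 1, "output_tokens": 5},
                    {"prompt": "", "prompt_tokens": 0, "output_tokens": 0},
                ],
                "errored": [
                    {"prompt": "Timed out", "prompt_tokens": 2, "output_tokens": 0}
                ],
            }
        }
    ]
}

dataset = extract_dataset(sample_report)
print(json.dumps(dataset, indent=2))
```

Errored requests and requests with an empty prompt are dropped, so every remaining row has a prompt with known token counts that can be replayed against another model.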

Features Added

guidellm preprocess dataset-from-file [OPTIONS] BENCHMARK_FILE
Uses GenerativeBenchmarksReport.load_file() for comprehensive input validation
User-friendly error messages without Python tracebacks
Optional --show-stats flag for dataset analysis
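The "error messages without tracebacks" behavior can be sketched as follows. This is an illustration under assumptions, not the PR's implementation: it uses only the standard library, stands in plain JSON parsing for GenerativeBenchmarksReport.load_file(), and the load_report and main helpers, the messages, and the exit codes are hypothetical.

```python
import json
import sys
from pathlib import Path


def load_report(path: str) -> dict:
    """Load a JSON report; raise ValueError with a readable message on failure."""
    file = Path(path)
    if not file.is_file():
        raise ValueError(f"Benchmark file not found: {path}")
    try:
        data = json.loads(file.read_text(encoding="utf-8"))
    except json.JSONDecodeError as exc:
        raise ValueError(f"Benchmark file is not valid JSON: {exc.msg}") from exc
    # Assumed minimal shape check; the real command relies on model validation.
    if not isinstance(data, dict) or "benchmarks" not in data:
        raise ValueError("Benchmark file has no 'benchmarks' section")
    return data


def main(argv: list[str]) -> int:
    """Return an exit code and print one-line errors instead of tracebacks."""
    if len(argv) != 1:
        print("Usage: BENCHMARK_FILE", file=sys.stderr)
        return 2
    try:
        report = load_report(argv[0])
    except ValueError as exc:
        print(f"Error: {exc}", file=sys.stderr)
        return 1
    print(f"Loaded {len(report['benchmarks'])} benchmark(s)")
    return 0
```

Converting every expected failure into a single exception type at the loading boundary keeps the entry point small: it catches one thing, prints one line, and returns a non-zero exit code.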

Testing

./venv/bin/python -m pytest tests/unit/entrypoints/test_dataset_from_file_entrypoint.py -v

Usage Examples

# Extract dataset from benchmark results
guidellm preprocess dataset-from-file benchmark-results.json -o my_dataset.json

# Extract and show statistics about the extracted dataset
guidellm preprocess dataset-from-file benchmark-results.json --show-stats
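As a rough idea of what dataset statistics could look like, the sketch below summarizes token counts over an extracted dataset. The exact figures printed by --show-stats are not described here, so the summarize helper, the row field names, and the chosen statistics (count, min, max, mean) are assumptions for illustration only.

```python
from statistics import mean
from typing import Any


def summarize(dataset: list[dict[str, Any]], field: str) -> dict[str, Any]:
    """Return count/min/max/mean for an integer field, ignoring missing values."""
    values = [row[field] for row in dataset if isinstance(row.get(field), int)]
    if not values:
        return {"count": 0}
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "mean": mean(values),
    }


# Hypothetical rows in the shape an extracted dataset might take.
example_dataset = [
    {"prompt": "a", "prompt_tokens": 10, "output_tokens": 100},
    {"prompt": "b", "prompt_tokens": 20, "output_tokens": 200},
    {"prompt": "c", "prompt_tokens": 30, "output_tokens": None},
]

prompt_stats = summarize(example_dataset, "prompt_tokens")
output_stats = summarize(example_dataset, "output_tokens")
print("prompt tokens:", prompt_stats)
print("output tokens:", output_stats)
```

Rows with a missing token count are skipped rather than treated as zero, so an incomplete entry does not pull the minimum or mean down.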

Harshith-umesh added 2 commits July 21, 2025 16:33

Add dataset-from-file command to extract datasets from benchmark reports

e20e2ba

Fix comments in tests

52d054e
