-
Notifications
You must be signed in to change notification settings - Fork 327
Description
Hello HELM team,
I’m experiencing two persistent issues when trying to run HELM evaluations, both in Google Colab and in a clean local environment (PyCharm, Python 3.10), despite installing the library directly from source using:
git clone https://github.com/stanford-crfm/helm.git
cd helm
pip install -e .[all]
- ModuleNotFoundError: No module named 'helm.benchmark.simple_scenarios'
When attempting to import:
from helm.benchmark.simple_scenarios import MultipleChoiceScenario
I receive:
ModuleNotFoundError: No module named 'helm.benchmark.simple_scenarios'
This happens even after verifying that other modules (like helm.benchmark.metrics.basic_metrics) are present. The simple_scenarios.py module seems to be missing entirely from the installed package.
- Model not recognized by helm-run
In a separate attempt to run a scenario with:
--models-to-run stanfordhealthcare/llama-4-maverick-17b-128e-instruct-fp8
I encounter the following:
Exception: Unknown model 'stanfordhealthcare/llama-4-maverick-17b-128e-instruct-fp8'
The model is correctly defined in the YAML configuration file, but helm-run fails to recognize it.
Questions:
• Has simple_scenarios.py been removed or renamed in recent versions?
• What is the correct way to define and evaluate multiple-choice tasks now?
• How can we ensure that public models are correctly recognized by helm-run?
Any guidance would be greatly appreciated.
Thanks in advance for your help!