We use AudioBench to evaluate Spoken‑MQA.
- Install AudioBench
git clone https://github.com/AudioLLMs/AudioBench
cd AudioBench
pip install -r requirements.txt
- Install additional dependencies for Spoken-MQA
pip install -r latex2sympy2
We provide a script for full dataset evaluation. Copy the bash script eval_spoken_mqa.sh
to AudioBench/
bash eval_spoken_mqa.sh
In the paper on Spoken‑MQA, we evaluate several models, such as math-specific LLMs, which may require customized prompting formats or are not yet supported in AudioBench.
To enable evaluation of these models:
-
Their implementation files are provided in the
model_src/
directory of this repository. -
To integrate them with AudioBench:
- Copy the model files into
AudioBench/src/model_src/
. - Modify
AudioBench/src/model.py
to handle the new models appropriately
- Copy the model files into
If you use Spoken-MQA or find our work helpful, please cite:
@article{wei2025towards,
title = {Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems},
author = {Wei, Chengwei and Wang, Bin and Kim, Jung-jae and Chen, Nancy F},
journal = {arXiv preprint arXiv:2505.15000},
year = {2025}
}