Skip to content

PerceptualEvaluationSpeechQuality - NoUtterancesError: b'No utterances detected' #3304

@maxstrobel

Description

@maxstrobel

🐛 Bug

PerceptualEvaluationSpeechQuality raises for some samples `NoUtterancesError: b'No utterances detected'.

To Reproduce

Sample code and steps to reproduce the behavior with expected result...

Code sample
import torch
from torchmetrics.audio.pesq import PerceptualEvaluationSpeechQuality

target = torch.load("target.pt")
print(f"{target.shape=} {target.dtype=} {target.min()=} {target.max()=} {target.mean()=} {target.std()=}")
pred = torch.load("pred.pt")
print(f"{pred.shape=} {pred.dtype=} {pred.min()=} {pred.max()=} {pred.mean()=} {pred.std()=}")


pesq = PerceptualEvaluationSpeechQuality(fs=8000, mode="nb")
pesq.update(pred, target)
Environment
  • TorchMetrics version (if build from source, add commit SHA): 1.7.0
  • Python & PyTorch Version (e.g., 1.0): 3.11.10 & 2.6.0+cpu (also CUDA version)
  • Any other relevant information such as OS (e.g., Linux): Windows & Linux

Additional context

Attached a data sample to reproduce the issue.

sample_data.zip

Metadata

Metadata

Assignees

No one assigned

    Labels

    bug / fixSomething isn't workinghelp wantedExtra attention is needed

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions