Add support for shallow biasing of Whisper #1889

Open · anthonyrathe wants to merge 7 commits into master

Conversation

@anthonyrathe commented May 2, 2025

Attempting to fix #1789, reported by @zwycl.

Adds an optional contextual biasing parameter to the Whisper model, enabling shallow contextual biasing toward given sequences by modifying the logits during decoding. This is a flexible and fairly simple method, useful for transcribing out-of-vocabulary entities in ASR or for mitigating harmful mistranscriptions by biasing against unwanted token sequences. A similar parameter is implemented in HuggingFace Transformers: https://huggingface.co/docs/transformers/en/internal/generation_utils#transformers.SequenceBiasLogitsProcessor
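
For readers unfamiliar with the technique, here is a minimal NumPy sketch of the kind of logits modification involved. It is not the code from this PR: the function name and token ids are made up, and the matching semantics follow the HuggingFace processor linked above, where a sequence's final token is biased only once the previously generated tokens end with the sequence's prefix, and single-token sequences are biased at every step.

```python
import numpy as np

def apply_sequence_bias(logits, generated_ids, sequence_bias):
    # logits: 1D array of next-token logits over the vocabulary.
    # generated_ids: token ids produced so far in this hypothesis.
    # sequence_bias: dict mapping tuples of token ids to a float bias.
    for seq, bias in sequence_bias.items():
        prefix, last = seq[:-1], seq[-1]
        n = len(prefix)
        # Bias a sequence's final token only when the generated tokens
        # already end with its prefix; single-token sequences (empty
        # prefix) are biased unconditionally.
        if n == 0 or tuple(generated_ids[-n:]) == prefix:
            logits[last] += bias
    return logits

# Illustrative usage: bias toward one made-up token sequence and
# against another. The ids below are not real Whisper tokens.
logits = np.zeros(51865, dtype=np.float32)  # multilingual Whisper vocab size
bias = {(7362, 1092): 4.0, (43138,): -6.0}
apply_sequence_bias(logits, generated_ids=[50258, 7362], sequence_bias=bias)
```

A positive bias raises the chance of completing a sequence once its prefix has appeared; a negative bias suppresses the target token in the same situation.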

@MrigankRaman

> Attempting to merge #1789

Do you have the Python wheels for this branch? I would like to use them.

@anthonyrathe changed the title from "Add biasing" to "Add support for shallow biasing of Whisper" on May 18, 2025
@anthonyrathe (Author)

> Do you have the Python wheels for this branch? I would like to use them.

Yes, you can find the links to the wheels via the checks below.

@anthonyrathe (Author)

@minhthuc2502 would you mind taking a look at this PR? Would love to get this released!

@MrigankRaman

> Yes, you can find the links to the wheels via the checks below.

There might be a bug in this PR. When I use sequence_bias with a model compiled in int8_float16, I get this error:

ValueError: expected storage to be of type float16, but is of type float32

which I don't get when sequence_bias is None, with the exact same input.
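
The error is consistent with a bias tensor created in float32 being combined with the float16 logits of an int8_float16 model; the actual cause in this branch is not traced here, but the usual remedy is to cast the bias to the logits dtype before adding it. A plain NumPy sketch of the idea, with illustrative names:

```python
import numpy as np

def add_bias_safely(logits, bias_row):
    # Cast the bias to the logits dtype so a float32 bias does not
    # clash with float16 logits (as in the int8_float16 case above).
    bias = np.asarray(bias_row, dtype=logits.dtype)
    return logits + bias

fp16_logits = np.zeros(8, dtype=np.float16)
out = add_bias_safely(fp16_logits, [0.0, 4.0, 0.0, -6.0, 0.0, 0.0, 0.0, 0.0])
assert out.dtype == np.float16  # dtype preserved, no storage-type error
```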

@MrigankRaman

Sorry, I resolved it. Does this increase latency?

@anthonyrathe (Author)

> Sorry, I resolved it. Does this increase latency?

It does, unfortunately, especially if the number of sequences to bias is large.
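
To make the scaling concrete: a matching loop like the one sketched earlier runs once per decoding step (and per beam), so the extra work grows roughly linearly with the number of bias sequences and their prefix lengths. A quick way to observe this, reusing the hypothetical apply_sequence_bias from above:

```python
import time
import numpy as np

rng = np.random.default_rng(0)
logits = np.zeros(51865, dtype=np.float32)
generated = [int(t) for t in rng.integers(0, 51865, size=32)]

for num_sequences in (10, 1_000, 100_000):
    bias = {tuple(int(t) for t in rng.integers(0, 51865, size=3)): 4.0
            for _ in range(num_sequences)}
    start = time.perf_counter()
    apply_sequence_bias(logits.copy(), generated, bias)
    # Time for a single decoding step scales with the sequence count.
    print(f"{num_sequences:>7} sequences: {time.perf_counter() - start:.4f} s")
```

This says nothing about where the cost sits in this PR's actual implementation; it only illustrates why per-step work grows with the number of sequences to bias.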

@MrigankRaman

> It does, unfortunately, especially if the number of sequences to bias is large.

Where does this latency increase come from?
