Reduce Hallucination - Implementing temperature time threshold #1087

hermify · 2023-07-07T17:39:14Z

hermify
Jul 7, 2023

Hi there,

I'm currently experiencing some issues with hallucination, similar to what others have encountered. I'm looking for a solution to generate subtitles for a one-hour long video. I just tried some configuration, but with its limits.

According to:
Issue #896,
Pull Request #291
Whisper Thread

Context of the video:
At the start of the video, there is music playing for about 10 minutes, followed by a speech.

Custom Settings:
Beam size: 5 (-bs 5)
Entropy threshold: 2.4 (-et 2.4)
Maximum context: 64 (max-context = 64)

With this configuration, the hallucination is now limited and "only" takes 2 minutes to find the way back. Previously, I had about 60 minutes of the word "[Music]" before making the adjustments.

However, after approximately 64 spoken words, the context changes, and the model starts working fine again. But there is still around 2 minutes of hallucination during the start of the speech. Is there a way to implement a time threshold (in seconds) to establish a new context after 10-15 seconds? Or reset the context, if the temperature is on high level for x seconds?

Further can someone explain the variables? As it might help reducing hallucinations?
--word-thold N [0.01 ] word timestamp probability threshold
--entropy-thold N [2.40 ] entropy threshold for decoder fail
-logprob-thold N [-1.00 ] log probability threshold for decoder fail

Thank you!

hermify · 2023-07-25T08:29:50Z

hermify
Jul 25, 2023
Author

Just a heads up for anyone looking into this issue:
Currently, I have to do post-processing. Reading the vtt-file. Detecting hallucinations (if a certain number of words are found to be repeated), and removing the duplicate entries automatically, then writing the vtt-File

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reduce Hallucination - Implementing temperature time threshold #1087

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Reduce Hallucination - Implementing temperature time threshold #1087

Uh oh!

hermify Jul 7, 2023

Replies: 1 comment

Uh oh!

hermify Jul 25, 2023 Author

hermify
Jul 7, 2023

hermify
Jul 25, 2023
Author