Skip to content

Flash-Attention Ruins Timestamps on 1.75 #3066

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
AeneasZhu opened this issue Apr 22, 2025 · 0 comments
Open

Flash-Attention Ruins Timestamps on 1.75 #3066

AeneasZhu opened this issue Apr 22, 2025 · 0 comments

Comments

@AeneasZhu
Copy link

I found that the timestamps of transcript unmatched the speaker many times when turned -fa on after updating to 1.75. Here are the transcripts:

fa-on.txt

fa-off.txt

You can try the extension to add sub on Youtube: https://github.com/yashagarwal1411/SubtitlesForYoutube and check the difference. (Rename the file ".srt")

Here is the original video: https://www.youtube.com/watch?v=_TTI2ZQZpXc.

After the opening music is off, the timestamps is in a mess (though the transcript quality is good) for almost 1 min. But when I turned off flash-attention by removing -fa, the timestamps could match the speaker.

By the way, I used large-v2 to transcribe it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant