why pad_or_trim use 1000 rather than 3000 when transcribe_audio? #45

Open

Open

why pad_or_trim use 1000 rather than 3000 when transcribe_audio?#45

Labels

opened

on Jul 25, 2024

why pad_or_trim use 1000 rather than 3000 when transcribe_audio?
mel = pad_or_trim(mel, 1000).to(model.device).to(dtype)

Metadata

Assignees

No one assigned

Labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests