Skip to content

why pad_or_trim use 1000 rather than 3000 when transcribe_audio? #45

@peggyxpxu

Description

@peggyxpxu

why pad_or_trim use 1000 rather than 3000 when transcribe_audio?
mel = pad_or_trim(mel, 1000).to(model.device).to(dtype)

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions