Skip to content
Discussion options

You must be logged in to vote

It is preferred. Almost all ASR models out there use 16 kHz single channel audio as input. Training can be done in wav, flac or mp3, but wav is the fastest.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by AYUSH27112021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants