Labels: documentation, good first issue
Description
In addition to the data size, please publish other details about the dataset: total duration, number of files, average file length, audio quality, number of speakers, source, and collection method.
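As a rough illustration, some of these figures (file count, total duration, average length) could be computed with a short script like the one below. It is only a sketch: it assumes the audio is stored as uncompressed WAV files in a single directory, which may not match the actual layout of this dataset, and the function names are made up for the example.

```python
import wave
from pathlib import Path


def wav_duration_seconds(path):
    """Return the duration of one WAV file in seconds."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def corpus_stats(wav_dir):
    """Summarize all *.wav files directly inside wav_dir (hypothetical layout)."""
    durations = [
        wav_duration_seconds(p) for p in sorted(Path(wav_dir).glob("*.wav"))
    ]
    total = sum(durations)
    return {
        "num_files": len(durations),
        "total_hours": total / 3600,
        # Avoid dividing by zero when the directory has no WAV files.
        "mean_seconds": total / len(durations) if durations else 0.0,
    }
```

Speaker count, source, and collection method cannot be derived from the audio this way and would have to come from the maintainers' own records.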
Also, since the transcriptions were produced by Google's speech-to-text service, this should be stated explicitly, along with an estimate of the transcription error rate.
The raw output of a speech-to-text model can, with some caveats, be used to train other models, but it should not be presented as a speech-to-text dataset.
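One possible way to estimate the error would be to manually transcribe a small random sample of files and compute the word error rate (WER) of the automatic transcriptions against those references. The function below is a minimal sketch of that calculation using word-level edit distance; it assumes whitespace tokenization and does no text normalization, both of which are simplifications, and `word_error_rate` is a name chosen for this example.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

For example, comparing the reference "the cat sat on the mat" with the hypothesis "the cat sit on mat" gives one substitution and one deletion over six reference words, so a WER of 2/6. Averaging over a few hundred manually checked files would already give a usable approximate figure.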