What are the details of your dataset? #8

Apart from the data size, please publish other information such as total duration, number of files, average file length, quality, number of speakers, source, and collection method.

Also, since these transcriptions were produced by Google's speech-to-text, it would be better to state this explicitly and report the approximate error rate.
The raw outputs of a speech-to-text model can be used, with some care, to train other models, but they certainly cannot be presented as a speech-to-text dataset.
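
As a rough illustration of how the approximate error could be reported, here is a minimal sketch that computes word error rate (WER) over a small sample of files. It assumes you have manually corrected reference transcripts for that sample. The function names `edit_distance` and `corpus_wer` are hypothetical and not part of this repository, and whitespace tokenization is a simplifying assumption that may need adjusting for the dataset's language.

```python
from typing import Iterable, List, Tuple


def edit_distance(ref: List[str], hyp: List[str]) -> int:
    """Word-level Levenshtein distance between a reference and a hypothesis."""
    # prev[j] holds the distance between the reference prefix processed so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1]


def corpus_wer(pairs: Iterable[Tuple[str, str]]) -> float:
    """Total word edits divided by total reference words over all pairs.

    Each pair is (manually corrected reference, machine transcription).
    Tokenization is plain whitespace splitting (an assumption).
    """
    total_edits = 0
    total_ref_words = 0
    for reference, hypothesis in pairs:
        ref_words = reference.split()
        hyp_words = hypothesis.split()
        total_edits += edit_distance(ref_words, hyp_words)
        total_ref_words += len(ref_words)
    if total_ref_words == 0:
        raise ValueError("the reference transcripts contain no words")
    return total_edits / total_ref_words
```

Running `corpus_wer` on even a few hundred randomly chosen files with hand-checked transcripts would give a usable estimate to publish alongside the other statistics.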
