Please help me in preparing data step #107

cod3r0k · 2025-05-23T13:10:27Z

cod3r0k
May 23, 2025

Hi, I have an Arabic dataset structured as follows:

mydata/wavs/*.wav       # Audio files (22,050 Hz)
mydata/metadata.csv     # Metadata file in the format: id|transcription

I'm unsure about the next steps.

Previously, I used Coqui and eSpeak, which handled everything automatically. But now I'm working with the vits_pytorch code, and I'm stuck at Step 2 — generating mel spectrograms. I'm not sure what I need to do at this stage.

Could you guide me on how to proceed?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Please help me in preparing data step #107

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Please help me in preparing data step #107

Uh oh!

Uh oh!

cod3r0k May 23, 2025

Replies: 0 comments

cod3r0k
May 23, 2025