Running tts on long text #5152

dln22 · 2022-10-12T14:11:34Z

I am hitting an out-of-memory error when running the TTS model on long sentences. I worked around it by splitting the text, processing it in chunks, and then combining the resulting audio files. This, however, produces small glitches in the final audio at the points where the chunks were merged. Is there any other way to convert long text to audio in one go without running out of memory?

Models I am using:
spectrogram_model="tts_en_fastpitch"
vocoder_model="tts_hifigan"

XuesongYang (Collaborator) · 2022-12-13T08:06:16Z

It would be better to segment your input text at punctuation marks and keep each segment to around 15 seconds of audio.
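
A minimal sketch of that kind of segmentation, in case it helps. It is not NeMo code: the helper name `split_on_punctuation` is hypothetical, and the 15-characters-per-second rate used to approximate "15 seconds" is an assumption you would need to tune for your voice and speaking rate.

```python
import re

# Assumption: roughly 15 characters of English text per second of speech,
# so a 15-second segment is about 225 characters. Tune for your model.
CHARS_PER_SECOND = 15
MAX_SECONDS = 15


def split_on_punctuation(text, max_chars=CHARS_PER_SECOND * MAX_SECONDS):
    """Split text after sentence punctuation, then greedily pack the
    pieces into segments of at most max_chars characters.

    A single piece longer than max_chars is kept whole as its own segment.
    """
    # Break after ., !, ? or ; when followed by whitespace.
    pieces = [p.strip() for p in re.split(r"(?<=[.!?;])\s+", text) if p.strip()]

    segments = []
    current = ""
    for piece in pieces:
        candidate = f"{current} {piece}".strip()
        if current and len(candidate) > max_chars:
            # Adding this piece would exceed the budget: close the segment.
            segments.append(current)
            current = piece
        else:
            current = candidate
    if current:
        segments.append(current)
    return segments


if __name__ == "__main__":
    sample = "Hello world. This is a test! Is it working? Yes; it is."
    for segment in split_on_punctuation(sample, max_chars=30):
        print(segment)
```

Each segment would then be run through the spectrogram model and the vocoder on its own. Because every cut falls on a punctuation mark, the joins land where a pause is natural, which should make merge glitches less noticeable than cuts made mid-sentence.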
