Replies: 4 comments
-
Hi @kwmedia, Thanks for your interest, and trying out the project!
Just to double check my understanding, you'd like to ability to do the following: First, perform transcription on some chunk of audio, producing a label track. Then, given that label track, perform speech synthesis (text-to-speech), which produces a new audio track. Do I have it right? Thanks! |
Beta Was this translation helpful? Give feedback.
-
Hey Ryan, |
Beta Was this translation helpful? Give feedback.
-
Thanks for the details.. I've been looking at porting this model / pipeline to work within our set of Audacity plugins: https://github.com/suno-ai/bark It might take me a little while to enable, but I believe it would cover most of what you're looking to do. The only part that will (probably) be missing is the ability to perform 'voice cloning' -- looks like they intentionally prevent this feature, as it may cause more harm than good. I'll post some updates here when I have some more details of what the timeline might look like for this feature. Cheers! |
Beta Was this translation helpful? Give feedback.
-
There are also these to look at for this idea. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hey there,
just tried out your transcription plugin. Was surprised how fast it was (running on a RTX3080).
I would love to see the option for a translated speech synthesis from the transcription based on the original audio input.
My usecase would be dubbing for YouTube's MultiLanguageAudio feature.
Thanks for working on this project!
Cheers!
Beta Was this translation helpful? Give feedback.
All reactions