StyleTTS2 - Text to speech comparable to Eleven Labs quality. #4138
logikstate
started this conversation in
Ideas
Replies: 3 comments 9 replies
-
Yep, I gave it a few hours of tests and looks. Only downside: no international language support. |
Beta Was this translation helpful? Give feedback.
8 replies
-
Thank you! This is exactly what i was looking for, but dang do I wish this was all in 1 step. |
Beta Was this translation helpful? Give feedback.
1 reply
-
Thank you! I don't know how else I could have learned these names based on
my search parameters.
"I studied computers"
CONFIDENTIALITY NOTICE: The contents of this email message and any
attachments are intended solely for the addressee(s) and may contain
confidential and/or privileged information and may be legally protected
from disclosure. It is then shared with tech companies, bots, hackers,
government agencies, and marketers. The security of this message is none,
and it may be shared on Instagram at any time. If you are OK with this,
please respond. There isn't really any security or privacy anywhere. If
you disagree you may want to go camping and talk to people face-to-face
like in old times.
…On Sun, Mar 2, 2025, 03:55 Paul Carter ***@***.***> wrote:
Things have moved on since last year... There's now Kokoro TTS which uses
the same architecture as StyleTTS2... and now LLama also supports TTS
directly via OuteTTS
—
Reply to this email directly, view it on GitHub
<#4138 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/A24QKTQTVW56KGWTHILPRWT2SLPQ5AVCNFSM6AAAAABYEZWD72VHI2DSMVQWIX3LMV43URDJONRXK43TNFXW4Q3PNVWWK3TUHMYTEMZWGUYTKNQ>
.
You are receiving this because you commented.Message ID: <ggml-org/llama.
***@***.***>
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Has anybody seen this? Just had a play around with the collab notebooks and it really is very good quality and it also has voice cloning. I only ask because I can't find anybody talking about this anywhere at all. It may have slipped under everybodys radar?
https://github.com/yl4579/StyleTTS2
Beta Was this translation helpful? Give feedback.
All reactions