Text-To-Speech - ElevenLabs API ? #707
Replies: 4 comments 4 replies
-
Hello @kalodj, I have watched the YouTube. To use the service, you have to register there first. If you do this, you can proceed as described in the video. I would like to have a voice service that can be used completely independently. |
Beta Was this translation helpful? Give feedback.
-
Hey, I'm working on this now but not using this library. I'm still curious if there are any plans of adding support from streaming a chunked response from the elevenlabs API streamed back as raw 16khz pcm data. Trying to implement it myself, will share here if I succeed. |
Beta Was this translation helpful? Give feedback.
-
here my code in case interested (i do not know how to add a ino code snippet here, so i upload as zip: |
Beta Was this translation helpful? Give feedback.
-
Hi Simon, minor update: I researched a lot in past, i might not use Elevenlabs as it requires a monthly abo payment. One of the best alternatives i found meanwhile is SpeechGenIO, pretty unknown but awesome voices. Also payment needed but a kind on single basket (for a defined volume, which makes sense IMHO. I will use this for sure (in parrallel to the 6 predefined OpenAI voices). In case you are still interested here is the API link, maybe possible to add (similar the Audio::openai_speech()' in Audio.ccp) |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi schreibfaul1, first of all just want to send my thank you ! .. your library is awesome, that’s exactly what i was looking for. Just building an ESP32 device with STT (using Google Cloud) and TTS (using your 'audio.openai_speech' function), btw: inspired from this project (techiesms GPT Voice assistant, he uses your library): https://www.youtube.com/watch?v=gGcskjKtArU.
As i was searching for most professional TTS solution (background: i do need several self defined custom voices, triggered on ESP32 via I2S with your library). ElevenLabs might be the leader here (more powerful than even Open AI). So i was wondering if you plan to add this API ?.., otherwise i might try myself to clone your 'Audio::openai_speech()' in Audio.ccp, using as template for an additional 'Audio::elevenlabs_speech(..)'. But i have concerns not having the needed skills to do myself. :|
Have your ever seen this awesome API?: https://elevenlabs.io/docs/api-reference/text-to-speech (also found this video which describes the capabilities of this TTS: https://www.youtube.com/watch?v=z0sD2BvUfM0).
Do you plan to add this call into your Audio.ccp ? (or should i try myself to test url & tags via Postman first then building an Function inside a local Audio.cpp and come back on questions in case i struggle)
Any feedback much appreciated, thank you !
Beta Was this translation helpful? Give feedback.
All reactions