A Proposition for Voice Generation and Conversion in ComfyUI #2737
hydrusbeta
started this conversation in
Ideas
Replies: 3 comments 5 replies
-
Good idea. Simpler and more involved, perhaps with support for a simple audio type.
0 replies
-
That would be interesting indeed. Does Hay Say allow the use of third-party
TTS and VC services like ElevenLabs?
--
Eduardo Yeh
Co-Founder, CEO
Selvz LLC <http://selvz.com/>
3 replies
-
I do want to add audio capabilities to ComfyUI eventually. Whether they should be added to the base or not depends on how much code can be shared between the audio and image models.
2 replies
-
Hello,
I am the creator and maintainer of Hay Say, an interface for AI-powered Text-To-Speech and Voice Conversion:
Git: https://github.com/hydrusbeta/hay_say_ui
Live Server: https://haysay.ai/
A couple of months ago, I began working on a complete rewrite of Hay Say from scratch, with the goal of creating a REST API and a node editor interface so that users can create their own pipelines and mix and match components. However, a few days ago, I discovered ComfyUI and was taken aback by the similarities between it and my vision for Hay Say 2.0. This has me wondering whether I should just attempt to extend ComfyUI to work with voice AI models. I have a few questions/points of discussion:
1. Has adding voice capabilities to ComfyUI ever been discussed? I didn't spot anything in the discussions or issues tabs on this GitHub repo.
2. I've only started looking over the code for ComfyUI, so I have limited knowledge of its inner workings so far. If anyone more knowledgeable has input as to the feasibility of such a project, I'm all ears. In the meantime, I'll keep familiarizing myself with the codebase.
3. Lastly, if any maintainers of ComfyUI see this discussion, would you be receptive to me adding voice AI to ComfyUI itself, or would you prefer to keep it a separate project?
I haven't committed 100% to using ComfyUI yet. Any responses to this discussion topic (especially topic #3) will likely influence my decision as to whether I embrace ComfyUI or continue doing my own thing.