Replies: 2 comments 4 replies
-
That's a cool idea! How would it be integrated into the phonemizer? |
Beta Was this translation helpful? Give feedback.
-
there would need to be an extension to the tokens to encode breath and tone variance. extended characters for so the Markdown encoding would then provide emotive post processing. Possible: [1] https://en.wikipedia.org/wiki/Ingressive_sound |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
A Markdown extension for emotive speech could be better than SSML in expressiveness and ML training efficiency because of its simplicity and tokenizer friendly syntax.
Markdown's straightforward annotations make it easier to read and edit than XML like markup language.
Beta Was this translation helpful? Give feedback.
All reactions