Replies: 1 comment 1 reply
-
|
Hi, after the full transcription has been received, the audio is streamed. Implemented here: wyoming_openai/src/wyoming_openai/handler.py Line 866 in a42abac You may have seen the new release: https://github.com/roryeckel/wyoming_openai/releases/tag/v0.3.7 which introduced pysbd text chunking to split at sentence boundaries for parallelized processing speedups: https://github.com/roryeckel/wyoming_openai?tab=readme-ov-file#table-of-environment--command-line-options (see TTS_STREAMING_MODELS) For even more responsiveness, you would also need to combine it with STT streaming transcription using STT_STREAMING_MODELS=gpt-4o-transcribe. To my knowledge, there are not any open source projects implementing streaming STT via OpenAI-compatible endpoints. I would love if someone is able to find any. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I'm not sure if I understand what I'm posting. I wonder if streaming can be implemented for kokoro-fastapi to reduce latency?
Beta Was this translation helpful? Give feedback.
All reactions