-
Thank you for creating this excellent Agent Development Kit. I would like to build a real-time bidirectional streaming voice application using SpeechClient V2, but I noticed that adk-python uses V1 in the audio_transcriber.py file. Could you please advise on how to integrate SpeechClient V2 into LLMFlow for bidirectional streaming support? Thank you in advance for your help and for maintaining this great repository. Best regards, |
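For reference, here is a minimal sketch of what streaming recognition against the Speech-to-Text V2 API can look like with the `google-cloud-speech` Python package. The project ID, region, and the `chirp_2` model name are placeholders to adapt to your setup, and the code assumes valid Google Cloud credentials; this is an illustration of the V2 request shape, not the ADK's implementation.

```python
# Sketch: bidirectional streaming with the Speech-to-Text V2 API.
# Assumes the google-cloud-speech package and valid credentials;
# project ID, region, and the "chirp_2" model are placeholders.

def chunk_audio(audio: bytes, chunk_size: int = 25600):
    """Split raw audio bytes into chunks small enough for streaming."""
    for i in range(0, len(audio), chunk_size):
        yield audio[i:i + chunk_size]

def transcribe_streaming_v2(audio: bytes, project_id: str):
    # Imported lazily so the chunking helper above works without the SDK.
    from google.cloud import speech_v2

    client = speech_v2.SpeechClient()
    config = speech_v2.RecognitionConfig(
        auto_decoding_config=speech_v2.AutoDetectDecodingConfig(),
        language_codes=["en-US"],
        model="chirp_2",  # assumption: Chirp 2 is available in your region
    )
    streaming_config = speech_v2.StreamingRecognitionConfig(config=config)
    recognizer = f"projects/{project_id}/locations/us-central1/recognizers/_"

    def requests():
        # The first request carries the recognizer and config;
        # all subsequent requests carry only audio bytes.
        yield speech_v2.StreamingRecognizeRequest(
            recognizer=recognizer, streaming_config=streaming_config
        )
        for chunk in chunk_audio(audio):
            yield speech_v2.StreamingRecognizeRequest(audio=chunk)

    # Responses arrive incrementally as the service transcribes.
    for response in client.streaming_recognize(requests=requests()):
        for result in response.results:
            print(result.alternatives[0].transcript)
```

The key differences from V1 are the per-request `recognizer` resource name and the `RecognitionConfig`/`AutoDetectDecodingConfig` structure; the overall pattern (one config request followed by audio chunks) is the same.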
Replies: 3 comments
-
We will switch to built-in transcription, so this transcriber will no longer be needed. Are there any features you are expecting from SpeechClient V2? |
-
Thank you for your response. If you're planning to switch to a built-in transcription solution, I'm happy to wait for that implementation rather than modifying the current transcriber. My main interest in SpeechClient V2 was to leverage the Chirp2 model for Speech-to-Text functionality. I was hoping to create a real-time bidirectional streaming voice application that could benefit from Chirp2's improved accuracy and performance. Would the upcoming built-in transcription solution potentially support Chirp2 or similar advanced speech recognition models? And do you have any timeline for when this built-in solution might be available? Thanks again for your work on this project. |
-
What would the "built-in transcriber" be? I'm wondering whether it will allow integrating non-Google STT/TTS services. |