Open
Description
Our use case is a robot that gives voice replies and performs related actions. Voice replies are sent over the audio channel, and the related robot actions are exchanged over the data channel with "response.create" and "response.done". The "response.create" request carries the data from the "response.audio_transcript.done" server event.
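For illustration, here is a minimal sketch of that flow, not our exact code. It assumes the transcript is in the `transcript` field of the "response.audio_transcript.done" event and that a text-only "response.create" is used to get the actions. `ACTION_PROMPT` and `build_action_request` are hypothetical names made up for this example.

```python
import json
from typing import Optional

# Hypothetical prompt: the real instructions used for the robot are not shown here.
ACTION_PROMPT = "Return the robot actions that match this spoken reply: "


def build_action_request(server_event: dict) -> Optional[dict]:
    """Build a "response.create" payload from a finished transcript event.

    Returns None for any other event type.
    """
    if server_event.get("type") != "response.audio_transcript.done":
        return None
    # Assumption: the spoken reply text is in the "transcript" field.
    transcript = server_event.get("transcript", "")
    return {
        "type": "response.create",
        "response": {
            # Assumption: a text-only response is enough to describe actions.
            "modalities": ["text"],
            "instructions": ACTION_PROMPT + transcript,
        },
    }


# Made-up event, for illustration only.
event = {"type": "response.audio_transcript.done", "transcript": "Sure, I will wave."}
print(json.dumps(build_action_request(event), indent=2))
```

The resulting payload would be sent over the data channel as JSON, so the action request cannot start until the transcript event has arrived.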
Actual results:
- The "response.audio_transcript.done" server event arrives 5-35 seconds after the corresponding audio has finished transmitting (that is, after the AI agent stops speaking its reply).
- "response.done" arrives 5-10 seconds after "response.create" is sent.
As a result, the robot's actions are executed with a noticeable delay after the robot stops speaking, which looks unnatural and gives the impression that the robot is stuck.
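The two delays can be measured roughly as sketched below, assuming the client keeps a log of `(timestamp_seconds, event_name)` pairs for one turn. `local.audio_playback_ended` is a hypothetical client-side marker for the moment the reply audio stops, not an API event, and the sample timestamps are made up for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical client-side marker: logged when the reply audio stops playing.
AUDIO_ENDED = "local.audio_playback_ended"

EventLog = List[Tuple[float, str]]


def gap(log: EventLog, start: str, end: str) -> Optional[float]:
    """Seconds from the first `start` entry to the first `end` entry at or after it.

    Returns None if either event is missing from the log.
    """
    start_time = next((t for t, name in log if name == start), None)
    if start_time is None:
        return None
    end_time = next((t for t, name in log if name == end and t >= start_time), None)
    if end_time is None:
        return None
    return end_time - start_time


def measure_delays(log: EventLog) -> Dict[str, Optional[float]]:
    """Compute the two delays described in this issue for a single turn."""
    return {
        "transcript_after_audio": gap(log, AUDIO_ENDED, "response.audio_transcript.done"),
        "done_after_create": gap(log, "response.create", "response.done"),
    }


# Made-up timestamps, for illustration only.
log = [
    (0.0, AUDIO_ENDED),
    (12.0, "response.audio_transcript.done"),
    (12.5, "response.create"),
    (19.5, "response.done"),
]
print(measure_delays(log))
# prints {'transcript_after_audio': 12.0, 'done_after_create': 7.0}
```

Only a "response.done" logged at or after "response.create" is counted, so the "response.done" of the spoken reply itself is not mistaken for the action response.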
Expected results:
- The "response.audio_transcript.done" server event arrives at the same time as, or before, the audio data finishes transmitting (when the robot stops talking).
- The "response.done" server event arrives faster, within 1-2 seconds of "response.create" if possible.