Using Langchain with speech input #15930
jaymon0703
started this conversation in General
Replies: 0 comments
Hi
Given that multimodal models are becoming increasingly prevalent, is there any work on the LangChain roadmap for "reading" speech input natively, or should we use speech-to-text libraries for the foreseeable future?
For example, I want to verbally ask a question of my app, which uses a QA Chain, API Chain, SQL Agent, etc. Being able to speak instead of type would boost productivity, and doing so natively is already possible with direct access to the Gemini API, for example. A rough sketch of the speech-to-text workaround I have in mind is below.
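This is only a minimal sketch, assuming the openai-whisper and langchain-openai packages with an OPENAI_API_KEY set; the audio path and model names are placeholders, and any chain or agent could stand in for the plain model call:

```python
# Sketch of the speech-to-text workaround: transcribe audio locally with
# openai-whisper, then feed the text to a LangChain model/chain like typed input.
# Assumes `pip install openai-whisper langchain-openai` and OPENAI_API_KEY set;
# "question.wav" and the model names are placeholders.
import whisper
from langchain_openai import ChatOpenAI

# 1. Speech-to-text: transcribe the recorded question to plain text.
stt = whisper.load_model("base")
question_text = stt.transcribe("question.wav")["text"]

# 2. Pass the transcription to the LLM (or a QA Chain / SQL Agent) as usual.
llm = ChatOpenAI(model="gpt-3.5-turbo")
response = llm.invoke(question_text)
print(response.content)
```

It works, but native speech input in LangChain would remove the extra transcription step and the dependency on a separate speech-to-text model.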
Thanks!