OpenAI TTS and Gemini (Speech Generation) availability in Langchain Python #31907
michelhabib asked this question in Q&A (unanswered)
Checked other resources
Commit to Help
Example Code
# It's not an implemented feature; that is my question.
Description
TTS is now available in the major model lineups from both OpenAI and Gemini, but so far I haven't been able to access these models through LangChain.
In OpenAI, examples are tts-1 and gpt-4o-mini-tts: you pass in text along with voice instructions, and the output is that same text rendered as audio. This is different from models such as gpt-4o-mini-audio-preview, which take audio/text inputs and respond to them with audio.
https://platform.openai.com/docs/guides/audio
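For reference, this is roughly how the native OpenAI SDK exposes these models today (a minimal sketch based on the guide linked above; the exact response helpers may vary by SDK version):

```python
# Minimal sketch of OpenAI text-to-speech via the native openai SDK
# (not LangChain). Model/voice names follow the guide linked above.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.audio.speech.create(
    model="gpt-4o-mini-tts",
    voice="alloy",
    input="Hello from a text-to-speech model!",
    instructions="Speak in a calm, friendly tone.",  # supported by gpt-4o-mini-tts
)

# The response is binary audio content; write it to a file.
with open("speech.mp3", "wb") as f:
    f.write(response.read())
```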
In Gemini, this is called speech generation, with models such as gemini-2.5-flash-preview-tts.
https://ai.google.dev/gemini-api/docs/speech-generation
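The equivalent call through the native google-genai SDK looks roughly like this (a sketch following the speech-generation guide linked above; per the docs, the returned audio is raw PCM that still needs to be wrapped in a WAV container):

```python
# Minimal sketch of Gemini speech generation via the native google-genai SDK
# (not langchain-google-genai). Follows the guide linked above.
from google import genai
from google.genai import types

client = genai.Client()  # reads GEMINI_API_KEY from the environment

response = client.models.generate_content(
    model="gemini-2.5-flash-preview-tts",
    contents="Say cheerfully: have a wonderful day!",
    config=types.GenerateContentConfig(
        response_modalities=["AUDIO"],
        speech_config=types.SpeechConfig(
            voice_config=types.VoiceConfig(
                prebuilt_voice_config=types.PrebuiltVoiceConfig(voice_name="Kore")
            )
        ),
    ),
)

# Per the docs, the returned audio is raw 16-bit PCM at 24 kHz.
pcm_bytes = response.candidates[0].content.parts[0].inline_data.data
```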
I couldn't find any documentation or code examples covering the models above, with the only exception being Gemini speech-generation support in google/libs/vertexai v2.0.26 (released 3 weeks ago); see langchain-ai/langchain-google#949.
But it's not in google/libs/genai v2.1.6, which is the release line I'm using.
My questions:
1. Is it possible to use OpenAI/Gemini TTS with genai v2.1.6? If not, is there a plan to add it? I understand these models were released a while back, so I'm just wondering.
2. Is there a LangChain-idiomatic way to integrate the native SDK code while still benefiting from LangChain/LangGraph? (Something along the lines of the sketch below is what I have in mind.)
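To illustrate question 2, this is the kind of workaround I'm considering: wrapping the native call as a custom LangChain tool so it can be bound to an agent or invoked from a LangGraph node. This is only a sketch of a possible workaround, not an official integration; the wrapped call reuses the OpenAI snippet above.

```python
# Sketch: wrap a native TTS call as a custom LangChain tool so it can be
# used by an agent or from a LangGraph node.
from langchain_core.tools import tool
from openai import OpenAI

client = OpenAI()


@tool
def synthesize_speech(text: str, output_path: str = "speech.mp3") -> str:
    """Convert text to speech with gpt-4o-mini-tts and save it to a file."""
    response = client.audio.speech.create(
        model="gpt-4o-mini-tts",
        voice="alloy",
        input=text,
    )
    with open(output_path, "wb") as f:
        f.write(response.read())
    return output_path


# The tool can then be bound to an agent, or called directly:
# synthesize_speech.invoke({"text": "Hello!", "output_path": "hello.mp3"})
```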
System Info