A simple unofficial web client for OpenAI Text-to-Speech API with multi-language interface support.
- Automatic model loading: fetches all available TTS models via API
- 10 voices: alloy, ash, ballad, coral, echo, fable, nova, onyx, sage, shimmer
- GPT model support: with prompts for speech control (accent, emotions, tone)
- Flexible settings: speech speed, audio format, voice instructions
- Responsive design: works on desktop and mobile devices
- Open https://cucumber148.github.io/openai-tts-client
- Enter your OpenAI API key
- Select model and voice
- Enter text and click "Generate Speech"
The application automatically loads all available TTS models from your OpenAI account.
- MP3 - Universal format (default)
- Opus - For internet streaming, low latency
- AAC - For mobile devices and YouTube
- FLAC - Lossless compression
- WAV - Uncompressed, low latency
- PCM - Raw audio data
GPT models support instructions for controlling various aspects of speech:
- Accent: "Use a British accent"
- Emotions: "Speak cheerfully and energetically"
- Tone: "Speak like a radio host"
- Pace: "Speak slowly and clearly"
- Style: "Speak in a whisper"
- Automatic language detection based on browser settings
- Dynamic model support - automatically picks up new OpenAI models
- Local storage of API key and settings in browser
- Direct requests to OpenAI API without intermediate servers
- OpenAI API key with access to TTS models
- Maximum text length: 4096 characters
- Speech speed adjustable only for legacy models
git clone https://github.com/cucumber148/openai-tts-client
cd openai-tts-client
If you encounter issues:
- Check browser console (F12) for errors
- Verify API key is correct
- Check OpenAI account limits
- Create an issue in the GitHub repository
I wrote this application in a few minutes for quick testing of the OpenAI TTS API. Decided to share it on GitHub in case it might be useful to someone.
GPL-3.0 License