Skip to content

Commit aeb52ed

Browse files
authored
feat(VAP3-957): Adding Inworld docs page and removing Sesame voice page
1 parent 84d031b commit aeb52ed

File tree

4 files changed

+63
-45
lines changed

4 files changed

+63
-45
lines changed

fern/apis/api/openapi-overrides.yml

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1083,8 +1083,6 @@ components:
10831083
title: TavusVoice
10841084
VapiVoice:
10851085
title: VapiVoice
1086-
SesameVoice:
1087-
title: SesameVoice
10881086
AIEdgeCondition:
10891087
title: AIEdgeCondition
10901088
LogicEdgeCondition:

fern/docs.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -470,8 +470,8 @@ navigation:
470470
path: providers/voice/rimeai.mdx
471471
- page: Deepgram
472472
path: providers/voice/deepgram.mdx
473-
- page: Sesame
474-
path: providers/voice/sesame.mdx
473+
- page: Inworld
474+
path: providers/voice/inworld.mdx
475475
- section: Video models
476476
contents:
477477
- page: Tavus

fern/providers/voice/inworld.mdx

Lines changed: 61 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,61 @@
1+
---
2+
title: InworldAI
3+
subtitle: What is Inworld.ai?
4+
slug: providers/voice/inworld
5+
---
6+
7+
**What is Inworld.ai?**
8+
9+
Inworld.ai provides developers with tools to create lifelike voice agents. It supports zero-shot voice cloning, enabling the creation of personalized voices from short audio samples. The system is optimized for low-latency streaming, making it suitable for applications requiring immediate audio responses.
10+
11+
**The Evolution of AI Speech Synthesis:**
12+
13+
Advancements in deep learning and neural networks have significantly improved the quality of AI-generated speech. Inworld.ai leverages these developments to deliver natural-sounding, emotionally expressive voices suitable for various applications, including virtual assistants and interactive games.
14+
15+
**Overview of Inworld.ai's Offerings:**
16+
17+
Inworld.ai provides a comprehensive suite of features designed to meet diverse voice synthesis needs:
18+
19+
**Real-Time Speech Synthesis:**
20+
21+
Inworld.ai is engineered for low-latency performance, delivering the first two seconds of audio in approximately 200 milliseconds. This responsiveness is critical for real-time applications such as conversational agents and interactive gaming characters.
22+
23+
**Zero-Shot Voice Cloning:**
24+
25+
The platform offers zero-shot voice cloning, allowing developers to create custom voices from as little as 5 seconds of audio input. This feature facilitates the development of unique voice identities for various applications.
26+
27+
**Multilingual Support:**
28+
29+
Inworld.ai supports 11 languages, including English, Spanish, French, Korean, and Chinese. This multilingual capability enables developers to build applications for diverse global audiences.
30+
31+
**Audio Markup Controls:**
32+
33+
Developers can use audio markup tags such as [happy], [whispering], or [sigh] to control the emotional tone and style of the synthesized speech. This feature enhances the expressiveness of voice agents.
34+
35+
**Developer API:**
36+
37+
Inworld.ai provides an API with comprehensive documentation, facilitating integration into various applications. The API supports real-time streaming and offers options for customizing voice parameters to suit specific use cases.
38+
39+
**Use Cases for Inworld.ai:**
40+
41+
Inworld.ai's versatile platform supports a wide range of applications:
42+
43+
**Interactive Applications:**
44+
45+
Developers can create responsive voice agents for customer service, virtual assistants, and interactive gaming characters, enhancing user engagement through natural-sounding speech.
46+
47+
**Content Creation:**
48+
49+
Content creators can utilize Inworld.ai to generate high-quality voiceovers for videos, podcasts, and other media, streamlining the production process.
50+
51+
**Education and Training:**
52+
53+
Educational platforms can employ Inworld.ai to provide clear and expressive narration for e-learning materials, improving the learning experience for users.
54+
55+
**Integration with Vapi:**
56+
57+
Inworld.ai is integrated with Vapi, allowing developers to access its features through the Vapi platform. This integration simplifies the process of building and deploying voice agents, offering tools for testing and optimizing performance before production.
58+
59+
**Conclusion:**
60+
61+
Inworld.ai offers a combination of expressive voice synthesis, low-latency performance, and multilingual support, making it a valuable tool for developers seeking to enhance their applications with natural-sounding speech.

fern/providers/voice/sesame.mdx

Lines changed: 0 additions & 41 deletions
This file was deleted.

0 commit comments

Comments
 (0)