🤖 AI Conversation proof of concept

The goal is to build a fully backend-based AI conversational system. Any device, including low-resource ones, can connect via WebSocket and hold a conversation with the AI just by sending and receiving audio chunks. The device benefits from everything the backend provides, e.g. conversation history, LLM power, low-latency streaming, cost-effectiveness, and per-user cost tracking.
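
As a rough illustration of the audio-chunk exchange described above, the sketch below splits a recorded buffer into fixed-size chunks that a client could send one by one over a WebSocket. The helper name `chunkAudio` and the 4096-byte default are assumptions made for this example, not part of the project.

```typescript
// Hypothetical helper: split raw audio bytes into fixed-size chunks.
// The default chunk size is an assumption; a real client would tune it
// to its codec and network conditions.
function* chunkAudio(audio: Uint8Array, chunkSize = 4096): Generator<Uint8Array> {
  if (!Number.isInteger(chunkSize) || chunkSize <= 0) {
    throw new RangeError("chunkSize must be a positive integer");
  }
  for (let offset = 0; offset < audio.length; offset += chunkSize) {
    // subarray returns a view on the same memory, so no bytes are copied.
    yield audio.subarray(offset, offset + chunkSize);
  }
}
```

Each chunk could then be passed to `WebSocket.send`, which accepts binary data, so even a small device only needs to buffer one chunk at a time.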

Features

🎙️ The voice is captured by the browser.
⚡️ Real-time streaming of responses: audio chunks are sent from the backend to the browser. Microphone streaming is planned; for now the whole recording is sent at once.
The models used are powered by Cloudflare and OpenAI.
✍️ Transcribe (ASR): Automatic speech recognition is used to transcribe the audio.
🤔 Think (NLU): The text is sent to an LLM (like Llama or GPT) for processing.
🗣️ Synthesize (TTS): The AI's text response is turned back into speech using the OpenAI TTS API.
⬅️ Return: The generated audio is streamed back to your browser for playback, as audio chunks sent from the backend.
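
The transcribe, think, synthesize and return steps above can be sketched as a single conversation turn. This is a minimal sketch, not the project's actual code: the `Stages` interface, `runTurn`, and the `send` callback are hypothetical names, and the real stages would call the Cloudflare and OpenAI models.

```typescript
// Hypothetical stage signatures. Real implementations would call the
// ASR, LLM and TTS models; here they are injected so the flow is testable.
interface Stages {
  transcribe(audio: Uint8Array): Promise<string>;
  think(userText: string): Promise<string>;
  synthesize(replyText: string): AsyncIterable<Uint8Array>;
}

// Run one turn: audio in, audio chunks out through `send`.
// Returns the text reply so the caller can store it in the chat history.
async function runTurn(
  audio: Uint8Array,
  stages: Stages,
  send: (chunk: Uint8Array) => void,
): Promise<string> {
  const userText = await stages.transcribe(audio); // Transcribe (ASR)
  const replyText = await stages.think(userText); // Think (LLM)
  for await (const chunk of stages.synthesize(replyText)) {
    send(chunk); // Return: forward each TTS chunk as soon as it is ready
  }
  return replyText;
}
```

Forwarding each chunk as it arrives, instead of waiting for the full audio, is what keeps playback latency low on the client.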

Future features

📝 Chat history: Store the chat history in the backend. Add RAG classification.
Benchmark the Cloudflare models: send the same sentence to every model and compare performance. Then evaluate models from Azure, Deepgram (speech), ElevenLabs (expensive!), and LiveKit (ChatGPT system).
➡️ Stream: Send microphone chunks to the backend (currently the whole audio is sent). This needs VAD (voice activity detection) in the backend or on the device, depending on the hardware, so that each sentence is transcribed by the AI model as it is spoken and then passed to TTS. Tests are needed to check whether the AI needs the whole text to give the voice the right mood.
Interrupt the AI while it is talking, to say something else or to answer before it finishes.
Wake word: like "Hey Google".
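
To make the VAD idea from the Stream item more concrete, here is a minimal energy-threshold sketch. It is only one simple approach, and every name and number in it (`EnergyVad`, the 0.02 threshold, the 10-frame hangover) is an assumption for illustration; a production system would more likely use a trained VAD model.

```typescript
// Root-mean-square energy of one frame of PCM samples in [-1, 1].
function rms(frame: Float32Array): number {
  if (frame.length === 0) return 0;
  let sum = 0;
  for (let i = 0; i < frame.length; i++) sum += frame[i] * frame[i];
  return Math.sqrt(sum / frame.length);
}

type VadEvent = "speech_start" | "speech_end" | null;

// Hypothetical energy-based VAD: speech starts on the first loud frame and
// ends after `hangoverFrames` consecutive quiet frames.
class EnergyVad {
  private threshold: number;
  private hangoverFrames: number;
  private speaking = false;
  private quietFrames = 0;

  constructor(threshold = 0.02, hangoverFrames = 10) {
    this.threshold = threshold;
    this.hangoverFrames = hangoverFrames;
  }

  push(frame: Float32Array): VadEvent {
    if (rms(frame) >= this.threshold) {
      this.quietFrames = 0;
      if (!this.speaking) {
        this.speaking = true;
        return "speech_start";
      }
      return null;
    }
    if (this.speaking && ++this.quietFrames >= this.hangoverFrames) {
      this.speaking = false;
      this.quietFrames = 0;
      return "speech_end"; // a good moment to send the utterance to ASR
    }
    return null;
  }
}
```

The hangover avoids cutting a sentence at every short pause. The same `speech_start` event could also serve as the trigger for interrupting the AI while it is talking.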

Old features from cloudflare agents starter

🛠️ Built-in tool system with human-in-the-loop confirmation
📅 Advanced task scheduling (one-time, delayed, and recurring via cron)
🔄 State management and chat history

Prerequisites

Cloudflare account
OpenAI API key

Quick Start

Template downloaded using the Cloudflare CLI:

npm create cloudflare@latest --template cloudflare/agents-starter

Install dependencies:

pnpm install

Set up your environment:

Create a .dev.vars file:

OPENAI_API_KEY=your_openai_api_key
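
During local development, Wrangler loads `.dev.vars` and exposes its values on the Worker's `env` object. The sketch below shows one way to fail early when the key is missing or still set to the placeholder above. The `Env` shape and the `requireOpenAiKey` helper are hypothetical and not taken from this repository.

```typescript
// Hypothetical subset of the Worker environment bindings.
interface Env {
  OPENAI_API_KEY?: string;
}

// Return the configured key, or throw a clear error if it is
// missing, blank, or still the placeholder from the example file.
function requireOpenAiKey(env: Env): string {
  const key = (env.OPENAI_API_KEY ?? "").trim();
  if (key === "" || key === "your_openai_api_key") {
    throw new Error("OPENAI_API_KEY is not set; add it to .dev.vars");
  }
  return key;
}
```

Calling such a check at the start of a request handler gives a readable error instead of a failed OpenAI request later on.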

Run locally:

pnpm start

Deploy:

pnpm run deploy

ayoubqrt/ai-conversation
