Conversify is a real‑time, low‑latency, voice- and vision-enabled AI assistant built on LiveKit. This project demonstrates highly responsive conversational AI workflows, leveraging locally hosted models.
- ⚡ Low Latency: End-to-end response time under 600 ms.
- 🗣️ Real‑time Voice: Natural conversation using local STT and TTS services.
- 🧠 Local LLM Integration: Compatible with any OpenAI‑style API (e.g., SGLang, vLLM, Ollama).
- 👀 Basic Vision: Processes video frames with multimodal LLM prompts (the API shape is sketched after this list).
- 💾 Conversational Memory: Persists context across user sessions.
- 🔧 Configurable: All settings managed via `config/config.yaml`.
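
To make these claims concrete: "OpenAI-style API" means the assistant talks to the model over the standard chat-completions route, and a video frame can ride along as a base64-encoded image content part. The sketch below illustrates the API shape only, not Conversify's internal code; the port (8000) and the model name are placeholders for your own deployment:

```bash
# Sketch: an OpenAI-style multimodal request carrying one video frame.
# Port 8000 and the model name are assumptions; any OpenAI-compatible
# server that accepts vision input (e.g., vLLM serving a VLM) behaves the same.
FRAME_B64=$(base64 -w0 frame.jpg)   # frame.jpg is a placeholder frame capture

curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d @- <<EOF
{
  "model": "your-vlm-model-name",
  "messages": [{
    "role": "user",
    "content": [
      {"type": "text", "text": "Describe what is in this frame."},
      {"type": "image_url", "image_url": {"url": "data:image/jpeg;base64,${FRAME_B64}"}}
    ]
  }]
}
EOF
```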
- OS: Linux or WSL on Windows (tested)
- Python: 3.11+
- Services:
  - LiveKit Cloud (sign up at https://cloud.livekit.io)
  - An LLM inference server with an OpenAI-compatible API (e.g., SGLang, vLLM, Ollama)
  - The Kokoro FastAPI TTS server (https://github.com/remsky/Kokoro-FastAPI)
- Clone the repository

  ```bash
  git clone https://github.com/taresh18/conversify.git
  cd conversify
  ```
- Create a virtual environment (recommended)

  ```bash
  python -m venv venv
  source venv/bin/activate    # Linux/macOS
  # venv\Scripts\activate     # Windows
  ```
- Install dependencies

  ```bash
  pip install -r requirements.txt
  ```
- Configure environment variables

  ```bash
  cp .env.example .env.local
  nano .env.local    # Add your LiveKit and other credentials
  ```
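
  As a sketch, `.env.local` holds your LiveKit credentials. The authoritative key names are in `.env.example`; the names below merely follow common LiveKit SDK conventions and are assumptions here:

  ```bash
  # Assumed key names; copy the real ones from .env.example.
  LIVEKIT_URL=wss://your-project.livekit.cloud
  LIVEKIT_API_KEY=your-api-key
  LIVEKIT_API_SECRET=your-api-secret
  ```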
- Update `config/config.yaml`
  - Set the LLM API endpoint and model names (a quick way to check them is sketched below)
  - Configure STT/TTS server URLs and parameters
  - Adjust vision and memory settings as needed
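
  Before wiring the endpoint in, you can confirm the inference server is up and copy the exact model name it reports. A minimal check, assuming the server listens on localhost:8000:

  ```bash
  # List the model names the OpenAI-compatible server exposes.
  # Port 8000 is an assumption; use whatever your LLM server binds to.
  curl http://localhost:8000/v1/models
  ```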
Ensure all external services are running before starting Conversify.
- Start the LLM server (example using the provided script)

  ```bash
  chmod +x ./scripts/run_llm.sh
  ./scripts/run_llm.sh &
  ```
- Start the Kokoro TTS server

  ```bash
  chmod +x ./scripts/run_kokoro.sh
  ./scripts/run_kokoro.sh &
  ```
- Launch Conversify

  ```bash
  chmod +x ./scripts/run_app.sh
  ./scripts/run_app.sh
  ```
- Interact via the LiveKit Agents Playground
  - Navigate to https://agents-playground.livekit.io
  - Select your LiveKit project and room
  - Join and begin the conversation
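
If the agent joins the room but never responds, a quick sanity check is to confirm the local services are reachable before digging deeper. The ports below are assumptions (8000 for the LLM server, 8880 for Kokoro); substitute whatever your scripts actually bind to:

```bash
# Reachability check; ports are assumptions, adjust to your setup.
for port in 8000 8880; do
  if curl -s -o /dev/null "http://localhost:${port}"; then
    echo "port ${port}: reachable"
  else
    echo "port ${port}: NOT reachable"
  fi
done
```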
All runtime settings live in `config/config.yaml`. Key options include:
- STT: model selection and parameters
- LLM: endpoint URLs and model names
- TTS: voice options and server settings
- Vision: enable/disable frame analysis and thresholds
- Memory: persistence and retrieval parameters
- Logging: level and file path (`app.log`)
Secrets and credentials reside in `.env.local`, following the template in `.env.example`.
```
conversify/
├── config/
│   └── config.yaml        # All application settings
├── conversify/
│   ├── core/              # Orchestration and agent logic
│   ├── stt/               # Speech-to-text client
│   ├── tts/               # Text-to-speech client
│   ├── llm/               # LLM integration client
│   ├── livekit/           # LiveKit session & media management
│   └── utils/             # Logger and shared utilities
├── prompts/
│   └── llm.txt            # System prompt for the LLM
├── scripts/
│   ├── run_llm.sh
│   ├── run_kokoro.sh
│   └── run_app.sh
├── .env.example           # Template for environment variables
├── .env.local             # Local secrets (git-ignored)
├── requirements.txt
├── .gitignore
└── README.md
```
- LiveKit Agents: https://github.com/livekit/agents
- Faster Whisper: https://github.com/SYSTRAN/faster-whisper
- Kokoro FastAPI: https://github.com/remsky/Kokoro-FastAPI
- Memoripy: https://github.com/caspianmoon/memoripy
This project is released under the Apache License 2.0. See the LICENSE file for details.