Skip to content

mc095/LUMA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LUMA - Voice AI Assistant

A real-time voice chatbot powered by Groq LLaMA 3.1 and Moonshine speech recognition.

LUMA Interface

Just pip install and GROQ_API_KEY, Speak with your favourite Bot on all availiable models on GroqCloud Everything Locally!

video demo

Features

Core Capabilities

  • Seamless voice-to-voice conversations
  • Real-time speech recognition
  • Natural language AI responses
  • Text-to-speech output
  • Beautiful terminal interface

Smart Features

  • Context-aware conversations
  • Automatic speech detection
  • Real-time processing
  • Command system for control
  • Session statistics

System Architecture

LUMA Flow Diagram

The diagram above illustrates LUMA's core components and their interactions:

  1. Audio Processing:

    • Raw audio input capture
    • Silero VAD for speech detection
    • Audio buffering and chunking
  2. Speech Processing Pipeline:

    • Moonshine ASR for speech-to-text
    • Groq LLaMA 3.1 for language processing
    • pyttsx3 for text-to-speech synthesis
  3. System Components:

    • Terminal UI for interaction
    • Session statistics tracking
    • Command handler system

Quick Start

python -m venv venv && venv\Scripts\activate
pip install -r requirements.txt
# Add GROQ_API_KEY to .env file
python main.py

For detailed setup instructions, configuration, and troubleshooting, see our Technical Documentation


Feel free to clone it, use it, and have fun! 🌟

Make a pull request to refactor the code, model usage, or contribute features.

About

An AI Voice Assistant For Personal Work (Llama-3.1 based)

Topics

Resources

Stars

Watchers

Forks

Languages