A real-time voice chatbot powered by Groq LLaMA 3.1 and Moonshine speech recognition.
Just pip install
and GROQ_API_KEY
, Speak with your favourite Bot on all availiable models on GroqCloud Everything Locally
!
- Seamless voice-to-voice conversations
- Real-time speech recognition
- Natural language AI responses
- Text-to-speech output
- Beautiful terminal interface
- Context-aware conversations
- Automatic speech detection
- Real-time processing
- Command system for control
- Session statistics
The diagram above illustrates LUMA's core components and their interactions:
-
Audio Processing:
- Raw audio input capture
- Silero VAD for speech detection
- Audio buffering and chunking
-
Speech Processing Pipeline:
- Moonshine ASR for speech-to-text
- Groq LLaMA 3.1 for language processing
- pyttsx3 for text-to-speech synthesis
-
System Components:
- Terminal UI for interaction
- Session statistics tracking
- Command handler system
python -m venv venv && venv\Scripts\activate
pip install -r requirements.txt
# Add GROQ_API_KEY to .env file
python main.py
For detailed setup instructions, configuration, and troubleshooting, see our Technical Documentation
Feel free to clone it, use it, and have fun! 🌟
Make a pull request to refactor the code, model usage, or contribute features.