This subsystem enables wake word detection, speech recognition using Azure Cognitive Services, and speech synthesis for Pickasso. Commands are published directly to ROS topics for real-time robot control.
- β
Wake Word Detection:
"Hey Turtle"
(Offline using Vosk) - β Speech Recognition: Azure Cognitive Speech API
- β Speech Synthesis: Azure Text-to-Speech
- β
ROS Integration: Publishes commands to
/speech_commands
- β Lightweight and Edge-Friendly
Component | Purpose | Integration Notes |
---|---|---|
Microphone | Audio Input | USB Microphone / Respeaker 4-Mic USB Array |
Speakers | Audio Output | USB/Bluetooth Speaker |
Compute | Processing Speech | Raspberry Pi 4, Jetson Nano, or Onboard PC |
pip install azure-cognitiveservices-speech pyaudio vosk rospy std_msgs
Download and extract Vosk model (small English model):
wget https://alphacephei.com/vosk/models/vosk-model-small-en-us-0.15.zip
unzip vosk-model-small-en-us-0.15.zip
mv vosk-model-small-en-us-0.15 model
- Start ROS:
roscore
- Run Wake Word Detector:
python wake_word_detector.py
Topic | Message Type | Purpose |
---|---|---|
/speech_commands | std_msgs/String | Publishes recognized commands |
MIT License