Skip to content

Speech Interaction Subsystem for Pickasso Autonomous Robot Enables wake word detection, speech recognition using Azure Cognitive Services, and real-time ROS integration for robot command execution.

License

Notifications You must be signed in to change notification settings

AapseMatlb/Pickasso-Speech

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

1 Commit
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🐒 Pickasso Speech Interaction Subsystem

This subsystem enables wake word detection, speech recognition using Azure Cognitive Services, and speech synthesis for Pickasso. Commands are published directly to ROS topics for real-time robot control.

πŸ“š Features

  • βœ… Wake Word Detection: "Hey Turtle" (Offline using Vosk)
  • βœ… Speech Recognition: Azure Cognitive Speech API
  • βœ… Speech Synthesis: Azure Text-to-Speech
  • βœ… ROS Integration: Publishes commands to /speech_commands
  • βœ… Lightweight and Edge-Friendly

πŸ› οΈ Hardware Integration

Component Purpose Integration Notes
Microphone Audio Input USB Microphone / Respeaker 4-Mic USB Array
Speakers Audio Output USB/Bluetooth Speaker
Compute Processing Speech Raspberry Pi 4, Jetson Nano, or Onboard PC

πŸ“¦ Installation

pip install azure-cognitiveservices-speech pyaudio vosk rospy std_msgs

Download and extract Vosk model (small English model):

wget https://alphacephei.com/vosk/models/vosk-model-small-en-us-0.15.zip
unzip vosk-model-small-en-us-0.15.zip
mv vosk-model-small-en-us-0.15 model

πŸš€ Usage

  1. Start ROS:
roscore
  1. Run Wake Word Detector:
python wake_word_detector.py

πŸ“‘ ROS Communication

Topic Message Type Purpose
/speech_commands std_msgs/String Publishes recognized commands

πŸ“„ License

MIT License

About

Speech Interaction Subsystem for Pickasso Autonomous Robot Enables wake word detection, speech recognition using Azure Cognitive Services, and real-time ROS integration for robot command execution.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages