Skip to content

An Arduino version using Secure Websockets for ESP32-S3 #31

@akdeb

Description

@akdeb

We are excited to announce an Arduino solution using secure websockets ready to launch your OpenAI Realtime API project. Check it out here: https://github.com/akdeb/ElatoAI

🍔 Our Stack

  1. OpenAI Realtime
  2. Vercel NextJS
  3. ESP32-S3 on Arduino (PlatformIO)
  4. Supabase (for a DB / Edge functions)
  5. Deno Edge functions

🌟 Our Features

  1. Realtime Speech-to-Speech: Instant speech conversion powered by OpenAI's Realtime APIs.
  2. Create Custom AI Agents: Create custom agents with different personalities and voices.
  3. Customizable Voices: Choose from a variety of voices and personalities.
  4. Secure WebSockets: Reliable, encrypted WebSocket communication.
  5. Server VAD Turn Detection: Intelligent conversation flow handling for smooth interactions.
  6. Opus Audio Compression: High-quality audio streaming with minimal bandwidth.
  7. Global Edge Performance: Low latency Deno Edge Functions ensuring seamless global conversations.
  8. ESP32 Arduino Framework: Optimized and easy-to-use hardware integration.
  9. Conversation History: View your conversation history.
  10. Device Management: Register and manage your devices.
  11. User Authentication: Secure user authentication and authorization.
  12. Conversations with WebRTC and Websockets: Talk to your AI with WebRTC on the NextJS webapp and with websockets on the ESP32.
  13. Volume Control: Control the volume of the ESP32 speaker from the NextJS webapp.
  14. Realtime Transcripts: The realtime transcripts of your conversations are stored in the Supabase DB.
  15. OTA Updates: Over the Air Updates for the ESP32 firmware.
  16. Wifi Management: Connect to your Wifi network from the ESP32 device.
  17. Factory Reset: Factory reset the ESP32 device from the NextJS webapp.
  18. Button and Touch Support: Use the button OR touch sensor to control the ESP32 device.
  19. No PSRAM Required: The ESP32 device does not require PSRAM to run the speech to speech AI.

🙌 We'd Love to Hear From You!

We invite you to:

  1. Try out this updated version.
  2. Share your thoughts and ideas to help us make it even better.
  3. Drop your feedback right here or contribute directly via the main repository.

Your input means a lot to us — we're excited to learn from your experience!

📹 Demo Video

Check out this quick demo where we show how chat and voice-triggered commands work:
Watch the Demo

Thanks for being part of the journey! 💡

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions