Asterisk to OpenAI Real-Time Integration

README.md

Asterisk to OpenAI Real-Time Integration

This project connects Asterisk with OpenAI's real-time API to enable real-time voice interactions. It processes incoming audio from Asterisk SIP calls, sends it to OpenAI for processing, and streams the audio responses back to the caller seamlessly. (Please use headphones for testing, using speakers will constantly interrupt communication.)

Features

Asterisk Integration:
- Connects to Asterisk via the ARI (Asterisk REST Interface) at http://127.0.0.1:8088 using credentials asterisk:asterisk.
- Listens for SIP channels entering the Stasis application (stasis_app).
- Creates a mixing bridge for each call, answers the channel, and sets up an ExternalMedia channel.
RTP Audio Handling:
- Listens for μ-law audio from Asterisk on RTP port 12000.
- Receives RTP packets, strips headers, converts μ-law to 24kHz PCM with interpolation, normalizes audio (target RMS 0.15), and buffers it.
OpenAI Real-Time API Integration:
- Establishes a WebSocket connection to OpenAI’s real-time API (wss://api.openai.com/v1/realtime?model=gpt-4o-realtime-preview-2024-12-17).
- Sends normalized PCM audio chunks to OpenAI as base64-encoded data every 200ms.
- Receives PCM audio responses and transcripts from OpenAI, converting PCM back to μ-law for Asterisk.
Audio Streaming:
- Streams OpenAI’s audio responses to Asterisk via RTP with 10ms packet timing (80 samples at 8kHz).
- Manages buffers to avoid overflow (1MB max) and ensures real-time playback with silence padding.
Voice Activity Detection (VAD):
- Configures OpenAI’s server-side VAD with customizable threshold (default 0.1), prefix padding (default 300ms), and silence duration (default 500ms).
- Stops RTP streaming to Asterisk when speech is detected to avoid overlap.
Logging:
- Uses Winston for detailed logging with timestamps, colored output (cyan for client events, yellow for server events, gray for general logs).
- Logs RTP packet stats, audio processing details (RMS, gain), and OpenAI interactions.
File Saving (Optional):
- Saves Asterisk input audio as .raw (μ-law) and OpenAI-processed audio as .wav (24kHz PCM) if ENABLE_SENT_TO_OPENAI_RECORDING is true.
- Files are saved on call end (StasisEnd).
Cleanup:
- Handles call termination (StasisEnd), closing WebSockets, stopping RTP streams, clearing intervals, and destroying bridges.
- Cleans up resources on uncaught exceptions or SIGINT (Ctrl+C).
Configuration:
- Loads settings from .env (e.g., OPENAI_API_KEY, MAX_CALL_DURATION, VAD settings, logging options).
- Provides defaults for unspecified values.

Prerequisites

Node.js: Version 16 or higher.
Asterisk 20: Installed with ARI enabled (default: http://127.0.0.1:8088, user: asterisk, password: asterisk).
OpenAI API Key: Obtain from OpenAI's platform.
SIP Client: A SIP client (e.g., softphone like Linphone) to make calls to Asterisk.

Installation

Clone the Repository:

git clone https://github.com/infinitocloud/asterisk_to_openai_rt.git
cd asterisk_to_openai_rt# asterisk_to_openai_rt

**Rename .env.sample to .env and add your OpenAI key.

Asterisk to OpenAI RealTime

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
test-tools		test-tools
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
asterisk_to_openai_rt.js		asterisk_to_openai_rt.js
extensions.conf.sample		extensions.conf.sample
package-lock.json		package-lock.json
package.json		package.json
pjsip.conf.sample		pjsip.conf.sample

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

README.md

Asterisk to OpenAI Real-Time Integration

Features

Prerequisites

Installation

About

Uh oh!

Releases

Packages

Languages

jersonjunior/asterisk_to_openai_rt

Folders and files

Latest commit

History

Repository files navigation

README.md

Asterisk to OpenAI Real-Time Integration

Features

Prerequisites

Installation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages