NOTE: This project is no longer maintained due to OpenAI's deprecation of Whisper compatibility with GPT-3.5. However, it can still be used for now by installing Whisper directly: `pip install git+https://github.com/openai/whisper.git`
A full-stack, local-first behavioral intelligence engine for telecom support calls.
Combining Whisper transcription, GPT-3.5 insights (affordable token cost), PII redaction, next best offer generation, and visual analytics — this project gives supervisors real-time understanding of what customers feel, need, and signal during calls. It doesn't stop at classification: it helps supervisors act, follow up, and retain.
This platform can be combined with the Telecom Churn Predictor to extend real-time call analysis into actionable retention strategies.
When enabled, the pipeline uses PII redaction to extract the customer’s phone number, then passes it into the model to return a churn risk classification — High, Medium, or Low — based on historical behavior patterns.
This integration is critical because only 1 in 26 customers will actually inform a company before leaving. The model surfaces silent churn signals that behavioral call analysis alone may miss, providing a more objective and data-grounded view of retention risk.
The churn model:
- Predicts which customers are likely to leave using a stacked ensemble of four classifiers
- Built with real telecom data, trained with stratified validation, and fully reproducible
- Includes a complete pipeline: feature engineering, model stacking, and evaluation
The churn score is embedded directly into the insights output and drives personalized Next Best Offer (NBO) and follow-up script recommendations.
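The stacked-ensemble structure can be sketched with scikit-learn on synthetic data. The four base classifiers below are illustrative stand-ins, not necessarily the exact models used in the shipped predictor:

```python
# Illustrative stacked ensemble of four classifiers with stratified
# validation, mirroring the churn model's structure on synthetic data.
from sklearn.datasets import make_classification
from sklearn.ensemble import (GradientBoostingClassifier,
                              RandomForestClassifier, StackingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, random_state=42)

stack = StackingClassifier(
    estimators=[
        ("rf", RandomForestClassifier(random_state=42)),
        ("gb", GradientBoostingClassifier(random_state=42)),
        ("nb", GaussianNB()),
        ("dt", DecisionTreeClassifier(random_state=42)),
    ],
    final_estimator=LogisticRegression(),
    cv=StratifiedKFold(n_splits=5),  # stratified validation
)
stack.fit(X_train, y_train)
print(f"held-out accuracy: {stack.score(X_test, y_test):.2f}")
```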
For context, Charter Communications — the telecom company this was built for — was previously relying on a spaCy-based model that achieved only ~40% accuracy.
By integrating both tools, telecom teams can surface cancellation intent in real time and predict long-term churn risk, enabling smarter retention offers and materially better customer outcomes.
If the churn model is not connected, the system defaults to GPT-3.5 to estimate churn risk using contextual patterns from the call — such as unresolved complaints, emotional volatility, and dissatisfaction cues. These inferences are supported by real-time sentiment signals extracted using HuggingFace transformers and a fine-tuned DistilBERT model. GPT then recommends offers and actions based on the customer's emotional trajectory and conversational behavior, along with the Core Issue reported.
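The fallback path boils down to folding the sentiment signals into the analysis prompt before the GPT-3.5 call. Here is a minimal sketch of that assembly step; the prompt wording is illustrative, not the exact shipped template:

```python
# Sketch of the GPT-3.5 fallback: when no churn model is connected,
# sentiment signals from the call are folded into the churn-estimation
# prompt. Wording is illustrative, not the production template.

def build_fallback_prompt(transcript: str, sentiment_arc: list[str],
                          core_issue: str) -> str:
    """Assemble a churn-estimation prompt from call context."""
    return (
        "No churn model output is available. Estimate churn risk "
        "(High/Medium/Low) from the call context below.\n"
        f"Core issue: {core_issue}\n"
        f"Sentiment trajectory: {' -> '.join(sentiment_arc)}\n"
        f"Transcript:\n{transcript}"
    )

prompt = build_fallback_prompt(
    "Agent: ... Customer: I've called three times about this bill.",
    ["Calm", "Frustrated", "Angry"],
    "Repeated billing dispute",
)
```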
- Real-time speech-to-text pipelines that actually work
- Churn risk, emotional escalation, and issue classification — extracted live
- Upload audio or raw transcripts, get clean insights
- Next Best Offer recommendations tailored to retention risk
- Supervisor-ready follow-up script snippets based on issue and resolution
- Resolution tactic suggestions customized by behavioral trajectory
- Open architecture — no Azure, no Power BI
- UI built with React + TypeScript
- Transcribes calls with OpenAI Whisper
- Redacts PII with spaCy + Presidio
- Analyzes behavior with GPT-3.5 customized prompts (cheaper tokens, just as effective)
- Detects sentiment with HuggingFace models
- Recommends personalized Next Best Offers to retain high-risk customers
- Generates empathetic, ready-to-use follow-up scripts for supervisors
- Visualizes post-call trends via DuckDB + Altair
- Delivers results in-browser with a fast, styled React frontend
The Next Best Offer (NBO) and script snippet additions serve two connected goals: improving customer retention and enabling supervisors to act quickly and effectively when churn risk is high.
Most telecom summarization tools stop at classifying issues and identifying customer sentiment. But when a customer is flagged as a high churn risk, it's not enough to acknowledge their frustration — the system must recommend what to offer and how to communicate it.
These additions close that loop.
What it does:
- Recommends a concrete incentive to retain a high-risk customer.
- Selects or generates a customer-facing offer using cues from the call.
Why it's important:
- Telecoms routinely use NBO systems to reduce churn through tailored offers.
- This model defaults to practical suggestions like a one-month service credit or a discounted data upgrade, avoiding vague or generic outputs like “None” or “Escalate to loyalty.”
- It ensures that retention incentives can still be surfaced when no model output is available or integrated.
Mehmet at Charter Communications suggested exploring a Next Best Offer (NBO) system — a concept that's not a natural fit for telecom, where there are few core products (typically bundled) and limited upsell diversity. Traditional NBO logic works best in high-SKU, frequent-interaction domains like Amazon (thousands of products, high velocity) or Netflix (rapidly rotating catalogs). Telecom, by contrast, has sparse product sets but deep behavioral data. This implementation reframes NBO using behavioral clustering and usage pattern analysis, aligning offer logic with customer need-states rather than product similarity — a better fit for the domain than standard e-commerce recommenders.
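The need-state reframing can be sketched as a mapping from usage-pattern signals to offers, rather than from product similarity. The need-states, thresholds, and offers below are illustrative examples, not the production catalog:

```python
# Sketch of need-state-driven NBO: offers keyed to behavioral
# need-states rather than product similarity. All names, thresholds,
# and offers are illustrative.

NEED_STATE_OFFERS = {
    "price_sensitive": "One-month service credit",
    "data_constrained": "Discounted data upgrade",
    "service_frustrated": "Priority support + one-month credit",
}

def infer_need_state(usage: dict) -> str:
    """Derive a need-state from simple usage-pattern signals."""
    if usage.get("support_calls_90d", 0) >= 3:
        return "service_frustrated"
    if usage.get("data_overage_months", 0) >= 2:
        return "data_constrained"
    return "price_sensitive"

def next_best_offer(usage: dict) -> str:
    return NEED_STATE_OFFERS[infer_need_state(usage)]
```

Note the defaults: even with no usage data, the mapping falls back to a concrete incentive rather than a vague "Escalate to loyalty."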
What it does:
- Generates short, empathetic, ready-to-read lines for supervisors during follow-up calls.
- Incorporates the customer’s issue, the resolution, and the NBO directly into the script.
Why it's important:
- Supervisors must follow up with both clarity and empathy.
- Script generation reduces guesswork and aligns tone across agents.
- It ensures the offer recommended in the NBO is delivered in a persuasive, human-centered way.
Call Transcript → High Churn Risk → NBO Suggested → Script Generated → Supervisor Follow-up → Customer Retained
This moves the system from passive labeling to proactive retention — combining automation with empathy.
AI-Powered-Call-Center-Intelligence/
├── backend/ # Core logic: FastAPI, Whisper, GPT, redaction
│ ├── main.py # FastAPI app entrypoint
│ ├── whisper_transcribe.py
│ ├── pii_redaction.py
│ ├── gpt_analysis.py
│ ├── sentiment_analysis.py
│ ├── utils.py
│ └── models/
│ ├── telecom_prompt.txt
│ ├── system_prompt.txt
│ └── pii_labels.yaml
│
├── frontend/ # React frontend (create-react-app + TypeScript)
│ ├── public/
│ └── src/
│ ├── App.tsx
│ ├── TextUpload.tsx
│ └── App.css
│
├── analytics/ # Post-call analytics and dashboards
│ ├── call_summary.db
│ ├── duckdb_loader.py
│ ├── analysis_notebook.ipynb
│ └── powerdash_components.py
│
├── data/ # Sample data files
│ ├── transcript.json
│ ├── pii_output.json
│ ├── gpt_output.json
│ └── audio_sample.wav
│
├── config/ # Configs and environment settings
│ └── settings.yaml
│
├── tests/ # Unit tests for pipelines
│ ├── test_transcribe.py
│ ├── test_gpt_prompt.py
│ └── test_redaction.py
│
├── .env # Put your .env with your OpenAI key here
├── run_app.sh # CLI launcher for backend
├── requirements.txt
└── README.md
- Upload audio or text — get structured insights
- Backend includes hooks for real-time call ingestion (e.g. Twilio, voice APIs)
- Emotional arc detection (e.g., Calm → Angry)
- Tactic recommendation engine using structured GPT prompting
- PII masking that preserves useful metadata (e.g. phone/account)
- Sentiment over time + issue heatmaps in notebook
- Fully local: no cloud services required
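The metadata-preserving masking idea can be illustrated in miniature. The real pipeline uses spaCy + Presidio; this regex sketch only shows the principle of masking a phone or account number while keeping its useful tail for lookups:

```python
# Minimal sketch of metadata-preserving PII masking. The actual
# pipeline uses spaCy + Presidio; this regex version only illustrates
# keeping the useful tail of a phone/account number.
import re

def mask_number(text: str, keep_last: int = 4) -> str:
    """Mask long digit runs but keep the last few digits for lookups."""
    def _mask(match: re.Match) -> str:
        digits = re.sub(r"\D", "", match.group())
        return "*" * (len(digits) - keep_last) + digits[-keep_last:]
    return re.sub(r"\+?\d[\d\-\s]{5,}\d", _mask, text)

print(mask_number("Customer at 555-867-5309 confirmed the account."))
```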
Analyze the following telecom customer service call and extract:
1. Core issue reported by the customer
2. Classification: Billing, Connectivity, Retention, Inquiry, Cancel
3. Agent resolution steps taken
4. Customer satisfaction at end
5. Was follow-up promised?
6. List any PII mentioned
7. Emotional tone progression
8. Churn risk
9. Recommended resolution tactic
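The nine-field prompt above is sent to GPT-3.5 via the legacy `openai==0.28.0` client. A minimal sketch of the message assembly (the system-prompt wording here is illustrative; the repo loads its templates from `backend/models/`):

```python
# Sketch of how the analysis prompt is packaged for GPT-3.5 with the
# legacy openai==0.28.0 client. Exact wording is illustrative.

def build_messages(transcript: str, prompt_template: str) -> list[dict]:
    return [
        {"role": "system", "content": "You are a telecom call analyst."},
        {"role": "user", "content": f"{prompt_template}\n\n{transcript}"},
    ]

messages = build_messages(
    "Customer: My bill doubled this month...",
    "Analyze the following telecom customer service call and extract: ...",
)

# With openai==0.28.0 the call looks like (requires OPENAI_API_KEY):
# import openai
# response = openai.ChatCompletion.create(
#     model="gpt-3.5-turbo", messages=messages, temperature=0.2)
# insights = response["choices"][0]["message"]["content"]
```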
Here’s a sample of what you’ll see after analyzing a call:
- Built with React + TypeScript, styled to be clean and user-friendly
- Upload panel: audio file → transcript → insights
- Text panel: paste a transcript → get analysis
- Outputs display with JSON structure and preformatted blocks
- View post-call analytics via Jupyter dashboards (Altair + DuckDB)
- In-memory or persistent call storage
- Charts include:
- Emotional progression
- Resolution tactic frequency
- Satisfaction distribution
- Issue heatmap
This is a standalone, open-source version of Microsoft’s call intelligence accelerator:
- Uses Whisper + GPT-3.5 from OpenAI
- All redaction and classification handled locally
- No subscriptions, no vendor lock-in
Python 3.10.13
This project uses Python 3.10.13 for maximum compatibility with legacy OpenAI libraries (`openai==0.28.0`) and NLP tools like `whisper`, `presidio`, and `torchaudio`, which do not fully support Python 3.11+.
Python 3.10.13:
- Works reliably with `torch`, `torchaudio`, `keras==2.6.0`, and Whisper
- Avoids dependency issues with `presidio`, `spacy`, and `openai==0.28.0` (which supports GPT-3.5, whose tokens cost less than newer models')
- Used as the target version throughout development
I recommend pyenv to install Python 3.10.13 locally:
pyenv install 3.10.13
pyenv local 3.10.13
Install all Python dependencies:
pip install -r requirements.txt
Or install them manually:
pip install fastapi==0.111.0
pip install uvicorn==0.19.0
pip install python-dotenv==1.0.1
pip install openai==0.28.0
pip install git+https://github.com/openai/whisper.git
pip install spacy==3.7.5
pip install presidio-analyzer
pip install presidio-anonymizer
pip install torchaudio==2.0.2
pip install transformers
pip install pandas==1.5.3
pip install duckdb
pip install altair
pip install jupyter
pip install scikit-learn
pip install xgboost
pip install imbalanced-learn
pip install requests
pip install keras==2.6.0
Then add a `.env` file in the project root:
OPENAI_API_KEY=sk-xxxxx
If you're using pipenv instead of pip:
pipenv install
Make sure `python --version` returns `3.10.13` inside your environment.
uvicorn backend.main:app --reload --port 8001
ℹ️ We use port `8001` instead of the default `8000` to avoid conflicts with React's dev server or background processes.
cd frontend
npm install
npm start
Then open your browser at: http://localhost:3000
Make sure your React frontend points to `http://localhost:8001`.
Example in your fetch call:
const response = await fetch('http://localhost:8001/analyze-text', {
  method: 'POST',
  body: formData,
});
MIT © 2025