A Python-based service for converting audio files into viseme timing sequences. This tool uses whisper.cpp for speech recognition and converts the output into viseme sequences suitable for facial animation.
- Clone the repository:

      git clone https://github.com/edmundman/Viseme_generator
      cd Viseme_generator

- Install the required dependencies:

      pip install -r requirements.txt
The service will automatically download and compile whisper.cpp and required models on first run.
Process an audio file directly:

    python viseme_processor.py input_audio.wav --output output.timing
Options:

- `--output`: Specify the output file path (optional)
- `--install-path`: Custom installation path for whisper.cpp (optional)
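To batch-process files from Python, the invocation above can be assembled programmatically. `build_command` is an illustrative helper, not part of the repository; it only reflects the flags documented in this README:

```python
def build_command(audio_path, output_path=None, install_path=None):
    """Assemble the viseme_processor.py CLI invocation documented above."""
    cmd = ["python", "viseme_processor.py", audio_path]
    if output_path:
        cmd += ["--output", output_path]      # optional output file path
    if install_path:
        cmd += ["--install-path", install_path]  # optional whisper.cpp location
    return cmd

print(build_command("input_audio.wav", output_path="output.timing"))
# → ['python', 'viseme_processor.py', 'input_audio.wav', '--output', 'output.timing']
```

Pass the resulting list to `subprocess.run(..., check=True)` to execute it.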
Start the FastAPI server:

    python vis_server.py
The server will start on `http://localhost:8000` by default.
- `POST /process/`
  - Upload a WAV file for processing
  - Returns JSON with viseme timing data
- `GET /health/`
  - Health check endpoint
  - Returns server status
Using curl:

    curl -X POST "http://localhost:8000/process/" \
      -H "accept: application/json" \
      -H "Content-Type: multipart/form-data" \
      -F "file=@your_audio.wav"
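The same request can be made from Python. This is a hypothetical client sketch, not part of the repository; it assumes the third-party `requests` package and a server running on the default address:

```python
import requests  # third-party: pip install requests

API_URL = "http://localhost:8000/process/"

def process_audio(path, url=API_URL):
    """Upload a WAV file to /process/ and return the parsed timing JSON."""
    with open(path, "rb") as f:
        response = requests.post(url, files={"file": (path, f, "audio/wav")})
    response.raise_for_status()  # surface HTTP errors early
    return response.json()
```

Usage: `events = process_audio("your_audio.wav")`.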
The service generates JSON timing data with the following structure:
    [
      {
        "time": 0,
        "type": "viseme",
        "value": "sil"
      },
      {
        "time": 100,
        "type": "word",
        "value": "hello",
        "start": 100,
        "end": 500
      },
      {
        "time": 100,
        "type": "viseme",
        "value": "h"
      }
      // ... more visemes
    ]
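As a sketch of consuming this output, the helper below (not part of the repository) derives per-viseme hold durations from the event list, assuming events are sorted by their millisecond `time` field and that the clip's end time is known:

```python
def viseme_durations(events, clip_end_ms):
    """Return (viseme, duration_ms) pairs: each viseme is held until the
    next viseme event starts, or until the end of the clip."""
    visemes = [e for e in events if e["type"] == "viseme"]  # skip word events
    pairs = zip(visemes, visemes[1:] + [{"time": clip_end_ms}])
    return [(cur["value"], nxt["time"] - cur["time"]) for cur, nxt in pairs]

timing = [
    {"time": 0, "type": "viseme", "value": "sil"},
    {"time": 100, "type": "word", "value": "hello", "start": 100, "end": 500},
    {"time": 100, "type": "viseme", "value": "h"},
]
print(viseme_durations(timing, 500))  # → [('sil', 100), ('h', 400)]
```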
The system uses the viseme mappings from Amazon Polly's UK English phoneme-to-viseme table: https://docs.aws.amazon.com/polly/latest/dg/ph-table-english-uk.html
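The shape of such a mapping can be sketched as a dictionary. The entries below are only a small illustrative subset chosen for phoneme groups that share a mouth shape; consult the linked table for the full, authoritative mapping:

```python
# Illustrative subset of a phoneme-to-viseme table (NOT the full mapping).
PHONEME_TO_VISEME = {
    "p": "p", "b": "p", "m": "p",  # bilabials share the closed-lips viseme
    "f": "f", "v": "f",            # labiodentals
    "s": "s", "z": "s",            # sibilants
}

def to_viseme(phoneme):
    """Look up a phoneme's viseme, falling back to silence if unknown."""
    return PHONEME_TO_VISEME.get(phoneme, "sil")

print(to_viseme("b"))  # → p
```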