Skip to content

Idea: Pre-processing text with an LLM before audiobook generation #95

@hebbihebb

Description

@hebbihebb

Hey there Deniz and team,

I’ve been having a great time using Abogen and wanted to share how it fits into my little audiobook production chain.

My main use case is taking webnovel EPUBs, converting them to Markdown, and then turning them into TTS-ready markdown files. Before sending them to Abogen, I run the text through a small side app I built called TTS-Proof github.com/hebbihebb/tts-proof. It uses a local LLM to clean up the text for spelling, grammar, and TTS safety, fixing things like stylized letters or awkward capitalization that can confuse voices such as Kokoro or XTTS.

After the cleanup, I pass the corrected Markdown to Abogen, which handles the audio generation beautifully. That workflow made me wonder if a similar LLM preprocessing step might make sense as an optional feature inside Abogen itself. It could automatically tidy up rough EPUBs, fanfics, or webnovel text before TTS synthesis.

This isn’t a feature request, just an idea to think about. If Abogen eventually adds something like this internally, my helper app would be redundant, and I’d honestly be happy to see that happen.

Thanks for all your work on Abogen. The Markdown to audiobook pipeline has become incredibly smooth lately.

— Hebbi

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions