Idea: Pre-processing text with an LLM before audiobook generation

Hey there Deniz and team,

I’ve been having a great time using Abogen and wanted to share how it fits into my little audiobook production chain.

My main use case is taking webnovel EPUBs, converting them to Markdown, and then turning them into TTS-ready markdown files. Before sending them to Abogen, I run the text through a small side app I built called TTS-Proof [github.com/hebbihebb/tts-proof](https://github.com/hebbihebb/tts-proof). It uses a local LLM to clean up the text for spelling, grammar, and TTS safety, fixing things like stylized letters or awkward capitalization that can confuse voices such as Kokoro or XTTS.

After the cleanup, I pass the corrected Markdown to Abogen, which handles the audio generation beautifully. That workflow made me wonder if a similar LLM preprocessing step might make sense as an optional feature inside Abogen itself. It could automatically tidy up rough EPUBs, fanfics, or webnovel text before TTS synthesis.

This isn’t a feature request, just an idea to think about. If Abogen eventually adds something like this internally, my helper app would be redundant, and I’d honestly be happy to see that happen.

Thanks for all your work on Abogen. The Markdown to audiobook pipeline has become incredibly smooth lately.

— Hebbi

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Idea: Pre-processing text with an LLM before audiobook generation #95

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Idea: Pre-processing text with an LLM before audiobook generation #95

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions