v1.1.0: Intelligent Text Chunking

@TSavo released this 31 May 08:04

🎯 Major Feature Release: Intelligent Text Chunking

🚀 New Features

📏 Smart Text Chunking System

  • 3-tier chunking strategy, selected by the estimated audio duration of the text:
    • Under 25s: No chunking (optimal quality)
    • 25-40s: Gentle chunking at natural boundaries
    • Over 40s: Aggressive but intelligent chunking
  • Respects natural language boundaries: paragraphs, sentences, clauses
  • Seamless audio concatenation using ffmpeg
  • Voice cloning consistency across chunks
  • Production-ready error handling and cleanup
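
The tier selection above can be sketched as a small function. This is a minimal illustration, not the project's actual code: the names `estimate_duration` and `choose_strategy` are hypothetical, the thresholds are the ones listed above, the rate of roughly 12 characters per second is the figure given under Technical Implementation, and the handling of texts that land exactly on 25s or 40s is an assumption.

```python
CHARS_PER_SECOND = 12       # approximate rate quoted in these release notes
NO_CHUNK_MAX_S = 25.0       # below this estimate: synthesize in one pass
GENTLE_CHUNK_MAX_S = 40.0   # up to this estimate: gentle chunking


def estimate_duration(text: str) -> float:
    """Estimate speech duration in seconds from the character count."""
    return len(text.strip()) / CHARS_PER_SECOND


def choose_strategy(text: str) -> str:
    """Return 'none', 'gentle', or 'aggressive' (hypothetical tier labels)."""
    seconds = estimate_duration(text)
    if seconds < NO_CHUNK_MAX_S:
        return "none"
    # Assumption: the 25s and 40s boundaries both fall in the gentle tier.
    if seconds <= GENTLE_CHUNK_MAX_S:
        return "gentle"
    return "aggressive"
```

Under these assumptions a 360-character text is estimated at 30 seconds and falls in the gentle tier, while a 600-character text (about 50 seconds) falls in the aggressive tier.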

🎵 What This Enables

Before v1.1.0:

  • ❌ Long texts failed or produced poor quality
  • ❌ Users had to manually split text
  • ❌ Inconsistent results with long content

After v1.1.0:

  • Text of any length works automatically
  • Professional quality maintained
  • Seamless user experience
  • Production-grade reliability

🔧 Technical Implementation

  • Duration estimation: ~12 characters per second
  • Intelligent break points: Paragraphs → Sentences → Clauses → Words
  • Audio concatenation: Chunk audio joined into a single file with ffmpeg
  • Memory efficient: Sequential chunk processing
  • Configurable thresholds: Easy to adjust for different use cases
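
The break-point order above (paragraphs, then sentences, then clauses, then words) could look roughly like the sketch below. It is an assumption-laden illustration rather than the release's implementation: `split_text`, `BOUNDARIES`, the regular expressions, and the 25-second default budget are all hypothetical choices, and pieces are rejoined with single spaces, so paragraph breaks inside a chunk are not preserved.

```python
import re

CHARS_PER_SECOND = 12  # approximate rate quoted in these release notes

# Separators tried in order, from the coarsest boundary to the finest.
BOUNDARIES = [
    r"\n\s*\n",         # paragraphs
    r"(?<=[.!?])\s+",   # sentences
    r"(?<=[,;:])\s+",   # clauses
    r"\s+",             # words
]


def split_text(text: str, max_seconds: float = 25.0, level: int = 0) -> list[str]:
    """Split text into chunks whose estimated duration fits max_seconds."""
    max_chars = int(max_seconds * CHARS_PER_SECOND)
    text = text.strip()
    # Short enough, or no finer boundary left: return as a single chunk.
    # (A single word longer than the budget is returned oversized.)
    if len(text) <= max_chars or level >= len(BOUNDARIES):
        return [text] if text else []

    pieces = [p.strip() for p in re.split(BOUNDARIES[level], text) if p.strip()]
    chunks: list[str] = []
    current = ""
    for piece in pieces:
        candidate = f"{current} {piece}" if current else piece
        if len(candidate) <= max_chars:
            current = candidate          # keep packing pieces into this chunk
            continue
        if current:
            chunks.append(current)       # close the chunk that is already full
            current = ""
        if len(piece) <= max_chars:
            current = piece              # start a new chunk with this piece
        else:
            # The piece alone is too long: retry at the next finer boundary.
            chunks.extend(split_text(piece, max_seconds, level + 1))
    if current:
        chunks.append(current)
    return chunks
```

Processing the returned chunks one at a time matches the sequential, memory-efficient approach described above; each chunk's audio would then be synthesized and joined with ffmpeg.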

🐳 Docker Images

# Latest with chunking
docker pull tsavo/chatterbox-tts-api:latest
docker pull tsavo/chatterbox-tts-api:v1.1.0
docker pull tsavo/chatterbox-tts-api:v1.1
docker pull tsavo/chatterbox-tts-api:v1

📖 Usage Examples

Long Text Processing (Now Supported!)

import requests

# This now works seamlessly with any length text!
long_text = """Your very long text here...
Multiple paragraphs...
Complex sentences...
All handled automatically!"""

response = requests.post("http://localhost:8000/tts", json={
    "text": long_text,
    "output_format": "mp3"
})
# Stop on an HTTP error instead of writing an error body to disk
response.raise_for_status()

# Get your MP3 file - chunking handled transparently!
with open("output.mp3", "wb") as f:
    f.write(response.content)

🎯 Breaking Changes

  • None! Fully backward compatible
  • Existing API calls work exactly the same
  • Chunking is transparent to users

🔄 Migration

  • No migration needed
  • Simply update to v1.1.0
  • Long texts that previously failed now work automatically

This release transforms Chatterbox TTS from a demo tool into a production-ready service capable of handling real-world text processing needs! 🚀