Skip to content

BurkimbIA/toolskit-tts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Table of Contents

  1. Introduction
  2. Our Datasets
  3. Working with Audio Datasets
  4. Transformers for Audio
  5. Saving Steps to Drive Path
  6. References & Further Reading

Introduction

Toolskit-TTS is a comprehensive toolkit designed to facilitate research and development in Text-to-Speech (TTS) systems. It provides resources, scripts, and best practices for working with TTS datasets, training models, and evaluating results. The toolkit supports multilingual and multi-speaker scenarios, making it suitable for a wide range of TTS research projects.

Our Datasets

Our current dataset consists of approximately 50 hours of audio sourced from Bible JW and Mooreburkina. These audio files have undergone preprocessing steps such as denoising and enhancement. For more details, visit:

s3://burkimbia/audios/final_dataset

Working with Audio Datasets

Working with audio datasets for TTS involves several challenges, including alignment, diversity, and quality. For a detailed guide on audio preprocessing and dataset creation, refer to this blog:

Audio Dataset Creation Guide

Transformers for Audio

Transformers have revolutionized the field of audio processing, enabling advanced capabilities in TTS. Key resources for understanding and implementing audio transformers include:

These resources provide a comprehensive overview of transformer architectures and their applications in audio tasks.

Saving Steps to Drive Path

To save outputs or intermediate steps to a specific drive path, follow these instructions:

  1. Specify the Drive Path

    • Ensure the drive path is accessible and has sufficient storage.
    • Example: D:\TTS_Outputs\
  2. Modify Scripts

    • Update the output directory in relevant scripts or configuration files.
    • Example in Python:
      output_path = "D:\TTS_Outputs\"
      save_to_path(output_path, data)
  3. Automate Saving

    • Use logging or checkpointing mechanisms to save intermediate results.
    • Example:
      def save_checkpoint(model, path):
          torch.save(model.state_dict(), path)
      save_checkpoint(model, "D:\TTS_Outputs\checkpoint.pt")
  4. Verify Saved Files

    • Check the drive path to ensure files are saved correctly.
    • Use tools like os.listdir() to list saved files.

References & Further Reading

Explore the following references for additional insights and tools:

About

Inference and training library for high-quality TTS models.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 15