Skip to content

Krish0909/Audio-Synthesis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

Accent Conversion Using Deep Learning

This repository contains the implementation of an Accent Conversion system, which transforms speech from one accent to another while preserving the speaker's identity. The project utilizes advanced deep learning techniques to achieve high-quality and natural-sounding accent conversion.

Table of Contents

Overview

Accent conversion is a challenging task in the field of speech synthesis. This project aims to:

  • Convert speech from one accent to another while retaining the speaker's unique voice characteristics.
  • Provide tools and models for high-quality accent conversion.

Features

  • Speaker preservation: Maintains the speaker's original voice.
  • Multiple accents: Supports conversion between multiple accents.
  • Modular design: Easy to extend and customize for new datasets or architectures.

Installation

  1. Clone the repository:

    git clone https://github.com/yourusername/accent-conversion.git
    cd accent-conversion
  2. Install dependencies:

    pip install -r requirements.txt
  3. (Optional) Set up GPU acceleration for faster training and inference.

Usage

  1. Preprocess your dataset:

    python preprocess.py --data_dir <path_to_dataset>
  2. Train the model:

    python train.py --config config.yaml
  3. Perform inference:

    python infer.py --input <input_audio_path> --output <output_audio_path>
  4. Evaluate the model's performance:

    python evaluate.py --results_dir <path_to_results>

Dataset

Model Architecture

The core model is based on a neural network architecture with the following components:

  • Encoder-Decoder: Captures the content of speech while normalizing accent.
  • Style Transfer Module: Transfers accent features from target to source.
  • Adversarial Loss: Ensures the converted speech matches the target accent.

Contributing

Contributions are welcome! To contribute:

  1. Fork the repository.
  2. Create a feature branch.
  3. Submit a pull request.

Please read the Contributing Guidelines for more details.

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

Accent Conversion using Transfer Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published