We are the first to identify neural vocoders as a source of features to expose synthetic human voices.
The artifacts introduced by each of the six vocoders, compared to the original audio, are shown below:
We provide LibriSeVoC as a dataset of self-vocoding samples created with six state-of-the-art vocoders to highlight and exploit the vocoder artifacts.
The composition of the dataset is shown in the following table.

The ground-truth audio in our dataset comes from LibriTTS, so we follow the naming convention of LibriTTS.
For example, in `27_123349_000006_000000.wav`:
- `27` is the reader's ID
- `123349` is the ID of the chapter
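The naming convention above can be parsed mechanically. The following is a minimal sketch (the helper name `parse_librisevoc_name` is our own, not part of the repository) that extracts the two fields described above from a sample filename:

```python
def parse_librisevoc_name(filename: str) -> dict:
    """Split a LibriSeVoc/LibriTTS-style filename into its ID fields.

    e.g. "27_123349_000006_000000.wav" -> reader ID "27", chapter ID "123349".
    """
    stem = filename.rsplit(".", 1)[0]          # drop the ".wav" extension
    parts = stem.split("_")                    # fields are underscore-separated
    return {"reader_id": parts[0], "chapter_id": parts[1]}


print(parse_librisevoc_name("27_123349_000006_000000.wav"))
# {'reader_id': '27', 'chapter_id': '123349'}
```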
We propose a new approach to detecting synthetic human voices by:
- Exposing signal artifacts left by neural vocoders
- Modifying and improving the RawNet2 baseline by adding a multi-loss training objective
✅ This lowers the error rate from 6.10% to 4.54% on the ASVspoof Dataset.
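To make the multi-loss idea concrete, here is a small NumPy sketch of one plausible formulation: a binary real/fake cross-entropy combined with an auxiliary vocoder-identification cross-entropy over the six vocoder classes. The function names, shapes, and the weighting factor `alpha` are illustrative assumptions, not the exact training code of this repository:

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over the last axis.
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def cross_entropy(logits, labels):
    # Mean negative log-likelihood of the true class.
    p = softmax(logits)
    return -np.mean(np.log(p[np.arange(len(labels)), labels] + 1e-12))

def multi_loss(binary_logits, binary_labels, voc_logits, voc_labels, alpha=0.5):
    # Binary real/fake loss plus a weighted vocoder-identification loss
    # (alpha is a hypothetical balancing weight).
    return (cross_entropy(binary_logits, binary_labels)
            + alpha * cross_entropy(voc_logits, voc_labels))

# Toy example: one sample, correct vs. incorrect predictions.
low = multi_loss(np.array([[5.0, 0.0]]), np.array([0]),
                 np.array([[5.0, 0.0, 0.0]]), np.array([0]))
high = multi_loss(np.array([[0.0, 5.0]]), np.array([0]),
                  np.array([[0.0, 5.0, 0.0]]), np.array([0]))
```

The auxiliary vocoder term pushes the network to encode which vocoder produced a sample, which is the kind of signal-level artifact the detector is meant to exploit.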
Here is the framework of the proposed synthesized voice detection method:

📘 Paper:
AI-Synthesized Voice Detection Using Neural Vocoder Artifacts – CVPRW 2023
📦 Dataset:
Download LibriSeVoc
Train a model:

```shell
python main.py --data_path /your/path/to/LibriSeVoc/ --model_save_path /your/path/to/models/
```

Evaluate a single audio sample:

```shell
python eval.py --input_path /your/path/to/sample.wav --model_path /your/path/to/your_model.pth
```
Download the trained model weights from the link below:
https://drive.google.com/file/d/15qOi26czvZddIbKP_SOR8SLQFZK8cf8E/view?usp=sharing
You can test audio samples live on our lab's Deepfake O Meter platform:
https://zinc.cse.buffalo.edu/ubmdfl/deep-o-meter/landing_page
This repository is licensed under the MIT License.
You are free to use, modify, and distribute the code with proper attribution.