MLSA Neural Vocoder

Training code for the MLSA neural vocoder described in this article.

Usage

Python 3.10 or later is required. Install a PyTorch 2 build that matches your environment beforehand.

pip install -r requirements.txt
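
Before installing the remaining dependencies, it can help to confirm that the two prerequisites above are met. The following is a minimal sketch, not part of this repository: the function names check_environment and torch_major_version are hypothetical, and it assumes PyTorch was installed under the distribution name torch.

```python
import sys
from importlib import metadata


def torch_major_version(version: str) -> int:
    """Extract the major version from a string such as '2.1.0+cu118'."""
    return int(version.split(".")[0])


def check_environment(python_version=None, torch_version=None):
    """Return a list of problems; an empty list means the requirements are met.

    Both arguments default to the values detected in the running interpreter.
    """
    if python_version is None:
        python_version = tuple(sys.version_info[:2])
    if torch_version is None:
        try:
            # Assumption: PyTorch is installed under the distribution name "torch".
            torch_version = metadata.version("torch")
        except metadata.PackageNotFoundError:
            torch_version = ""  # treated as "not installed" below

    problems = []
    if tuple(python_version) < (3, 10):
        problems.append("Python 3.10 or later is required")
    if not torch_version:
        problems.append("PyTorch is not installed")
    elif torch_major_version(torch_version) != 2:
        problems.append(f"PyTorch 2 is expected, found {torch_version}")
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print("WARNING:", problem)
```

Running the script prints nothing when both requirements are satisfied.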

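Configuration file

The config directory contains sample configuration files. Adjust parameters such as data_path, preprocessed_path, and log_dir as needed to use them for preprocessing and training.

data_path: the directory containing the wav files to use for training.
preprocessed_path: the directory in which to store the preprocessed data.
log_dir: the directory in which to save training logs (TensorBoard data) and checkpoints.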
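
The role of the three path settings can be illustrated with a small sanity check to run before preprocessing. This is only a sketch under stated assumptions: the PathConfig class is hypothetical and is not the repository's own config loader, it assumes the three keys are available in a plain dictionary, and it assumes the wav files sit directly inside data_path rather than in subdirectories.

```python
from dataclasses import dataclass
from pathlib import Path

# The three path parameters named in this section.
PATH_KEYS = ("data_path", "preprocessed_path", "log_dir")


@dataclass
class PathConfig:
    """Hypothetical container for the path settings (not the repository's class)."""

    data_path: Path
    preprocessed_path: Path
    log_dir: Path

    @classmethod
    def from_dict(cls, raw: dict) -> "PathConfig":
        """Build the config from a dict, failing early on missing keys."""
        missing = [key for key in PATH_KEYS if key not in raw]
        if missing:
            raise KeyError(f"missing config keys: {', '.join(missing)}")
        return cls(*(Path(raw[key]) for key in PATH_KEYS))

    def prepare(self) -> int:
        """Create the output directories and return the number of wav files found."""
        if not self.data_path.is_dir():
            raise FileNotFoundError(f"data_path does not exist: {self.data_path}")
        self.preprocessed_path.mkdir(parents=True, exist_ok=True)
        self.log_dir.mkdir(parents=True, exist_ok=True)
        # Assumption: wav files are looked up non-recursively.
        return len(list(self.data_path.glob("*.wav")))
```

A return value of 0 from prepare() would suggest that data_path points at the wrong directory.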

Preprocessing

python preprocessor.py <config file>

For long audio (such as singing-voice data), use the -s or --split option.

python preprocessor.py <config file> -s
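
The command-line shape shown above (one positional config file plus an optional -s/--split flag) could be expressed with argparse as follows. This is a sketch of the interface only, not the actual contents of preprocessor.py: the build_parser function and the help strings are assumptions, and what the script does once the flag is set is not shown.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring `preprocessor.py <config file> [-s]`."""
    parser = argparse.ArgumentParser(prog="preprocessor.py")
    # Positional argument: path to the configuration file.
    parser.add_argument("config", help="path to the config file")
    # Boolean flag: False unless -s or --split is given.
    parser.add_argument(
        "-s",
        "--split",
        action="store_true",
        help="split long recordings (e.g. singing voice) during preprocessing",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(f"config={args.config} split={args.split}")
```

Because the flag uses store_true, omitting it leaves split set to False, which matches the default invocation without -s.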

Training

python train.py <config file>

Sample audio is output to TensorBoard.

License

Released under the MIT License.

Repository: DwangoMediaVillage/mlsa_neural_vocoder
