This repository contains the official codebase accompanying the AIMC 2025 paper:
"Learning Relationships between Separate Audio Tracks for Creative Applications"
Bujard et al., 2025
Audio examples can be found at https://ircam-ismm.github.io/MoisesDB-audio-examples/
This project explores learning-based approaches to model relationships between separate audio tracks, enabling creative applications such as symbolic generation and guided audio synthesis.
The repository includes all the code necessary to reproduce the results presented in the paper, except for:

- The pretrained Wav2Vec 2.0 model trained on music, introduced in Ragano et al., 2023.
  → Please contact the authors of that paper directly to obtain access to the model weights.
- The MICA dataset, which is proprietary and not publicly available.
  → As a result, only experiments using MoisesDB can be reproduced with the current repository.
To facilitate usage, three tutorial scripts are provided; a sketch of the full workflow follows the list:

- `train_model.py`: train the Decision module on a pair of audio tracks.
- `use_decision.py`: generate a symbolic specification (e.g., structure or timing) from an audio input using a trained Decision module.
- `generate_audio.py`: given a guide track, a memory track, and a trained model, generate a response audio track conditioned on the guide.
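Taken together, the scripts form a train → specify → generate pipeline. The sketch below chains them with Python's `subprocess` module; every command-line flag and file path in it is a hypothetical placeholder (the scripts' actual argument names may differ), so consult each script's argument parser before running anything.

```python
# Minimal end-to-end sketch of the tutorial workflow.
# NOTE: every flag and path below is a hypothetical placeholder;
# check each script's own argument parser for the real interface.
import subprocess

# 1. Train the Decision module on a pair of separate audio tracks.
subprocess.run(
    ["python", "train_model.py",
     "--guide", "data/guitar.wav",        # hypothetical guide track
     "--memory", "data/bass.wav",         # hypothetical memory track
     "--checkpoint", "runs/decision.pt"],
    check=True,
)

# 2. Generate a symbolic specification from a new guide recording.
subprocess.run(
    ["python", "use_decision.py",
     "--guide", "data/new_guitar.wav",
     "--checkpoint", "runs/decision.pt",
     "--out", "runs/specification.json"],
    check=True,
)

# 3. Synthesize a response track conditioned on the guide,
#    drawing material from the memory track.
subprocess.run(
    ["python", "generate_audio.py",
     "--guide", "data/new_guitar.wav",
     "--memory", "data/bass.wav",
     "--checkpoint", "runs/decision.pt",
     "--out", "runs/response.wav"],
    check=True,
)
```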
If you use this code in your work, please cite:

```bibtex
@inproceedings{bujard2025relationships,
  title={Learning Relationships between Separate Audio Tracks for Creative Applications},
  author={Bujard, Balthazar and Nika, Jérôme and Obin, Nicolas and Bevilacqua, Frédéric},
  booktitle={Proceedings of the 6th Conference on AI Music Creativity (AIMC 2025)},
  year={2025}
}
```