Authors
Mirjam Visscher and Frans Wiering
Information and Computing Sciences, Utrecht University, The Netherlands
Project CANTOSTREAM
Fuzzy Frequencies is a framework to analyse the pitch content of symbolic encodings and multiple f0 estimations. Multiple f0 estimators are algorithms that extract the fundamental frequencies of multiple voices and instruments in an audio recording.
The method is introduced and described in the article:
Visscher, M., & Wiering, F. (2025). Fuzzy Frequencies: Finding Tonal Structures in Audio Recordings of Renaissance Polyphony. Heritage, 8(5), 164. https://doi.org/10.3390/heritage8050164
FuzzyFrequencies
├── data
│ ├── processed
│ └── raw
│ ├── CANTO-JRP
│ ├── experiment_template
│ └── Palestrina
├── results
│ ├── figures
│ └── output
└── src
List of dependencies
python 3.10
scipy
numpy
matplotlib
pandas
seaborn
scikit-learn
statsmodels
librosa
music21
pydub
essentia
scikit-posthocs
To install the package on a local computer, please follow these steps:
- Clone the repository:
  git clone https://github.com/MirjamVisscher/FuzzyFrequencies.git
- Navigate into the project directory:
  cd FuzzyFrequencies
- Create the Conda environment from the provided YAML file:
  conda env create -f fuzzy_environment.yml
- Activate the environment:
  conda activate fuzzy
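For reference, a minimal sketch of what such an environment file could look like, assuming it simply pins Python and lists the dependencies above (the fuzzy_environment.yml shipped with the repository is authoritative; channel choices and the pip split are assumptions):

    name: fuzzy
    channels:
      - conda-forge
    dependencies:
      - python=3.10
      - scipy
      - numpy
      - matplotlib
      - pandas
      - seaborn
      - scikit-learn
      - statsmodels
      - librosa
      - music21
      - pydub
      - scikit-posthocs
      - pip
      - pip:
          - essentia   # typically installed via pip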
The project workflow is partially covered by the code in this repository (see the purple boxes in Figure 1). The creation of the dataset was largely manual work, whereas the pitch extraction was done by large models that are not integrated in this repository.
Figure 1: Project workflow. The purple boxes are covered by the code in this repository.
The analysis is currently implemented for the following extraction types:
- Symbolic encodings: music encoded with a finite alphabet to make it computer-readable. In this experiment, the MusicXML format has been used; the encodings are downloaded from The Josquin Research Project.
- Multiple f0 estimations by four different algorithms:
  - The Basicpitch extractions are created by applying the model by Bittner et al. (2022) [1] to the set of audio recordings. The implementation can be found at https://github.com/spotify/basic-pitch
  - The Multipitch extractions are created by applying the model by Weiß and Müller (2024) [5] to the set of audio recordings, with models 214c and 195f. The implementation can be found at https://github.com/christofw/multipitch_architectures
  - The Multif0 extractions are created by applying the model by Cuesta et al. (2020) [2] to the audio files. The original implementation is based on Tensorflow 1 and can be found at https://github.com/helenacuesta/multif0-estimation-polyvocals. An implementation based on Tensorflow 2 can be found at https://github.com/MirjamVisscher/multif0-estimation-polyvocals-tf2, with many thanks to Sebastian Stober, who migrated the code from tf1 to tf2!
And two other methods for pitch extraction:
- CQT is the Constant-Q Transform [6], which forms the input of many multiple f0 estimators. It is the only method that does not filter out harmonics; it is fast, but not very effective. It is implemented in Librosa as chroma_cqt (see the sketch after this list).
- HPCP is a fast method to extract harmonic pitch class profiles, developed by Gómez [4] and implemented in the Essentia toolbox as HPCP.
One method is under development:
- The MT3 extractions are created by transcribing the audio files smaller than ~110 MB using the Colab notebook provided by Gardner et al. (2022) [3]. The implementation can be found at https://github.com/magenta/mt3.
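To make the two sides of the comparison concrete, here is a minimal sketch (not the repository's code) of computing a pitch class profile from a symbolic encoding with music21 and from audio via the CQT-based chroma mentioned above; file names are placeholders:

    import librosa
    import numpy as np
    from music21 import converter

    # Symbolic PCP: weight each pitch class by its total duration
    score = converter.parse("piece.musicxml")        # placeholder file name
    pcp_symbolic = np.zeros(12)
    for n in score.recurse().notes:                  # Notes and Chords
        for p in n.pitches:
            pcp_symbolic[p.pitchClass] += float(n.quarterLength)
    pcp_symbolic /= pcp_symbolic.sum()

    # Audio PCP via chroma_cqt, as in the CQT item above
    y, sr = librosa.load("recording.wav")            # placeholder file name
    chroma = librosa.feature.chroma_cqt(y=y, sr=sr)  # shape (12, n_frames)
    pcp_audio = chroma.sum(axis=1)
    pcp_audio /= pcp_audio.sum()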
The repository provides the following classes and utility modules:

Experiment
    __init__()
    __str__()
    check_zero_voices()
    create_finals()
    distances()
    make_midi_mf0s()
    piano_rolls()
    plot_cycles()
    repair_basicpitch_files()
    repair_multif0_files()
    save_profiles()
    show_pcp()
    show_pp()
    sonify_multif0()

SymbolicFile
    __init__()
    __str__()
    count_notes()
    final_chord()
    final_midi()
    final_name()
    partinfo()
    pitch_class_profile()
    pitch_profile()

BasicpitchExtraction
    __init__()
    __str__()
    final_midi()
    final_name()
    lowest_final()
    piano_roll()
    pitch_class_profile()
    pitch_profile()

Multif0Extraction
    __init__()
    __str__()
    cadence_pattern()
    concert_pitch()
    final_midi()
    final_name()
    lowest_final()
    make_midi_mf0()
    piano_roll()
    piano_roll_hist()
    pitch_class_profile()
    pitch_deviation()
    pitch_profile()
    sonify()

MultipitchExtraction
    __init__()
    __str__()
    final_midi()
    final_name()
    lowest_final()
    piano_roll()
    pitch_class_profile()
    pitch_profile()

MT3Extraction
    __init__()
    final_midi()
    final_name()
    lowest_final()
    piano_roll()
    pitch_class_profile()
    pitch_profile()

Composition
    __init__()
    __str__()
    pitch_class_profile()
    pitch_profile()
    show_pitch_class_profile()
    show_pitch_class_profile_paper()
    show_pitch_profile()

Recording
    __init__()
    __str__()
    create_pitch_class_profiles()
    pitch_profile()
    show_pitch_class_profile()
    show_pitch_profile()

Audio
    __init__()
    __str__()
    chroma()
    cqt_pp()
    hpcp()
    show_chroma()

Cycle
    __init__()
    __str__()
    get_profile()
    plot_combined_pitch_class_profiles()
    plot_combined_pitch_class_profiles_all_modes()
    plot_combined_pitch_class_profiles_all_modes_2col()
    plot_combined_pitch_class_profiles_multiple()
    plot_combined_pitch_class_profiles_single()

ToneConstants

Tones

Constants

utils_cluster.py
    adjusted_mutual_info_score()
    adjusted_rand_score()
    cdist()
    cluster()
    create_clusters_with_evaluation()
    create_clusters_with_subplots()
    create_subplots()
    evaluate_clusters()
    get_chromatypes()
    get_profiles()
    n_clusters()
    normalized_mutual_info_score()
    perform_clustering()
    plot_clusters()
    reduce_dimensionality()
    silhouette_score()

utils_multif0.py
    apply_fade_in_out()
    binary_dilation()
    butter()
    fill_gaps()
    lfilter()
    lowpass_filter()

utils_performance_effect.py
    create_metrics()
    plot_exploration()
    plot_metrics()
    regression_analysis()

utils.py
    earth_movers_distance()
    euclidean_distance()
    get_csv_length()
    get_frequency()
    get_miditone()
    get_pitch_class()
    get_pitch_class_name()
    get_pitch_name()
    get_wav_length()
    manhattan_distance()
    phase()
    repair_frequency_file()
    squared_distance()
    symmetric_kl_divergence()
    transpose()
    wasserstein_distance()

utils_paper.py
    _get_final_pitch()
    piano_roll_bp()
    piano_roll_mp()
    pitch_class_profile_fuzzypaper()
    pitch_profile_fuzzypaper()

utils_stats.py
    get_distances()
    get_statistics()
    kruskal()
    kruskal_wallis_test()
    mann_whitney_test()
    mannwhitneyu()
    perform_dunn_test()
    visualise_distances()
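To illustrate the distance measures listed under utils.py, here is a small sketch comparing two pitch class profiles (semantics inferred from the function names; these are not the repository's implementations):

    import numpy as np
    from scipy.stats import wasserstein_distance

    # Two hypothetical normalised 12-bin pitch class profiles (C, C#, ..., B)
    rng = np.random.default_rng(0)
    pcp_a = rng.random(12); pcp_a /= pcp_a.sum()
    pcp_b = rng.random(12); pcp_b /= pcp_b.sum()

    manhattan = np.abs(pcp_a - pcp_b).sum()
    euclidean = np.linalg.norm(pcp_a - pcp_b)
    # Earth mover's distance over the 12 bins (treats the scale as linear,
    # ignoring pitch class circularity)
    emd = wasserstein_distance(np.arange(12), np.arange(12), pcp_a, pcp_b)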
- Download the CANTO-JRP Dataset from Zenodo. Extract the compressed folders into your clone of Fuzzy Frequencies, into the folder FuzzyFrequencies/data/raw/CANTO-JRP/.
- Activate the conda environment fuzzy:
  conda activate fuzzy
- Execute the main script in the command line:
  python3 main.py CANTO-JRP --compute_results

Please note that this computation can be heavy on your machine. If you want to execute the steps separately instead, open the file main.py and execute the desired steps within main(experiment_name, new_experiment, compute_results, visualise, midi_creation) separately.
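Based on the main(...) signature quoted above, a hypothetical way to run a single step from Python rather than the command line (argument values are illustrative; check main.py for the actual interface):

    from main import main  # assumes main.py exposes main() as quoted above

    # Equivalent in spirit to: python3 main.py CANTO-JRP --compute_results
    main(experiment_name='CANTO-JRP',
         new_experiment=False,
         compute_results=True,
         visualise=False,
         midi_creation=False)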
- For testing purposes, the experiment 'Palestrina' has been included. This experiment contains the 8 madrigals from the Vergine cycle by Palestrina. Due to its small size, you will get an error for the clustering algorithms, but the other results should be fine.
  python3 main.py Palestrina --compute_results
You will find the results in /results/output/[experiment_name]/statistics/ and /results/output/[experiment_name]/finals/.

To create visualisations (piano rolls and pitch (class) histograms), run:
python3 main.py [experiment_name] --visualise
You will find the results in /results/figures/[experiment_name]/piano_roll/, /results/figures/[experiment_name]/pp/ and /results/figures/[experiment_name]/pcp/.
The Multif0 output can be hard to interpret because the values are estimated frequencies in 20 cent bins. To create similar files, but with MIDI tones instead of frequencies, run:
python3 main.py [experiment_name] --midi_creation
You will find the results in /data/processed/[experiment_name]/midi_mf0/.
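The underlying conversion is the standard equal-tempered mapping from frequency to MIDI tone (440 Hz maps to MIDI 69); a minimal sketch, independent of the repository's get_miditone() implementation:

    import numpy as np

    def freq_to_midi(f_hz):
        # 12-tone equal temperament: 440 Hz -> MIDI 69, 12 semitones per octave
        return 69 + 12 * np.log2(f_hz / 440.0)

    print(freq_to_midi(440.0))  # 69.0
    # 20 cent bins correspond to steps of 0.2 on this MIDI scale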
To create your own experiment, take the following steps, using the experiment_template in the repository:
- Get hold of MusicXML encodings and recordings.
- Apply the multiple f0 estimation algorithm(s) of your interest to the recordings.
- Fill out the experiment_metadata.csv.
- Add the extractions to the corresponding folder in data/raw/[your new experiment].
- Execute the experiment:
  python3 main.py [experiment name] --new_experiment --compute_results --visualise --midi_creation
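Hypothetically, assuming the template mirrors the extraction types listed earlier, the raw data folder for a new experiment could look like this (folder names are guesses; the experiment_template folder in data/raw is authoritative):

    data/raw/[your new experiment]
    ├── experiment_metadata.csv
    ├── symbolic
    ├── basicpitch
    ├── multipitch
    ├── multif0
    └── mt3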
Together with the code, the article releases the CANTO-JRP Dataset: a set of multiple f0 extractions of recorded compositions in the Josquin Research Project (JRP). The recordings are collected in the CANTO-JRP Spotify playlist. The order and numbering of the recordings in the playlist match the order in the data and metadata. The dataset is presented in a Zenodo repository.
Contributions are what make the open-source community an amazing place to learn, inspire, and create. Any contributions you make to FuzzyFrequencies are greatly appreciated.
To contribute to the FuzzyFrequencies project:
- Fork the Project
- Create your Feature Branch (git checkout -b feature/YourFeature)
- Commit your Changes (git commit -m 'Add YourFeature')
- Push to the Branch (git push origin feature/YourFeature)
- Open a Pull Request to the main branch of FuzzyFrequencies
This work is licensed under the MIT License.
Finally, if you use the code in a research project, please reference it as:
Visscher, M., & Wiering, F. (2025). Fuzzy Frequencies: Finding Tonal Structures in Audio Recordings of Renaissance Polyphony. Heritage, 8(5), 164. https://doi.org/10.3390/heritage8050164
@article{visscher2025fuzzy,
title = {Fuzzy Frequencies: Finding Tonal Structures in Audio Recordings of Renaissance Polyphony},
author = {Visscher, M. and Wiering, F.},
journal = {Heritage},
volume = {8},
number = {5},
pages = {164},
year = {2025},
doi = {10.3390/heritage8050164},
url = {https://doi.org/10.3390/heritage8050164}
}
The authors would like to thank Helena Cuesta and Christoph Weiß for helping us with their code; Sebastian Stober for upgrading Multif0 to Tensorflow 2; Jesse Rodin for help with the JRP; Christof van Nimwegen for statistical advice; Michel Maasdijk and Libio Gonsalvez Bras for help with the GPU cluster; and Léa Massé for advice on data management.
[1] Bittner, R.M.; Bosch, J.J.; Rubinstein, D.; Meseguer-Brocal, G.; Ewert, S. A lightweight instrument-agnostic model for polyphonic note transcription and multipitch estimation. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Singapore, 2022.
[2] Cuesta, H.; McFee, B.; Gómez, E. Multiple f0 estimation in vocal ensembles using convolutional neural networks. In Proceedings of the International Society for Music Information Retrieval (ISMIR), Montréal, Canada, 2020.
[3] Gardner, J.P.; Simon, I.; Manilow, E.; Hawthorne, C.; Engel, J. MT3: Multi-task multitrack music transcription. In Proceedings of the International Conference on Learning Representations (ICLR), 2022.
[4] Gómez, E. Tonal description of music audio signals. PhD thesis, Universitat Pompeu Fabra, Department of Information and Communication Technologies, Barcelona, Spain, 2006.
[5] Weiß, C.; Müller, M. From music scores to audio recordings: Deep pitch-class representations for measuring tonal structures. ACM Journal on Computing and Cultural Heritage 2024.
[6] Schörkhuber, C.; Klapuri, A. Constant-Q transform toolbox for music processing. In Proceedings of the Sound and Music Computing Conference (SMC), Barcelona, Spain, 2010; pp. 3–64.