This dataset contains a total of 411 pieces from eight artists’ performances of piano music downloaded from YouTube for training and evaluating symbolic-domain style classification.
The artists included are Richard Clayderman (pop), Yiruma (pop), Herbie Hancock (jazz), Ludovico Einaudi (contemporary), Hisaishi Joe (contemporary), Ryuichi Sakamoto (contemporary), Bethel Music (religious) and Hillsong Worship (religious).
The paired MIDI files are machine-transcribed by the piano transcription model proposed by Kong et al. (2021).
Please refer to MIDI-BERT for the work.
Additionally, the dataset can also be downloaded on Zenodo.
If you plan to use this dataset, please cite our paper.
@article{midibertpiano,
title={{MidiBERT-Piano}: Large-scale Pre-training for Symbolic Music Understanding},
author={Yi-Hui Chou and I-Chun Chen and Chin-Jui Chang and Joann Ching, and Yi-Hsuan Yang},
journal={arXiv preprint arXiv:2107.05223},
year={2021}
}