GitHub - lazell/jazz_music: Swing music genre classification project: classifies one of 5 swing-dance styles for a given jazz-era song.

Lindy List - Swing Dance Music Classifier

LindyList is a music classifier that analyzes a jazz-era song to determine the most suitable style(s) of swing dance from the five predominant styles:

Lindy Hop
Slow Swing/Drag Blues
Balboa
Charleston
St Louis and Collegiate Shag

Introduction

Auto-tagging music genre has been an active area of deep learning research in recent years. The LindyList, uses many of those same approaches, but instead classifies by the most suitable style of swing dance from the five most common styles danced today. The prediction(s) can be used as a feature to improve song discovery and quality of playlist and song recommendations.

The LindyList is for jazz-era music buffs, jazz musicians, DJs, performers, social swing dancers and film/TV sound design. Dance-style knowledge would typically be acquired from years of exposure to a variety of jazz music and dance styles. With this project, I intend to make swing music discovery more accessible and engaging from the dancer’s perspective.

The Dataset

The data set consists of 3000 unique songs, recorded between 1926-1959. Each was randomly selected and downloaded from Jazz On Line, a public domain jazz music website. Since labels for dance styles did not exist for this dataset, I manually labelled songs by dance style and annotating each song with notes on whether a song was swing danceable, whether it was particularly infectious, and whether the quality of the recording was good. Since I labelled the dataset myself, the dataset is undoubtedly biased toward my own in swing dancing interpretations. Out of the 3000 songs, 1376 were “swing-danceable,” and were used for training and validation testing.

The Approach

This project has two components: the Ensemble model and the Neural Net model. To build a baseline to model, I first took the Ensemble model approach. I extracted from each song harmonic and; percussive tempo, total beat count per song, 12 pitch prominence scores, and relative root mean square energy values. I then prototyped various machine learning models and compared for accuracy:

Random Forest Classifier
Gradient Boosted Classifier
K Nearest Neighbors

From there, I generated multiple 30-second samples of each song and generated the log-power mel-spectrogram arrays to use as training inputs for the artificial neural network models:

Convolutional Neural Nets
Convolutional Recurrent Neural Nets

In doing so, I discovered that certain classes were difficult to predict and eventually decided to remove the slow swing/drag blues category from the neural network models since it had too much variability in the sub-category.

Results

Ensemble Model Results

The ensemble model, which consisted of two differently optimized random forest classifiers and one k nearest neighbors classifier achieved 50% accuracy across 5 classes and 60.9% accuracy across 4 classes (when excluding slow swing /drag blues)

Neural Net Model Results

Using a Convolutional Recurrent Neural Network I was able to achieve 81% using a 4-layer deep convolutional recurrent neural network across the same 4 classes.

More importantly, the recall results for individual dance styles had increased to over 52%, with Charleston and Shag styles performing particularly well compared to the ensemble baseline.

By using a neural network model, I was able to reduce prediction time to approximately the tenth of the time it takes the ensemble model to predict (both times are inclusive of audio pre-processing).

Real World Testing

When testing LindyList performance with songs sampled from various online sources, I discovered that the effects of compression in newer music degraded the contrast in pitch amplitude of the mel-spectograms to sufficient degree to skew LindyList recommendations toward Lindy Hop, the more mid-tempo dance style.

Plans for the Future

I have plans to incorporate Slow Swing/Drag Blues category into the neural net by splitting the labels into different sub-genres of blues dancing, since the data exists in the dataset for this level of subcategorization.

To make LindyList more accessible, I plan to develop a mobile app to enable users to identify swing dance styles and suggest playlists of similar style songs from Jazz On Line.

In order for the app to be production ready, the plan is to experiment with different pre-processing techniques of the mel-spectrograms to counteract any compression effects and reduce noise in audio samples so that the model generalizes well to any source type (e.g. compressed online mp3s, live recordings from user device).

Watch out for this space!

Project Files

CRNN_model.py - Convolutional recurrent neural network (CRNN) model.
CRNN_validation_test.py - Validates saved trained CRNN model against test data and displays accuracy results.
RF_KNN_ensemble_model.py - Random Forest and KNN Ensemble model (baseline).
get_swing_prediction.py - Predicts swing dance style for a single song file (mp3) with the option to recommend a dance playlist.
playlist_generator.py - Generates a 30 minute playlist based on a dance style with songs in the dataset.
swing_band_generator.py - Just for Fun! Randomlt generates jazz-era style band names based on artist in the dataset.

References

https://pdfs.semanticscholar.org AUTOMATIC TAGGING USING DEEP CONVOLUTIONAL NEURAL NETWORKS - Keunwoo Choi, Gyorgy Fazekas, Mark Sandler
http://www.iaeng.org Automatic Musical Pattern Feature Extraction Using Convolutional Neural Network - Tom LH. Li, Antoni B. Chan and Andy HW. Chun
http://music.ece.drexel.edu Modeling Genre with Musical Attributes - MetLab
https://chatbotslife.com Finding The Genre of a Song Using Deep Learning - AI Odyssey
http://image-net.org/ ELU-Networks: Fast and Accurate CNN Learning on ImageNet - Johannes Kelper University Linz
https://github.com/drscotthawley/audio-classifier-keras-cnn Audio Classifier Keras using Convolutional Neural Networks - Scott Hawley
https://github.com/meetshah1995/crnn-music-genre-classification CRNN Music Genre Classification - Pragnesh Shah

Name		Name	Last commit message	Last commit date
Latest commit History 132 Commits
ec2_scripts		ec2_scripts
img		img
jazz_on_line		jazz_on_line
.DS_Store		.DS_Store
.gitignore		.gitignore
CNN_model_eval.py		CNN_model_eval.py
CRNN_model.py		CRNN_model.py
CRNN_validation_test.py		CRNN_validation_test.py
README.md		README.md
RF_KNN_ensemble_model.py		RF_KNN_ensemble_model.py
app.py		app.py
create_mel_spectrogram.py		create_mel_spectrogram.py
download_mp3.py		download_mp3.py
get_swing_prediction.py		get_swing_prediction.py
mp3_sampling.py		mp3_sampling.py
playlist_generator.py		playlist_generator.py
process_data_for_model.py		process_data_for_model.py
swing_band_generator.py		swing_band_generator.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Lindy List - Swing Dance Music Classifier

Introduction

The Dataset

The Approach

Results

Ensemble Model Results

Neural Net Model Results

Real World Testing

Plans for the Future

Watch out for this space!

Project Files

References

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

lazell/jazz_music

Folders and files

Latest commit

History

Repository files navigation

Lindy List - Swing Dance Music Classifier

Introduction

The Dataset

The Approach

Results

Ensemble Model Results

Neural Net Model Results

Real World Testing

Plans for the Future

Watch out for this space!

Project Files

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages