OrchestrAIte processes `.wav` audio input, extracts features, and identifies multiple instruments using a convolutional neural network (CNN).
- Accepts `.wav` audio files as input
- Uses log-mel spectrograms for feature extraction (see the sketch after this list)
- Multi-label CNN for instrument identification
- Web interface built with FastAPI and Streamlit
- Deployable via Docker and Google Cloud Run
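As a rough illustration of the feature-extraction step, a log-mel spectrogram can be computed with librosa. This is a minimal sketch; the sample rate and mel-band count are illustrative assumptions, not necessarily the project's actual settings.

```python
import numpy as np
import librosa

def wav_to_log_mel(path: str, sr: int = 44100, n_mels: int = 128) -> np.ndarray:
    """Load a .wav file and convert it to a log-scaled mel spectrogram."""
    y, _ = librosa.load(path, sr=sr, mono=True)   # resample and downmix to mono
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    return librosa.power_to_db(mel, ref=np.max)   # convert power to decibels
```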
Fork this repository, clone your fork, and install the dependencies in a virtual environment:

`pip install -r requirements.txt`
- Open a terminal and start the FastAPI server using Uvicorn: `uvicorn api.fast_api:app --reload`
- In a separate terminal, start the Streamlit application: `streamlit run interface/app.py`
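With the FastAPI server running, you can also query the prediction endpoint directly. The route and form-field name below are hypothetical; the actual ones are defined in `api/fast_api.py`.

```python
import requests

# Hypothetical route and field name; check api/fast_api.py for the real ones.
with open("sample.wav", "rb") as f:
    response = requests.post(
        "http://localhost:8000/predict",
        files={"file": ("sample.wav", f, "audio/wav")},
    )
print(response.json())  # e.g. per-instrument presence probabilities
```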
- Ensure Docker is running, then build and start the containers: `docker compose up --build`
- Open the Streamlit application in your browser: http://localhost:8501
- To stop and remove the containers: `docker compose down`
- Set up a Google Cloud Project and enable Cloud Run.
- Authenticate with Google Cloud.
- Build and push the Docker image to Artifact Registry.
- Deploy to Cloud Run.
- Update `API_URL` in `interface/app.py` with the deployed URL (see the sketch below).
- Test the deployment.
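A convenient pattern for the `API_URL` step (an assumption about the code; `interface/app.py` may simply hard-code the value) is to read the URL from an environment variable, so switching between local and deployed backends needs no code edits:

```python
import os

# Assumed pattern: fall back to the local FastAPI server
# when no deployed Cloud Run URL is provided.
API_URL = os.environ.get("API_URL", "http://localhost:8000")
```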
Input `.wav` files must meet the following requirements (a validation sketch follows the list):
- 32-bit PCM
- Mono
- 44.1 kHz sample rate
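One quick way to verify a file before uploading it is a check like the sketch below, which assumes the `soundfile` library; the project itself may validate input differently.

```python
import soundfile as sf

def meets_requirements(path: str) -> bool:
    """Check that a .wav file is 32-bit PCM, mono, 44.1 kHz."""
    info = sf.info(path)                 # reads the header only
    return (info.subtype == "PCM_32"
            and info.channels == 1
            and info.samplerate == 44100)
```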
User interface screenshots:
The training data comes from the MusicNet dataset on Kaggle, which is pre-split into training and test folders. Although MusicNet contains labels for 11 instruments in the training set, only 7 instruments are labeled in the test set. As a result, the model was trained to identify the following instruments:
- Piano
- Violin
- Viola
- Cello
- Bassoon
- Clarinet
- Horn
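For illustration, a minimal multi-label CNN over log-mel inputs might look like the sketch below. This is not the project's actual architecture, and the input shape is an assumption; the point is the design choice of one independent sigmoid output per instrument with binary cross-entropy loss, so several instruments can be detected in the same clip.

```python
import tensorflow as tf

def build_model(input_shape=(128, 216, 1), n_instruments=7):
    """Minimal multi-label CNN: one independent sigmoid per instrument."""
    model = tf.keras.Sequential([
        tf.keras.layers.Input(shape=input_shape),
        tf.keras.layers.Conv2D(32, 3, activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(64, 3, activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.GlobalAveragePooling2D(),
        tf.keras.layers.Dense(n_instruments, activation="sigmoid"),
    ])
    model.compile(optimizer="adam",
                  loss="binary_crossentropy",
                  metrics=["binary_accuracy"])
    return model
```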
The model was evaluated on the test set with the following results:
- Test Loss: 0.08
- Test Accuracy: 76.8%
- Precision: 95.8%
- Recall: 95.6%
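For context, multi-label precision and recall are computed by thresholding the per-instrument sigmoid outputs and comparing label by label. The 0.5 threshold and micro averaging in this sketch are assumptions, not necessarily what was used for the numbers above.

```python
import numpy as np
from sklearn.metrics import precision_score, recall_score

# Toy data: 3 clips x 7 instruments; y_prob stands in for model.predict(X_test).
y_true = np.array([[1, 0, 1, 0, 0, 0, 0],
                   [0, 1, 0, 0, 0, 0, 1],
                   [1, 1, 0, 0, 0, 0, 0]])
y_prob = np.random.default_rng(0).random((3, 7))
y_pred = (y_prob > 0.5).astype(int)    # assumed decision threshold

print(precision_score(y_true, y_pred, average="micro", zero_division=0))
print(recall_score(y_true, y_pred, average="micro", zero_division=0))
```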
While the model performs well on the test set, real-world performance may vary depending on the quality and complexity of the input audio.
OrchestrAIte was developed by a four-person team as part of a project at Le Wagon Tokyo. The project was completed in two weeks and demoed on December 6, 2024.