
WhaleMoanDetector: A fine-tuned Faster R-CNN for detecting and classifying blue and fin whale moans in audio data


Blue whales off the coast of San Diego. Photo credit: Manuel Mendieta

Overview

The WhaleMoanDetector repository contains the Python code required to train, test, and run inference with WhaleMoanDetector, a fine-tuned Faster R-CNN model for blue and fin whale moans. We fine-tuned the final two layers of Faster R-CNN (Ren et al. 2015) with a ResNet-50 backbone, starting from the pre-trained weights available in PyTorch.
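
The snippet below is a minimal sketch of how such a model can be assembled in PyTorch/torchvision: a Faster R-CNN with a ResNet-50 FPN backbone is loaded with pre-trained weights and given a new box-predictor head. The class count and the choice of which layers to freeze are illustrative assumptions; the actual training configuration is defined in code/train.py.

    # Sketch: load a torchvision Faster R-CNN (ResNet-50 FPN backbone) with
    # pre-trained weights and replace its box-predictor head.
    # Class count and layer freezing are illustrative; see code/train.py.
    import torchvision
    from torchvision.models.detection.faster_rcnn import FastRCNNPredictor

    model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")

    # One background class + five call types (blue A, B, D; fin 20 Hz, 40 Hz)
    num_classes = 6
    in_features = model.roi_heads.box_predictor.cls_score.in_features
    model.roi_heads.box_predictor = FastRCNNPredictor(in_features, num_classes)

    # Train only the detection head, keeping the backbone and RPN frozen
    for name, param in model.named_parameters():
        param.requires_grad = name.startswith("roi_heads")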

A brief tutorial highlighting the steps needed to deploy WhaleMoanDetector and visualize results is included below in the workflow section.

WhaleMoanDetector is trained to identify eastern North Pacific blue whale A, B, and D moans and fin whale 20 Hz and 40 Hz moans in 60-second spectrograms.

Additional information about eastern North Pacific blue whale moans can be found in:

Oleson, E., J. Calambokidis, W. Burgess, M. McDonald, C. A. LeDuc and J. A. Hildebrand. 2007. Behavioral context of call production by eastern North Pacific blue whales. Marine Ecology Progress Series 330:269-284. https://www.int-res.com/abstracts/meps/v330/p269-284/

Additional information about Southern California fin whale moans can be found in:

Širović, A., L. N. Williams, S. M. Kerosky, S. M. Wiggins and J. A. Hildebrand. 2013. Temporal separation of two fin whale call types across the eastern North Pacific. Marine Biology 160:47-57. https://link.springer.com/article/10.1007/s00227-012-2061-z

WhaleMoanDetector Directory Structure:

WhaleMoanDetector/
├── LICENSE
├── README.md          <- The top-level README for users.
├── code               <- All code required to run detector
│   ├── AudioDetectionDataset.py     <- Dataset class.
│   ├── AudioStreamDescriptor.py     <- Parses xwav and wav headers.
│   ├── call_context_filter.py       <- Filters predictions based on call duration and frequency.
│   ├── config.yaml                  <- User defined variables for running inference.
│   ├── custom_collate.py            <- Collate function.
│   ├── extract_wav_header.py	     <- Function to extract xwav and wav headers.
│   ├── freq_bbox.py		     <- Converts bounding boxes to frequency (Hz).
│   ├── inference_functions.py       <- Functions to make audio file chunks and spectrograms, run inference,
│   │                                   and save the prediction spreadsheet and images.
│   ├── inference_pipeline.py	     <- Wrapper to run inference_functions and modify file paths.
│   ├── make_annotations.py          <- Wrapper for make_spectrograms.py to generate training/validation/testing examples.
│   ├── make_spectrograms.py         <- Necessary functions to make audio file chunks, spectrograms, and labels for
│   │						training, testing, and validation.
│   ├── modify_timestamp.py          <- Converts Triton LoggerPro files to the "modified annotation" format, with
│   │                                   start and end times given as seconds from the start of the wav file.
│   ├── modify_timestamp_function.py <- Necessary function for modify_timestamp.py.
│   ├── plot_groundtruth.py          <- Plots groundtruth annotations in Spyder software.
│   ├── PR_curve.py      	     <- Generates PR curve for test data.
│   ├── plot_predictions.py          <- Visualize WhaleMoanDetector predictions in Spyder.
│   ├── time_bbox.py                 <- Converts bounding boxes to time (sec into file).
│   ├── train.py                     <- Train WhaleMoanDetector.
│   ├── train_val_test_split_all.py  <- Make train/val/test splits using outputs of make_annotations.py.
│   ├── validation.py                <- Runs validation.
│
├── figures               <- Graphics and figures to be used in reporting
├── labeled_data
│   ├── CalCOFI
│   │   └── modified_annotations <- Modified start and end time .csv files
│   ├── HARP
│   │   └── modified_annotations <- Modified start and end time .csv files
│   │
│   ├── spectrograms   <- The final spectrogram images used for model training and testing (not uploaded to GitHub)
│   │   ├── CalCOFI
│   │   └── HARP
│   └── wav            <- The original wav files (not uploaded to GitHub)
│
└── models             <- Trained models (not uploaded to GitHub)

For new Python users:

  1. Download Python 3.9

  2. Download Miniconda

  3. Miniconda tutorial

WhaleMoanDetector Workflow

  1. Clone the Repository:

    git clone https://github.com/m1alksne/WhaleMoanDetector.git
    cd WhaleMoanDetector
  2. Create a Virtual Environment:

    conda create -n whalemoandetector python=3.9
    conda activate whalemoandetector
  3. Install Dependencies:

    pip install -r requirements.txt
  4. Run WhaleMoanDetector on wav files:

    a. Modify File Paths in the Configuration File:

    Example of config.yaml:

    wav_directory: 'L:/CHNMS_audio/CHNMS_NO_01/CHNMS_NO_01_disk01_df100'
    txt_file_path: 'M:/Mysticetes/WhaleMoanDetector_outputs/CHNMS_NO_01/CHNMS_NO_01_raw_detections.txt'
    model_path: 'L:/WhaleMoanDetector/models/WhaleMoanDetector.pth'
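
    The inference pipeline reads these paths when it starts. The snippet below is a minimal sketch of loading such a YAML configuration with PyYAML; the actual parsing happens inside code/inference_pipeline.py.

    # Sketch: read the user-defined paths from config.yaml (keys as shown above).
    # The real parsing logic lives in code/inference_pipeline.py.
    import yaml

    with open("config.yaml", "r") as f:
        config = yaml.safe_load(f)

    wav_directory = config["wav_directory"]
    txt_file_path = config["txt_file_path"]
    model_path = config["model_path"]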

    b. Execute inference_pipeline.py

    cd code
    python inference_pipeline.py

    Executing the inference pipeline deploys WhaleMoanDetector on the directory of wav files listed in wav_directory. The inference_pipeline.py script preprocesses the audio data by generating 60-second, non-overlapping audio segments. For each segment, a spectrogram is computed with the librosa package using a 1-second Hamming window with 90% overlap between consecutive frames. Spectrograms are truncated to 10-150 Hz and normalized individually between 0 and 1 for consistent input scaling:

[example spectrogram]
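
The snippet below is a minimal sketch of that preprocessing in librosa, assuming the parameters described above (60-second segments, 1-second Hamming window, 90% overlap, 10-150 Hz band, per-spectrogram 0-1 normalization). The file name is hypothetical and the actual implementation lives in code/inference_functions.py.

    # Sketch of the preprocessing described above. Parameter choices follow the
    # text; the actual implementation is in code/inference_functions.py.
    import librosa
    import numpy as np

    def chunk_to_spectrogram(chunk, sr):
        n_fft = int(sr * 1.0)          # 1-second Hamming window
        hop_length = int(n_fft * 0.1)  # 90% overlap between consecutive frames
        stft = librosa.stft(chunk, n_fft=n_fft, hop_length=hop_length, window="hamming")
        spec_db = librosa.amplitude_to_db(np.abs(stft))

        # Keep only the 10-150 Hz band
        freqs = librosa.fft_frequencies(sr=sr, n_fft=n_fft)
        spec_db = spec_db[(freqs >= 10) & (freqs <= 150), :]

        # Normalize each spectrogram to [0, 1]
        return (spec_db - spec_db.min()) / (spec_db.max() - spec_db.min())

    # Split a wav file (hypothetical name) into non-overlapping 60-second segments
    y, sr = librosa.load("example.wav", sr=None)
    segments = [y[i:i + 60 * sr] for i in range(0, len(y), 60 * sr)]
    spectrograms = [chunk_to_spectrogram(seg, sr) for seg in segments]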

WhaleMoanDetector generates predictions for each spectrogram. Predictions are written to the .txt file specified by txt_file_path, and PNG images of spectrograms containing predictions are saved to the same directory. An example of the output directory and .txt file is shown below:

[example output directory] Download the SOCAL_H_65_raw_detections example .txt file.
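
For downstream analysis the raw detection file can be loaded into pandas. The sketch below assumes a tab-delimited layout and a hypothetical "label" column; inspect the actual output file to confirm its delimiter and column names.

    # Sketch: load the raw detection spreadsheet for downstream analysis.
    # The delimiter and the "label" column are assumptions about the file layout.
    import pandas as pd

    detections = pd.read_csv("SOCAL_H_65_raw_detections.txt", sep="\t")
    print(detections.head())
    print(detections["label"].value_counts())  # hypothetical column name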

  5. Visualize WhaleMoanDetector predictions:

    python code/plot_predictions.py "L:\WhaleMoanDetector_predictions\SOCAL_H_65\SOCAL_H_65_raw_detections.txt"

[example prediction spectrograms]
