This repository contains methods for classifying spectrogram signals collected from geophone arrays in the Oman region. The geophone data has resulted in a wide variety of signal types, or a "zoo" of spectrograms, and we aim to categorize these signals using unsupervised clustering methods.
The rules-based model has proven to be the most effective clustering approach so far. To run the model, follow the steps below. Ensure that you have cloned this repository, have access to the Fox Supercluster, and meet the necessary environment prerequisites.
-
Navigate to the
process_spectrograms
folder:cd process_spectrograms
-
Load spectrograms:
Run the following command to load the spectrograms and binary spectrograms for the specified station (
A01
), using a given window size and activity threshold:sbatch execute_filtered_specs.sh A01 72 345
A01
: Station number72
: Window size in seconds345
: Activity threshold
-
Remove resonances:
To remove stationary resonance features, run the following command:
sbatch execute_remove_resonances.sh 72 345
This step removes resonance features and stores the modified binary spectrograms in the Fox cluster. Note: The current version of the rules-based model requires this step, and training without removing resonances is not supported.
-
Navigate to the
rules_based_model
folder:cd ../rules_based_model
-
Apply slope filters and binning:
To create a lower-dimensional space for clustering, run the following command. It applies slope filters, sums rows, and bins the results manually:
sbatch execute_rules_based_encode.sh 5
5
: Number of bins
-
Generate an interactive t-SNE plot:
Finally, run this command to generate an interactive t-SNE plot and a cumulative occurrence plot of the top 5 classes:
sbatch execute_interactive_rules_based_top5.sh 20 40 5
-
20
: Perplexity -
40
: Number of classes -
5
: Number of bins -
The t-SNE plot will be saved in:
/fp/projects01/ec332/data/tsne_plots/
-
The cumulative occurrence plot will be saved in:
/fp/projects01/ec332/data/cumulative_cluster_plots/
-