Feature extraction with pre-trained spectrogram auto-encoders (fe_saec)

Overview

A Python package that extracts encoder-based features from spectrograms.
It extracts array features with pre-trained encoders and converts them to linear features (details in pic below).
The encoders partially pool the time axis, so the latent array representation is 2D (channel by time).
The extracted features are meant to be used in a companion project and its frontend.
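
To make the step from a 2D latent array to linear features concrete, here is a minimal sketch. It is not the package's actual conversion (that is described in the pic); it assumes, purely for illustration, that the time axis is summarized by mean and standard deviation per channel. The function name pool_latent_to_vector and the toy array are hypothetical.

```python
import numpy as np


def pool_latent_to_vector(latent: np.ndarray) -> np.ndarray:
    """Collapse a (channels, time) latent array into a 1D feature vector.

    Assumption for illustration only: each channel is summarized by its
    mean and standard deviation over time, and the two summaries are
    concatenated. fe_saec may use a different conversion.
    """
    if latent.ndim != 2:
        raise ValueError("expected a 2D (channels, time) array")
    mean_over_time = latent.mean(axis=1)  # one value per channel
    std_over_time = latent.std(axis=1)    # one value per channel
    return np.concatenate([mean_over_time, std_over_time])


# Hypothetical latent array: 4 channels, 10 time steps
latent = np.arange(40, dtype=float).reshape(4, 10)
features = pool_latent_to_vector(latent)
print(features.shape)  # (8,)
```

With this choice, the length of the feature vector depends only on the number of channels, not on the duration of the spectrogram.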

Installation (usage in Python project)

Tested with Python 3.11 and 3.12.
Make a fresh venv and install fe_saec from the Python package wheel found on this GitHub repo:
pip install https://github.com/sergezaugg/feature_extraction_saec/releases/download/vx.x.x/fe_saec-x.x.x-py3-none-any.whl
torch and torchvision must be installed separately for your specific CUDA version, e.g. for Windows with CUDA 12.6 and Python 3.12.8:
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu126
If you need another CUDA version, check the official PyTorch instructions.
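
After installing, it can help to confirm that torch is importable and whether it sees a CUDA device. The snippet below is a small sketch for that check; the helper name check_torch_cuda is hypothetical and not part of fe_saec. It only uses torch.cuda.is_available() and does not fail if torch is missing.

```python
import importlib.util


def check_torch_cuda():
    """Return (torch_installed, cuda_available) without raising if torch is absent."""
    if importlib.util.find_spec("torch") is None:
        return (False, False)
    import torch  # imported lazily so the check also works in a bare venv
    return (True, torch.cuda.is_available())


torch_installed, cuda_available = check_torch_cuda()
print(f"torch installed: {torch_installed}, CUDA available: {cuda_available}")
```

If torch is installed but CUDA is reported as unavailable, the installed build may not match the local CUDA version.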

Usage

Prepare PNG-formatted color images of spectrograms, e.g. with this tool.
sample_code.py illustrates a pipeline to extract features.
Extracted features are written to disk as NPZ files in the parent directory of the images directory.
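
The following sketch shows one way to locate and inspect the NPZ files after extraction. It does not use the fe_saec API. The helper names find_feature_files and describe_npz are hypothetical, and the array names stored inside the NPZ files are not documented here, so the code lists whatever names it finds instead of assuming any.

```python
from pathlib import Path


def find_feature_files(images_dir):
    """List NPZ files located in the parent directory of images_dir."""
    parent = Path(images_dir).resolve().parent
    return sorted(parent.glob("*.npz"))


def describe_npz(npz_path):
    """Return {array_name: shape} for one NPZ file (requires NumPy)."""
    import numpy as np  # imported lazily; only needed when a file is opened
    with np.load(npz_path) as data:
        # data.files holds the names of the arrays stored in the archive
        return {name: data[name].shape for name in data.files}
```

Assuming the spectrogram images live in a hypothetical directory my_project/images, the files could be inspected like this:

```python
for npz_path in find_feature_files("my_project/images"):
    print(npz_path.name, describe_npz(npz_path))
```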

Project Structure

├── dev/                # Data, models, and dirs for code development
├── pics/               # Pictures for documentation
├── src/                # Source code (Python package)
├── tests/              # Tests for CI
├── pyproject.toml      # Build configuration
├── README.md           # Project documentation
├── requirements.txt    # Python dependencies
└── sample_code.py      # Example usage script
