GitHub - rachit6105/Non_speech_sound_classification: 7K Non speech sound dataset

Non-Speech Sound Classification

This project classifies non-speech sounds into seven distinct categories using a Convolutional Neural Network (CNN). The dataset contains 7,000 audio samples, and class imbalance is addressed as part of the preprocessing pipeline.
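The README does not say which technique was used to handle class imbalance. As one common possibility, the sketch below computes inverse-frequency class weights in plain Python. The function name `inverse_frequency_weights` and the toy label distribution are assumptions made for illustration and are not taken from the repository.

```python
from collections import Counter


def inverse_frequency_weights(labels, num_classes):
    """Return one weight per class, proportional to 1 / class count.

    Uses the "balanced" heuristic n_samples / (n_classes * count), so a
    perfectly balanced dataset yields a weight of 1.0 for every class.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    weights = []
    for cls in range(num_classes):
        count = counts.get(cls, 0)
        if count == 0:
            raise ValueError(f"class {cls} has no samples")
        weights.append(n_samples / (num_classes * count))
    return weights


# Hypothetical, deliberately imbalanced labels (three classes for brevity;
# the project itself has seven).
labels = [0] * 6 + [1] * 3 + [2] * 1
weights = inverse_frequency_weights(labels, num_classes=3)
print(weights)  # rarer classes receive larger weights
```

In PyTorch, weights like these could be converted with `torch.tensor(weights)` and passed to `torch.nn.CrossEntropyLoss(weight=...)`, so that mistakes on rare classes contribute more to the loss. Resampling the dataset is another option; which approach the project actually took is not stated.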

Model Highlights:

Implemented a custom CNN architecture for audio classification
Applied techniques to correct class imbalance
Evaluated on a held-out test set with the following performance:

Metric	Value
Test Loss	0.5746
Test Accuracy	82.48%
Precision	83.91%
Recall	82.48%
F1 Score	82.69%

Results:

Tools and Libraries:

PyTorch
SciPy
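As a rough illustration of how these two libraries might fit together, the sketch below uses SciPy to turn a waveform into a log spectrogram and a small PyTorch CNN to map it to seven class scores. Only the number of classes comes from this README. The class name `SmallAudioCNN`, the layer sizes, the sample rate, and the spectrogram settings are all assumptions and do not describe the project's actual architecture.

```python
import numpy as np
import torch
import torch.nn as nn
from scipy import signal

NUM_CLASSES = 7  # from the README; everything else here is assumed


class SmallAudioCNN(nn.Module):
    """A deliberately tiny CNN over single-channel spectrograms."""

    def __init__(self, num_classes=NUM_CLASSES):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # handles variable-length clips
        )
        self.classifier = nn.Linear(32, num_classes)

    def forward(self, x):
        x = self.features(x)                   # (batch, 32, 1, 1)
        return self.classifier(x.flatten(1))   # (batch, num_classes)


def log_spectrogram(waveform, sample_rate):
    """Compute a log-power spectrogram with SciPy."""
    _, _, spec = signal.spectrogram(waveform, fs=sample_rate, nperseg=256)
    return np.log(spec + 1e-10).astype(np.float32)


# One second of synthetic noise stands in for a real audio clip.
sample_rate = 16000
waveform = np.random.default_rng(0).standard_normal(sample_rate)

spec = log_spectrogram(waveform, sample_rate)   # (freq_bins, time_frames)
batch = torch.from_numpy(spec)[None, None]      # (1, 1, freq_bins, time_frames)

model = SmallAudioCNN().eval()
with torch.no_grad():
    logits = model(batch)
print(logits.shape)  # one score per class
```

The adaptive pooling layer is a design choice that lets clips of different durations share one classifier head. A real pipeline would more likely pad or crop clips to a fixed length and normalise the spectrograms before training.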
