The aSwine (Annotated Swine) dataset is a weakly labeled collection of ~15 hours of environmental audio recordings from a pig farm.
The annotation methodology employed a weak labeling approach where, for each 5-second interval, one or more labels are assigned based on the presence/absence of associated events. The events within the aSwine dataset are categorized into two primary groups: 1) animal events and 2) human events.
Human Events | Animal Events |
---|---|
Cleaning | Stress/Disputes |
Speech | Coughing/Sneezing |
Feed Provision | Other events |
Other events |
The distributions of annotated events between categories and labels are shown in the pie charts below:
The dataset is intended for the development and evaluation of machine learning models for the detection of human and animal events in a pig farm environment. The dataset is prearranged into a training set and a test set, with the training set containing 80% of the data and the test set containing the remaining 20%. The dataset is available in the form of .wav files, with each file corresponding to a 6-minute audio recording, with a sample rate of 16 kHz. The dataset is accompanied by CSV files containing the annotations for the training and test sets.
A more detailed description of the dataset is available in the original paper.
The dataset can be download as a single .zip file from this repository: Downlaod aSwine dataset
In the midst of the growing interest in applying machine learning (ML) techniques within precision livestock farming (PLF), researchers associated with the Faculty of Veterinary Medicine and Zootechnics (FMVZ) at the University of São Paulo (USP) conducted diverse studies utilizing sensing technologies across various animal species and environmental conditions. The Swine Sound Dataset originated from previous feed additive trials involving groups of finishing pigs. The FMVZ researchers established an infrastructure for continuous 24/7 monitoring, with the data stored in video format (.mp4) and subsequently converted into raw waveform format (.wav) at a sample rate of 16KHz.
The chosen labels were specifically curated to encompass crucial events for observing both human handling and animal behaviors. Detecting these events could significantly aid in auditing day-to-day operations. Throughout the annotation process, an additional category labeled "Other Events" was introduced for each main category (notated as "Others (H)" and "Others (A)"). This inclusive category aims to cover occurrences not addressed by existing labels but potentially relevant in detection scenarios. Examples of such events may include chirping birds, calls of wild animals, unidentified loud noises, and sounds from automobiles. This category accommodates diverse and unforeseen events that hold potential significance in the context of detection.
- audio/*.wav: audio recordings in .wav format (6-minute, 16 kHz, mono) with the following naming convention:
ALA_E_2_{YEAR}-{MONTH}-{DAY}_{HOUR}-{MINUTE}-{SECOND}_{MILLISECOND}.wav
- meta/raw/*.csv: Raw 5-second annotations in .csv format for the full set, training set, and test set, with the following structure:
Timestamp
: timestamp of the 5-second intervalDuração
: duration of the 5-second intervalLimpeza de Baia
: binary label for the cleaning eventFala Humana
: binary label for the speech eventAlimentação de Baias
: binary label for the feed provision eventOutrosH
: binary label for other human eventsEstresseDisputas
: binary label for the stress/disputes eventTosseEspirro
: binary label for the coughing/sneezing eventOutrosA
: binary label for other animal events
- meta/1s/*.csv: Processed 1-second annotations in .csv format for the full set, training set, and test set, with the following structure:
audio_file_path
: path to the corresponding audio fileoffset
: offset in seconds from the start of the audio fileduration
: duration of the 1-second intervalLimpeza de Baia
: binary label for the cleaning eventFala Humana
: binary label for the speech eventAlimentação de Baias
: binary label for the feed provision eventOutrosH
: binary label for other human eventsEstresseDisputas
: binary label for the stress/disputes eventTosseEspirro
: binary label for the coughing/sneezing eventOutrosA
: binary label for other animal events
- meta/1s_pruned/*.csv: Removed invalid annotations, i.e., annotations at the boundaries of two different files, then processed 1-second annotations in .csv format for the full set, training set, and test set, with the following structure:
audio_file_path
: path to the corresponding audio fileoffset
: offset in seconds from the start of the audio fileduration
: duration of the 1-second intervalLimpeza de Baia
: binary label for the cleaning eventFala Humana
: binary label for the speech eventAlimentação de Baias
: binary label for the feed provision eventOutrosH
: binary label for other human eventsEstresseDisputas
: binary label for the stress/disputes eventTosseEspirro
: binary label for the coughing/sneezing eventOutrosA
: binary label for other animal events
The dataset is released under the Creative Commons Attribution-NonCommercial 4.0 International license.
If you find this dataset useful, please consider citing the original paper:
@article{Souza2025,
title = {Deep learning solutions for audio event detection in a swine barn using environmental audio and weak labels},
volume = {55},
ISSN = {1573-7497},
url = {http://dx.doi.org/10.1007/s10489-025-06555-6},
DOI = {10.1007/s10489-025-06555-6},
number = {7},
journal = {Applied Intelligence},
publisher = {Springer Science and Business Media LLC},
author = {Souza, André Moreira and Kobayashi, Livia Lissa and Tassoni, Lucas Andrietta and Garbossa, Cesar Augusto Pospissil and Ventura, Ricardo Vieira and Machado de Sousa, Elaine Parros},
year = {2025},
month = apr
}