Home
Welcome to the Task-4-Large-scale-weakly-supervised-sound-event-detection-for-smart-cars wiki!
Script to download the development data for Task 4: Large-scale weakly supervised sound event detection for smart cars.
- youtube-dl - [sudo] pip install --upgrade youtube_dl
- pafy - [sudo] pip install pafy
- tqdm (progress bar) - [sudo] pip install tqdm
- multiprocessing - included in the Python standard library since Python 2.6; [sudo] pip install multiprocessing is only needed for older versions
- sox tool - sudo apt-get install sox
- Downloads the audio from the videos, first for the testing set and then for the training set.
  - Multiprocessing ensures that three files are downloaded simultaneously, which reduces the heavy download time to 40 percent of the single-threaded download time.
- Formats the audio with consistent parameters, currently set to 1 channel, 16-bit precision, and a 44.1 kHz sampling rate.
- Extracts the 10-second segments from the formatted audio according to the start and end times.
- The script output includes the audio from all three steps (original, formatted, and segmented), unless the script is modified to remove the original audio and/or the formatted audio.
- To give every run of the download a unique identifier, the script stores the launch timestamp and adds it to the names of the output files and folders.
- Please contact the author if one or more videos are not properly downloaded or available, or with any other issue.
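The formatting, segmenting, and parallel download steps above can be sketched as follows. This is only an illustration under stated assumptions, not the script's actual code: the function names `format_command`, `segment_command`, and `run_parallel` are hypothetical, and the commands are built as argument lists rather than the command strings the script passes to os.system.

```python
import multiprocessing


def format_command(src, dst):
    """Build a sox command that writes src as 1 channel, 16-bit, 44.1 kHz audio."""
    # In sox, format options placed before the output file apply to the output.
    return ["sox", src, "-c", "1", "-b", "16", "-r", "44100", dst]


def segment_command(src, dst, start, end):
    """Build a sox command that cuts the span from start to end (in seconds)."""
    # The sox "trim" effect takes a start position followed by a length.
    return ["sox", src, dst, "trim", str(start), str(end - start)]


def run_parallel(worker, items, processes=3):
    """Apply worker to every item using a pool of worker processes.

    The default of three processes mirrors the three simultaneous
    downloads described above; worker must be a top-level function
    so that it can be sent to the worker processes.
    """
    with multiprocessing.Pool(processes=processes) as pool:
        return pool.map(worker, items)
```

Each command list could then be executed with `subprocess.run(cmd, check=True)`, provided sox is installed.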
Download audio: testing_set.csv, training_set.csv
Groundtruth weak labels: groundtruth_weak_label_testing_set.csv, groundtruth_weak_label_training_set.csv
Groundtruth strong labels: groundtruth_strong_label_testing_set.csv, groundtruth_strong_label_training_set.csv
$ python download_youtube_audio_from_csv_and_delete_original_standalone.py <CSV filename - relative path is also fine>
Sample Usage - python download_youtube_audio_from_csv_and_delete_original_standalone.py training_set.csv
- Audio formatting can be modified in the "format_audio" method defined in the script download_youtube_audio_from_csv_and_delete_original.py
- The original audio and/or the formatted audio can be removed by uncommenting and modifying <os.system(cmdstring2)> in the "download_audio_method" function defined in download_youtube_audio_from_csv_and_delete_original_standalone.py
Output audio paths:
- First folder contains the original best audio from YouTube: <csv_name><testing/training>audio_downloaded
- Second folder contains the corresponding formatted audio: <csv_name><testing/training>audio_formatted_downloaded
- Third folder contains the extracted 10-sec segments: <csv_name><testing/training>_audio_formatted_downloaded_and_ssegmented_downloads
Note: The string "Y" is added to the start of each downloaded audio filename, because tools like sox and ffmpeg have problems when a filename starts with "--" or "-".
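The renaming described in the note can be sketched as below. The helper name `safe_audio_name` is hypothetical, and placing "Y" at the very start of the name is an assumption inferred from the note; the script itself may build the name differently.

```python
def safe_audio_name(video_id, prefix="Y"):
    """Return a filename stem that cannot be mistaken for a command-line option.

    YouTube video IDs may begin with "-", which sox and ffmpeg would
    parse as the start of an option, so a fixed prefix is put in front.
    """
    return prefix + video_id


# Example: an ID starting with dashes becomes safe to pass on a command line.
print(safe_audio_name("--abc123"))
```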