Data for Amharic speech recognition. Use together with melanet.
To download the data and extract the WAV archive:
git clone git@github.com:tilayealemu/melanet-data
cd melanet-data/data_st/wav
cat wav.tar.gz.* > wav.tar.gz
tar xzf wav.tar.gz
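
The steps above can also be wrapped in a small helper that checks the reassembled archive before unpacking it. This is only a sketch: the function name `reassemble_and_extract` is hypothetical and not part of this repository, and it assumes the split parts are named `wav.tar.gz.*` and sort into the correct order alphabetically.

```shell
# Hypothetical helper (not part of melanet-data): reassemble the split
# archive in the given directory, test it, extract it, and print the
# number of .wav files found afterwards.
reassemble_and_extract() {
    dir="$1"
    (
        cd "$dir" || exit 1
        # Join the parts; the shell expands wav.tar.gz.* in sorted order.
        cat wav.tar.gz.* > wav.tar.gz || exit 1
        # Fail early if the joined file is not a valid gzip stream.
        gzip -t wav.tar.gz || exit 1
        tar xzf wav.tar.gz || exit 1
        # Count the extracted .wav files (tr strips padding some wc versions add).
        find . -name '*.wav' | wc -l | tr -d ' '
    )
}
```

For example, `reassemble_and_extract melanet-data/data_st/wav` would run from the directory containing the clone. If `gzip -t` fails, a part is probably missing or incomplete, and cloning again is the simplest fix.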
The data was collected as part of the paper "An Amharic Speech Corpus for Large Vocabulary Continuous Speech Recognition" and was extracted from the ALFFA_PUBLIC GitHub repository. Many thanks to the following people (in alphabetical order):
- Bairu Tafila
- Elodie Gauthier
- Laurent Besacier
- Martha Tachbelie
- Michael Melese
- Million Meshesha
- Solomon Teferra Abate
- Wolfgang Menzel