Open Speech and Language Resources


Identifier: SLR106

Summary: West African Virtual Assistant Speech Recognition Corpus

Category: Speech

License: Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)

Downloads (use a mirror closer to you):
nicolingua-0004-west-african-va-asr-corpus.tgz [254M]   Mirrors: [US]   [EU]   [CN]
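
After downloading, it can help to inspect the archive before unpacking it. The following is a minimal sketch, not an official loader: it counts the archive's files by extension without extracting anything. The function name `summarize_archive` is hypothetical, and the sketch makes no assumption about the internal directory layout, which this page does not describe. It only assumes the archive was saved locally under the file name listed above.

```python
import tarfile
from collections import Counter
from pathlib import Path


def summarize_archive(archive_path):
    """Count regular files in a gzip-compressed tar archive, grouped by extension.

    Nothing is extracted; only the member headers are read.
    """
    counts = Counter()
    with tarfile.open(archive_path, "r:gz") as tar:
        for member in tar:
            if member.isfile():
                # Normalize the extension so ".WAV" and ".wav" are counted together.
                counts[Path(member.name).suffix.lower()] += 1
    return counts


if __name__ == "__main__":
    # Assumption: the archive sits in the current working directory.
    archive = Path("nicolingua-0004-west-african-va-asr-corpus.tgz")
    if archive.exists():
        for extension, count in summarize_archive(archive).most_common():
            print(f"{extension or '(no extension)'}: {count}")
```

Listing members first is a cautious choice: it shows what the archive contains before anything is written to disk.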

About this resource:

This dataset contains 10,083 recorded utterances in French, Maninka, Pular and Susu from 49 speakers (16 female and 33 male), aged 5 to 76, recorded on a variety of devices.

Please see our paper for more details on this dataset. Additional resources can be found in the following git repository:

You can cite our work using the following BibTeX entry.

    title={Using Radio Archives for Low-Resource Speech Recognition: Towards an Intelligent Virtual Assistant for Illiterate Users},
    author={Doumbouya, Moussa and Einstein, Lisa and Piech, Chris},
    booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},

External URL: