Crowdsourced high-quality Catalan speech data set.
Identifier: SLR69
Summary: Data set which contains recordings of Catalan.
Category: Speech
License: Attribution-ShareAlike 4.0 International
Downloads (use a mirror closer to you):
about.html [1.4K] (Information about the data set
) Mirrors:
[US]
[EU]
[CN]
LICENSE [20K] (License information for the data set
) Mirrors:
[US]
[EU]
[CN]
line_index_female.tsv [191K] (All utterances for the female speakers.
) Mirrors:
[US]
[EU]
[CN]
line_index_male.tsv [160K] (All utterances for the male speakers.
) Mirrors:
[US]
[EU]
[CN]
ca_es_female.zip [1.0G] (Archive file with all audio for the female speakers.
) Mirrors:
[US]
[EU]
[CN]
ca_es_male.zip [804M] (Archive file with all audio for the male speakers.
) Mirrors:
[US]
[EU]
[CN]
About this resource:
The data set has been manually quality checked, but there might still be errors.
Please report any issues in the following issue tracker on GitHub. https://github.com/googlei18n/language-resources/issues
See LICENSE file for license information.
Copyright 2018, 2019 Google, Inc.
If you use this data in publications, please cite it as follows:
@inproceedings{kjartansson-etal-2020-open, title = {{Open-Source High Quality Speech Datasets for Basque, Catalan and Galician}}, author = {Kjartansson, Oddur and Gutkin, Alexander and Butryna, Alena and Demirsahin, Isin and Rivera, Clara}, booktitle = {Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL)}, year = {2020}, pages = {21--27}, month = may, address = {Marseille, France}, publisher = {European Language Resources association (ELRA)}, url = {https://www.aclweb.org/anthology/2020.sltu-1.3}, ISBN = {979-10-95546-35-1}, }