Mozilla Common Voice 9.0 Voice Dataset Update

Mozilla has released an update to the Common Voice datasets, which include pronunciation samples from about 200 thousand people. The data is released into the public domain (CC0). The datasets can be used in machine learning systems to build speech recognition and synthesis models.

Compared with the previous update, the volume of speech material in the collection has grown by about 10%, from 18.2 to 20.2 thousand hours. The number of supported languages has increased from 87 to 93. More than 100 hours of voice data have been accumulated for 27 languages, and more than 500 hours for 9 languages. For 9 languages, the share of female speech has also reached at least 45%.

More than 81 thousand people contributed to the English set, dictating 2953 hours of speech (previously 79 thousand participants and 2886 hours). The Belarusian set covers 6326 participants and 1054 hours of speech material (previously 6160 participants and 987 hours), Russian 2585 participants and 201 hours (previously 2452 participants and 193 hours), Uzbek 1503 participants and 231 hours (previously 1355 participants and 227 hours), and Ukrainian 696 participants and 79 hours (previously 684 participants and 76 hours).
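The per-language figures above are easier to compare as absolute and relative growth. The following Python sketch is only an illustration: the hour counts are copied from this article, while the names `HOURS` and `growth` are made up for the example and are not part of any Common Voice tooling.

```python
# Hours of speech per language, as (previous release, 9.0 release).
# Values are taken from the article text; nothing here is read from the dataset itself.
HOURS = {
    "English": (2886, 2953),
    "Belarusian": (987, 1054),
    "Russian": (193, 201),
    "Uzbek": (227, 231),
    "Ukrainian": (76, 79),
}


def growth(before: float, after: float) -> tuple[float, float]:
    """Return (absolute increase, percentage increase relative to `before`)."""
    delta = after - before
    return delta, 100.0 * delta / before


if __name__ == "__main__":
    for language, (before, after) in HOURS.items():
        delta, percent = growth(before, after)
        print(f"{language:<11} {before:>5} -> {after:>5} h  (+{delta} h, {percent:.1f}%)")
```

Run as a script, it prints one line per language. English and Belarusian gain the same number of hours, but the relative increase is much larger for the smaller Belarusian set.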

The Common Voice project aims to organize a collaborative effort to build a database of voice samples that reflects the full diversity of voices and speaking styles. Users are asked to read aloud phrases shown on the screen or to rate the quality of recordings added by other users. The resulting database of recordings of typical phrases, pronounced in many different ways, can be used without restriction in machine learning systems and research projects.

Source: opennet.ru
