Inkqubo evulekileyo yokudibanisa intetho ye-RHVoice 1.6.0 yakhululwa, ekuqaleni yaphuhliswa ukubonelela ngenkxaso esemgangathweni yolwimi lwesiRashiya, kodwa emva koko yalungiselelwa ezinye iilwimi, kuquka isiNgesi, isiPhuthukezi, isiUkrainian, isiKyrgyz, isiTatar kunye nesiGeorgia. Ikhowudi ibhalwe kwi-C ++ kwaye isasazwe phantsi kwelayisensi ye-LGPL 2.1. Ixhasa umsebenzi kwi-GNU/Linux, Windows kunye ne-Android. Inkqubo iyahambelana ne-TTS eqhelekileyo (i-text-to-speech) ujongano lokuguqula umbhalo kwintetho: SAPI5 (Windows), Speech Dispatcher (GNU / Linux) kunye ne-Android Text-To-Speech API, kodwa ingasetyenziswa kwi-NVDA. umfundi wesikrini. Umyili kunye nomphuhlisi ophambili we-RHVoice ngu-Olga Yakovleva, ophuhlisa iprojekthi nangona engaboni ngokupheleleyo.
Π Π½ΠΎΠ²ΠΎΠΉ Π²Π΅ΡΡΠΈΠΈ Π΄ΠΎΠ±Π°Π²Π»Π΅Π½ΠΎ 5 Π½ΠΎΠ²ΡΡ Π²Π°ΡΠΈΠ°Π½ΡΠΎΠ² Π³ΠΎΠ»ΠΎΡΠΎΠ² Π΄Π»Ρ ΡΡΡΡΠΊΠΎΠΉ ΡΠ΅ΡΠΈ. Π Π΅Π°Π»ΠΈΠ·ΠΎΠ²Π°Π½Π° ΠΏΠΎΠ΄Π΄Π΅ΡΠΆΠΊΠ° Π°Π»Π±Π°Π½ΡΠΊΠΎΠ³ΠΎ ΡΠ·ΡΠΊΠ°. ΠΠ±Π½ΠΎΠ²Π»ΡΠ½ ΡΠ»ΠΎΠ²Π°ΡΡ Π΄Π»Ρ ΡΠΊΡΠ°ΠΈΠ½ΡΠΊΠΎΠ³ΠΎ ΡΠ·ΡΠΊΠ°. Π Π°ΡΡΠΈΡΠ΅Π½Π° ΠΏΠΎΠ΄Π΄Π΅ΡΠΆΠΊΠ° ΠΎΠ·Π²ΡΡΠΈΠ²Π°Π½ΠΈΡ ΡΠΈΠΌΠ²ΠΎΠ»ΠΎΠ² emoji. ΠΡΠΎΠ²Π΅Π΄Π΅Π½Π° ΡΠ°Π±ΠΎΡΠ° ΠΏΠΎ ΡΡΡΡΠ°Π½Π΅Π½ΠΈΡ ΠΎΡΠΈΠ±ΠΎΠΊ Π² ΠΏΡΠΈΠ»ΠΎΠΆΠ΅Π½ΠΈΠΈ Π΄Π»Ρ ΠΏΠ»Π°ΡΡΠΎΡΠΌΡ Android, ΡΠΏΡΠΎΡΡΠ½ ΠΈΠΌΠΏΠΎΡΡ ΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°ΡΠ΅Π»ΡΡΠΊΠΈΡ ΡΠ»ΠΎΠ²Π°ΡΠ΅ΠΉ, Π° ΡΠ°ΠΊΠΆΠ΅ Π΄ΠΎΠ±Π°Π²Π»Π΅Π½Π° ΠΏΠΎΠ΄Π΄Π΅ΡΠΆΠΊΠ° ΠΏΠ»Π°ΡΡΠΎΡΠΌΡ Android 11. Π ΡΠ΄ΡΠΎ Π΄Π²ΠΈΠΆΠΊΠ° Π΄ΠΎΠ±Π°Π²Π»Π΅Π½Ρ Π½ΠΎΠ²ΡΠ΅ Π½Π°ΡΡΡΠΎΠΉΠΊΠΈ ΠΈ ΡΡΠ½ΠΊΡΠΈΠΎΠ½Π°Π»ΡΠ½ΡΠ΅ Π²ΠΎΠ·ΠΌΠΎΠΆΠ½ΠΎΡΡΠΈ, Π²ΠΊΠ»ΡΡΠ°Ρ g2p.case, word_break ΠΈ ΠΏΠΎΠ΄Π΄Π΅ΡΠΆΠΊΡ ΡΠΈΠ»ΡΡΡΠΎΠ² ΡΠΊΠ²Π°Π»ΠΈΠ·Π°ΡΠΈΠΈ.
Masikhumbule ukuba i-RHVoice isebenzisa uphuhliso lweprojekthi ye-HTS (i-HMM/DNN-based Speech Synthesis System) kunye ne-parametric synthesis method with statistical models (Statistical Parametric Synthesis esekelwe kwi-HMM - i-Hidden Markov Model). Inzuzo yemodeli yezibalo ziindleko eziphezulu eziphantsi kunye nokungafuneki kwamandla e-CPU. Yonke imisebenzi yenziwa ekuhlaleni kwisixokelelwano somsebenzisi. Amanqanaba amathathu omgangatho wentetho axhaswayo (umgangatho ophantsi, uphezulu ukusebenza kunye nelifutshane ixesha lokuphendula).
Icala elisezantsi lemodeli yamanani lumgangatho ophantsi wokubizwa, ongafikeleli kwinqanaba le-synthesizers ezivelisa intetho esekwe kwindibaniselwano yamaqhekeza entetho yendalo, kodwa nangona kunjalo isiphumo sifundeka kakuhle kwaye sifana nokusasaza okurekhodiweyo kwisandisi-lizwi. . Ukuthelekisa, iprojekthi ye-Silero, ebonelela nge-injini yokudibanisa intetho evulekileyo ngokusekelwe kubuchwepheshe bokufunda koomatshini kunye nesethi yeemodeli zolwimi lwesiRashiya, iphezulu kumgangatho we-RHVoice.
Kukho iinketho zezwi ezili-13 ezifumanekayo ngolwimi lwesiRashiya, kunye ne-5 yesiNgesi. Kwizicwangciso unokutshintsha isantya, isantya kunye nevolumu. Ilayibrari ye-Sonic ingasetyenziselwa ukutshintsha i-tempo. Kuyenzeka ukuba uchonge ngokuzenzekelayo kwaye utshintshe iilwimi ngokusekwe kuhlalutyo lombhalo wegalelo (umzekelo, kumagama kunye neengcaphuno zolunye ulwimi, imodeli yokwenziwa kolu lwimi ingasetyenziswa). Iiprofayili zelizwi ziyaxhaswa, zichaza indibaniselwano yamazwi eelwimi ezahlukeneyo.
umthombo: opennet.ru