Isistimu yokuhlanganisa inkulumo evulekile i-RHVoice 1.8.0 yakhululwa, ekuqaleni yathuthukiswa ukuze inikeze ukusekelwa kwekhwalithi ephezulu yolimi lwesiRashiya, kodwa yabe isiguqulelwa kwezinye izilimi, okuhlanganisa isiNgisi, isiPutukezi, isi-Ukrainian, isiKyrgyz, isiTatar nesiGeorgia. Ikhodi ibhalwe ngo-C++ futhi isatshalaliswa ngaphansi kwelayisensi ye-LGPL 2.1. Isekela umsebenzi ku-GNU/Linux, Windows ne-Android. Uhlelo luhambisana nezisetshenziswa ezijwayelekile ze-TTS (umbhalo-kuya-enkulumweni) zokuguqula umbhalo ube enkulumweni: SAPI5 (Windows), Speech Dispatcher (GNU/Linux) kanye ne-Android Text-To-Speech API, kodwa futhi ingasetshenziswa ku-NVDA. isifundi sesikrini. Umqambi kanye nonjiniyela oyinhloko we-RHVoice ngu-Olga Yakovleva, othuthukisa iphrojekthi naphezu kokungaboni ngokuphelele.
Π Π²Π΅ΡΡΠΈΠΈ 1.8 Π΄Π»Ρ ΠΏΠ»Π°ΡΡΠΎΡΠΌΡ Android ΠΏΡΠ΅Π΄Π»ΠΎΠΆΠ΅Π½Π° Π½ΠΎΠ²Π°Ρ ΡΠΈΡΡΠ΅ΠΌΠ° ΡΠΏΡΠ°Π²Π»Π΅Π½ΠΈΡ Π³ΠΎΠ»ΠΎΡΠΎΠ²ΡΠΌΠΈ ΠΈ ΡΠ·ΡΠΊΠΎΠ²ΡΠΌΠΈ Π΄Π°Π½Π½ΡΠΌΠΈ, ΠΏΠΎΠ·Π²ΠΎΠ»ΡΡΡΠ°Ρ Π·Π°Π³ΡΡΠΆΠ°ΡΡ ΠΎΠ±Π½ΠΎΠ²Π»Π΅Π½ΠΈΡ Π³ΠΎΠ»ΠΎΡΠΎΠ²ΡΡ Π΄Π°Π½Π½ΡΡ Π±Π΅Π· ΠΎΠ±Π½ΠΎΠ²Π»Π΅Π½ΠΈΡ ΠΌΠΎΠ±ΠΈΠ»ΡΠ½ΠΎΠ³ΠΎ ΠΏΡΠΈΠ»ΠΎΠΆΠ΅Π½ΠΈΡ. ΠΡΠΎΠ²Π΅ΡΠΊΠ° ΠΏΠΎΡΠ²Π»Π΅Π½ΠΈΡ ΠΎΠ±Π½ΠΎΠ²Π»Π΅Π½ΠΈΠΉ Π΄Π°Π½Π½ΡΡ Π΄Π»Ρ Π΄ΠΎΠ±Π°Π²Π»Π΅Π½Π½ΡΡ Π³ΠΎΠ»ΠΎΡΠΎΠ² ΠΈ ΡΠ·ΡΠΊΠΎΠ² ΠΏΡΠΎΠΈΠ·Π²ΠΎΠ΄ΠΈΡΡΡ Π°Π²ΡΠΎΠΌΠ°ΡΠΈΡΠ΅ΡΠΊΠΈ. ΠΡΠΎΠΌΠ΅ ΡΠΎΠ³ΠΎ, Π² Π½ΠΎΠ²ΠΎΠΌ Π²ΡΠΏΡΡΠΊΠ΅ ΡΠ΅Π°Π»ΠΈΠ·ΠΎΠ²Π°Π½Π° ΠΏΠΎΠ΄Π΄Π΅ΡΠΆΠΊΠ° ΠΏΠΎΠ»ΡΡΠΊΠΎΠ³ΠΎ ΡΠ·ΡΠΊΠ° ΠΈ Π΄ΠΎΠ±Π°Π²Π»Π΅Π½ Π½ΠΎΠ²ΡΠΉ Π³ΠΎΠ»ΠΎΡ Π΄Π»Ρ ΠΌΠ°ΠΊΠ΅Π΄ΠΎΠ½ΡΠΊΠΎΠ³ΠΎ ΡΠ·ΡΠΊΠ°. ΠΠ±Π΅ΡΠΏΠ΅ΡΠ΅Π½Π° ΡΠΎΠ²ΠΌΠ΅ΡΡΠΈΠΌΠΎΡΡΡ ΡΠΎ ΡΠ²Π΅ΠΆΠΈΠΌΠΈ Π°Π»ΡΡΠ°- ΠΈ Π±Π΅ΡΠ°-Π²ΡΠΏΡΡΠΊΠ°ΠΌΠΈ ΡΠΊΡΠ°Π½Π½ΠΎΠ³ΠΎ ΡΠΈΠ΄Π΅ΡΠ° NVDA. Π£ΡΡΡΠ°Π½Π΅Π½Ρ ΠΏΡΠΎΠ±Π»Π΅ΠΌΡ ΡΠΎ ΡΠ±ΠΎΡΠΊΠΎΠΉ Π½Π° ΠΏΠ»Π°ΡΡΠΎΡΠΌΠ΅ Linux, Π²ΠΎΠ·Π½ΠΈΠΊΠ°Π²ΡΠΈΠ΅ ΠΏΡΠΈ ΠΎΡΡΡΡΡΡΠ²ΠΈΠΈ Speech Dispatcher.
Masikhumbule ukuthi i-RHVoice isebenzisa ukuthuthukiswa kwephrojekthi ye-HTS (HMM/DNN-based Speech Synthesis System) kanye nendlela yokuhlanganisa ye-parametric enamamodeli ezibalo (Statistical Parametric Synthesis esekelwe ku-HMM - Hidden Markov Model). Inzuzo yemodeli yezibalo izindleko eziphansi ze-overhead namandla e-CPU angafuneki. Yonke imisebenzi yenziwa endaweni ohlelweni lomsebenzisi. Amazinga amathathu ekhwalithi yenkulumo asekelwa (izinga eliphansi, ukusebenza okuphezulu kanye nesikhathi sokuphendula sibe mfushane).
Uhlangothi olubi lwemodeli yezibalo izinga eliphansi lokuphimisa, elingafinyeleli ezingeni lama-synthesizers akhiqiza inkulumo esekelwe kwinhlanganisela yezingcezwana zenkulumo yemvelo, kodwa nokho umphumela uyafundeka futhi ufana nokusakaza okurekhodiwe kumbhobho. . Uma kuqhathaniswa, iphrojekthi ye-Silero, ehlinzeka ngenjini evulekile yokuhlanganisa inkulumo esekelwe kubuchwepheshe bokufunda komshini kanye nesethi yamamodeli olimi lwesiRashiya, iphakeme ngekhwalithi kune-RHVoice.
Kunezinketho zezwi eziyi-14 ezitholakalayo zolimi lwesiRashiya, kanye nesiNgisi ezi-6. Amazwi akhiwe ngokusekelwe ekurekhodweni kwenkulumo yemvelo. Kuzilungiselelo ungashintsha isivinini, iphimbo kanye nevolumu. Umtapo wolwazi we-Sonic ungasetshenziswa ukushintsha i-tempo. Kungenzeka ukuthi uthole ngokuzenzakalelayo futhi ushintshe izilimi ngokusekelwe ekuhlaziyweni kombhalo ofakiwe (isibonelo, amagama nezingcaphuno ngolunye ulimi, imodeli yokuhlanganisa yomdabu kulolo limi ingasetshenziswa). Amaphrofayili ezwi asekelwa, achaza inhlanganisela yamazwi ezilimi ezahlukene.
Source: opennet.ru