I-RHVoice 1.8.0 ukukhululwa kwe-synthesizer yenkulumo

Isistimu yokuhlanganisa inkulumo evulekile i-RHVoice 1.8.0 yakhululwa, ekuqaleni yathuthukiswa ukuze inikeze ukusekelwa kwekhwalithi ephezulu yolimi lwesiRashiya, kodwa yabe isiguqulelwa kwezinye izilimi, okuhlanganisa isiNgisi, isiPutukezi, isi-Ukrainian, isiKyrgyz, isiTatar nesiGeorgia. Ikhodi ibhalwe ngo-C++ futhi isatshalaliswa ngaphansi kwelayisensi ye-LGPL 2.1. Isekela umsebenzi ku-GNU/Linux, Windows ne-Android. Uhlelo luhambisana nezisetshenziswa ezijwayelekile ze-TTS (umbhalo-kuya-enkulumweni) zokuguqula umbhalo ube enkulumweni: SAPI5 (Windows), Speech Dispatcher (GNU/Linux) kanye ne-Android Text-To-Speech API, kodwa futhi ingasetshenziswa ku-NVDA. isifundi sesikrini. Umqambi kanye nonjiniyela oyinhloko we-RHVoice ngu-Olga Yakovleva, othuthukisa iphrojekthi naphezu kokungaboni ngokuphelele.

Π’ вСрсии 1.8 для ΠΏΠ»Π°Ρ‚Ρ„ΠΎΡ€ΠΌΡ‹ Android ΠΏΡ€Π΅Π΄Π»ΠΎΠΆΠ΅Π½Π° новая систСма управлСния голосовыми ΠΈ языковыми Π΄Π°Π½Π½Ρ‹ΠΌΠΈ, ΠΏΠΎΠ·Π²ΠΎΠ»ΡΡŽΡ‰Π°Ρ Π·Π°Π³Ρ€ΡƒΠΆΠ°Ρ‚ΡŒ обновлСния голосовых Π΄Π°Π½Π½Ρ‹Ρ… Π±Π΅Π· обновлСния мобильного прилоТСния. ΠŸΡ€ΠΎΠ²Π΅Ρ€ΠΊΠ° появлСния ΠΎΠ±Π½ΠΎΠ²Π»Π΅Π½ΠΈΠΉ Π΄Π°Π½Π½Ρ‹Ρ… для Π΄ΠΎΠ±Π°Π²Π»Π΅Π½Π½Ρ‹Ρ… голосов ΠΈ языков производится автоматичСски. ΠšΡ€ΠΎΠΌΠ΅ Ρ‚ΠΎΠ³ΠΎ, Π² Π½ΠΎΠ²ΠΎΠΌ выпускС Ρ€Π΅Π°Π»ΠΈΠ·ΠΎΠ²Π°Π½Π° ΠΏΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΊΠ° польского языка ΠΈ Π΄ΠΎΠ±Π°Π²Π»Π΅Π½ Π½ΠΎΠ²Ρ‹ΠΉ голос для макСдонского языка. ΠžΠ±Π΅ΡΠΏΠ΅Ρ‡Π΅Π½Π° ΡΠΎΠ²ΠΌΠ΅ΡΡ‚ΠΈΠΌΠΎΡΡ‚ΡŒ со свСТими Π°Π»ΡŒΡ„Π°- ΠΈ Π±Π΅Ρ‚Π°-выпусками экранного Ρ€ΠΈΠ΄Π΅Ρ€Π° NVDA. УстранСны ΠΏΡ€ΠΎΠ±Π»Π΅ΠΌΡ‹ со сборкой Π½Π° ΠΏΠ»Π°Ρ‚Ρ„ΠΎΡ€ΠΌΠ΅ Linux, возникавшиС ΠΏΡ€ΠΈ отсутствии Speech Dispatcher.

Masikhumbule ukuthi i-RHVoice isebenzisa ukuthuthukiswa kwephrojekthi ye-HTS (HMM/DNN-based Speech Synthesis System) kanye nendlela yokuhlanganisa ye-parametric enamamodeli ezibalo (Statistical Parametric Synthesis esekelwe ku-HMM - Hidden Markov Model). Inzuzo yemodeli yezibalo izindleko eziphansi ze-overhead namandla e-CPU angafuneki. Yonke imisebenzi yenziwa endaweni ohlelweni lomsebenzisi. Amazinga amathathu ekhwalithi yenkulumo asekelwa (izinga eliphansi, ukusebenza okuphezulu kanye nesikhathi sokuphendula sibe mfushane).

Uhlangothi olubi lwemodeli yezibalo izinga eliphansi lokuphimisa, elingafinyeleli ezingeni lama-synthesizers akhiqiza inkulumo esekelwe kwinhlanganisela yezingcezwana zenkulumo yemvelo, kodwa nokho umphumela uyafundeka futhi ufana nokusakaza okurekhodiwe kumbhobho. . Uma kuqhathaniswa, iphrojekthi ye-Silero, ehlinzeka ngenjini evulekile yokuhlanganisa inkulumo esekelwe kubuchwepheshe bokufunda komshini kanye nesethi yamamodeli olimi lwesiRashiya, iphakeme ngekhwalithi kune-RHVoice.

Kunezinketho zezwi eziyi-14 ezitholakalayo zolimi lwesiRashiya, kanye nesiNgisi ezi-6. Amazwi akhiwe ngokusekelwe ekurekhodweni kwenkulumo yemvelo. Kuzilungiselelo ungashintsha isivinini, iphimbo kanye nevolumu. Umtapo wolwazi we-Sonic ungasetshenziswa ukushintsha i-tempo. Kungenzeka ukuthi uthole ngokuzenzakalelayo futhi ushintshe izilimi ngokusekelwe ekuhlaziyweni kombhalo ofakiwe (isibonelo, amagama nezingcaphuno ngolunye ulimi, imodeli yokuhlanganisa yomdabu kulolo limi ingasetshenziswa). Amaphrofayili ezwi asekelwa, achaza inhlanganisela yamazwi ezilimi ezahlukene.

Source: opennet.ru

Engeza amazwana