RHVoice 1.6.0 speech synthesizer released

The open speech synthesis system RHVoice 1.6.0 has been released. It was originally developed to provide high-quality support for the Russian language, and was later adapted to other languages, including English, Portuguese, Ukrainian, Kyrgyz, Tatar and Georgian. The code is written in C++ and distributed under the LGPL 2.1 license. It runs on GNU/Linux, Windows and Android. The program is compatible with the standard TTS (text-to-speech) interfaces for converting text to speech: SAPI5 (Windows), Speech Dispatcher (GNU/Linux) and the Android Text-To-Speech API, and it can also be used with the NVDA screen reader. The creator and lead developer of RHVoice is Olga Yakovleva, who develops the project despite being completely blind.
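As an illustration of the Speech Dispatcher route mentioned above, the following Python sketch builds a command line for `spd-say`, the command-line client shipped with Speech Dispatcher. It assumes that the RHVoice output module is registered under the name `rhvoice` (a common setup, but it depends on the local configuration). The helper names `build_spd_say_command` and `speak` are hypothetical and are not part of RHVoice or Speech Dispatcher.

```python
import shutil
import subprocess


def build_spd_say_command(text, language="ru", rate=0, pitch=0, volume=0,
                          module="rhvoice"):
    """Build an spd-say argument list that routes speech to one output module.

    The module name "rhvoice" is an assumption about the local
    Speech Dispatcher configuration.
    """
    # spd-say accepts rate, pitch and volume in the range -100..100.
    for name, value in (("rate", rate), ("pitch", pitch), ("volume", volume)):
        if not -100 <= value <= 100:
            raise ValueError(f"{name} must be between -100 and 100, got {value}")
    return [
        "spd-say",
        "-o", module,        # output module (synthesizer)
        "-l", language,      # language code
        "-r", str(rate),     # speech rate
        "-p", str(pitch),    # pitch
        "-i", str(volume),   # volume
        "--",                # end of options, so text may start with "-"
        text,
    ]


def speak(text, **options):
    """Run spd-say if it is installed; return False when it is not available."""
    if shutil.which("spd-say") is None:
        return False
    subprocess.run(build_spd_say_command(text, **options), check=True)
    return True
```

On a system where Speech Dispatcher and the RHVoice module are installed, `speak("Привет", rate=20)` should hand the text to RHVoice. Elsewhere it simply returns `False`.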

The new version adds 5 new voice variants for Russian speech. Support for the Albanian language has been implemented. The dictionary for the Ukrainian language has been updated. Support for voicing emoji characters has been expanded. Bugs in the Android application have been fixed, the import of user dictionaries has been simplified, and support for Android 11 has been added. New settings and features have been added to the engine core, including g2p.case, word_break and support for equalization filters.

Recall that RHVoice builds on the work of the HTS project (HMM/DNN-based Speech Synthesis System) and uses a parametric synthesis method with statistical models (Statistical Parametric Synthesis based on HMM, the Hidden Markov Model). The advantage of the statistical model is its low overhead and modest CPU requirements. All operations are performed locally on the user's system. Three levels of speech quality are supported (the lower the quality, the higher the performance and the shorter the response time).

The downside of the statistical model is relatively low pronunciation quality, which does not reach the level of synthesizers that generate speech by combining fragments of natural speech. Nevertheless, the result is quite intelligible and resembles a recording played back through a loudspeaker. For comparison, the Silero project, which provides an open speech synthesis engine based on machine learning technology together with a set of models for the Russian language, surpasses RHVoice in quality.

There are 13 voice options available for Russian and 5 for English. The settings allow the rate, pitch and volume to be changed. The Sonic library can be used to change the tempo. Languages can be detected and switched automatically based on analysis of the input text (for example, for words and quotations in another language, the synthesis model native to that language can be used). Voice profiles are supported, which define combinations of voices for different languages.
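To make the idea of language switching and voice profiles more concrete, here is a toy Python sketch. It is not RHVoice's actual algorithm. It assumes that language can be guessed from the script of each letter alone (Cyrillic treated as Russian, Latin as English), and the profile, its voice names and all function names are hypothetical placeholders.

```python
import unicodedata

# Hypothetical voice profile: one voice per language. The names are placeholders.
PROFILE = {"ru": "russian_voice", "en": "english_voice"}


def script_language(ch):
    """Guess a language from a letter's script; None for digits, spaces, punctuation."""
    if not ch.isalpha():
        return None
    name = unicodedata.name(ch, "")
    if name.startswith("CYRILLIC"):
        return "ru"
    if name.startswith("LATIN"):
        return "en"
    return None


def split_by_language(text, default="ru"):
    """Split text into (language, fragment) runs.

    Neutral characters (spaces, digits, punctuation) stay in the current run.
    """
    runs = []
    current_lang = None
    buffer = ""
    for ch in text:
        lang = script_language(ch)
        # A letter in a different script closes the current run.
        if lang is not None and current_lang is not None and lang != current_lang:
            runs.append((current_lang, buffer))
            buffer = ""
        if lang is not None:
            current_lang = lang
        buffer += ch
    if buffer:
        runs.append((current_lang or default, buffer))
    return runs


def plan_voices(text, profile=PROFILE, default="ru"):
    """Pair every fragment with the voice the profile assigns to its language."""
    return [(profile[lang], fragment)
            for lang, fragment in split_by_language(text, default)]
```

For the input `"Привет, world!"`, `plan_voices` returns `[("russian_voice", "Привет, "), ("english_voice", "world!")]`. A real engine has to do considerably more, for example distinguishing between languages that share a script.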

Source: opennet.ru
