Release of the Tesseract 5.3.4 text recognition system

The Tesseract 5.3.4 optical text recognition system has been released. It recognizes UTF-8 characters and text in more than 100 languages, including Russian, Kazakh, Belarusian and Ukrainian. Results can be saved as plain text or in HTML (hOCR), ALTO (XML), PDF and TSV formats. The system was originally created in 1985-1995 at Hewlett Packard's laboratory; in 2005 the code was opened under the Apache license and has since been developed with the participation of Google employees. The project's source code is distributed under the Apache 2.0 license.
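The output formats listed above correspond to config names passed to the `tesseract` command line. The following is a minimal sketch of assembling such a command, not part of the release itself. The helper `build_ocr_command` and the file names are hypothetical; the `-l` option and the `txt`, `hocr`, `alto`, `pdf` and `tsv` config names are those of the standard Tesseract CLI.

```python
from typing import List, Sequence

# Output config names understood by the tesseract command line.
KNOWN_FORMATS = ("txt", "hocr", "alto", "pdf", "tsv")


def build_ocr_command(image: str, output_base: str,
                      langs: Sequence[str] = ("eng",),
                      formats: Sequence[str] = ("txt",)) -> List[str]:
    """Return an argv list for one tesseract run (hypothetical helper)."""
    unknown = [f for f in formats if f not in KNOWN_FORMATS]
    if unknown:
        raise ValueError(f"unsupported output format(s): {unknown}")
    # Several languages are joined with '+', for example "rus+ukr".
    return ["tesseract", image, output_base, "-l", "+".join(langs), *formats]


if __name__ == "__main__":
    # Assumed input file "scan.png"; results would go to scan.hocr and scan.pdf.
    cmd = build_ocr_command("scan.png", "scan",
                            langs=("rus", "ukr"), formats=("hocr", "pdf"))
    print(" ".join(cmd))
```

The list can be handed to `subprocess.run` on a machine where Tesseract and the requested language models are installed.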

Tesseract consists of a console utility and the libtesseract library for embedding OCR functionality into other applications. Third-party GUI front ends that support Tesseract include gImageReader, VietOCR and YAGF. Two recognition engines are offered: a classic one that recognizes text at the level of individual character patterns, and a newer one based on a machine learning system built on an LSTM recurrent neural network, which is optimized for recognizing whole lines and provides a significant increase in accuracy. Ready-made trained models have been published for 123 languages. To optimize performance, modules are provided that use OpenMP and the AVX2, AVX, AVX512F, NEON or SSE4.1 SIMD instructions.
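On the command line, the choice between the two engines is made with the `--oem` option. The sketch below only maps readable names to the numeric modes documented by `tesseract --help-oem`; the `EngineMode` enum and `engine_args` helper are hypothetical names introduced for illustration. Note that the classic engine only works with trained data files that still contain the legacy model.

```python
from enum import IntEnum
from typing import List


class EngineMode(IntEnum):
    """Values accepted by tesseract's --oem option."""
    LEGACY = 0           # classic character-pattern engine only
    LSTM = 1             # LSTM neural-network engine only
    LEGACY_AND_LSTM = 2  # both engines combined
    DEFAULT = 3          # whichever engine the trained data supports


def engine_args(mode: EngineMode) -> List[str]:
    """Return the command-line arguments selecting an engine (hypothetical helper)."""
    return ["--oem", str(int(mode))]


if __name__ == "__main__":
    # Assumed file names; the options follow the image and output base.
    cmd = ["tesseract", "page.png", "page", *engine_args(EngineMode.LSTM)]
    print(" ".join(cmd))
```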

Main improvements:

  • Improved loading of images by URL, with the file downloaded using the libcurl library. A User-Agent header is now set when downloading. A new curl_cookiefile parameter has been added for using a cookie file.
  • The ScrollView server uses TCP as the preferred protocol.
  • When running "combine_tessdata -d", output is now written to stdout instead of stderr.
  • Fixed build issues when using autoconf and related tools.
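Two of the changes above affect how Tesseract is scripted. The following is a rough sketch under stated assumptions: that the installed `tesseract` binary was built with libcurl and accepts an image URL in place of a file name, and that `curl_cookiefile` is set like other parameters through the `-c NAME=VALUE` option. The helper names, the URL and the file paths are hypothetical.

```python
import subprocess
from typing import List, Optional


def build_url_command(url: str, output_base: str,
                      cookie_file: Optional[str] = None) -> List[str]:
    """Return an argv list that asks tesseract to OCR an image fetched by URL."""
    cmd = ["tesseract", url, output_base]
    if cookie_file:
        # Assumption: curl_cookiefile is passed as an ordinary -c parameter.
        cmd += ["-c", f"curl_cookiefile={cookie_file}"]
    return cmd


def list_traineddata_components(path: str) -> str:
    """Run `combine_tessdata -d` and return the text it writes to stdout.

    With 5.3.4 the listing arrives on stdout; older releases wrote it to
    stderr, so this function would return an empty string there.
    """
    result = subprocess.run(["combine_tessdata", "-d", path],
                            capture_output=True, text=True, check=True)
    return result.stdout


if __name__ == "__main__":
    # Hypothetical URL and cookie file, shown only to illustrate the shape.
    print(build_url_command("https://example.com/page.png", "page",
                            cookie_file="cookies.txt"))
```

Only `build_url_command` can be exercised without Tesseract installed; `list_traineddata_components` needs the `combine_tessdata` tool and a real `.traineddata` file.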

Source: opennet.ru
