Release of the Tesseract 5.2 text recognition system

The Tesseract 5.2 optical character recognition system has been released. It recognizes UTF-8 characters and text in more than 100 languages, including Russian, Kazakh, Belarusian and Ukrainian. Results can be saved as plain text or in the HTML (hOCR), ALTO (XML), PDF and TSV formats. The system was originally developed in 1985-1995 at a Hewlett Packard laboratory; in 2005 the code was opened under the Apache license, and development has continued with the participation of Google employees. The project's source code is distributed under the Apache 2.0 license.
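As a rough illustration of how the output formats are selected, the sketch below assembles a command line for the tesseract console utility, where each format is requested by naming its config file (txt, hocr, alto, pdf, tsv) after the output base name. The helper build_ocr_command, the file names and the chosen language code are hypothetical; the sketch only builds the argument list and does not run Tesseract.

```python
from typing import List, Sequence

# Output config names that can be listed on the tesseract command line.
KNOWN_FORMATS = ("txt", "hocr", "alto", "pdf", "tsv")


def build_ocr_command(image: str, output_base: str,
                      lang: str = "eng",
                      formats: Sequence[str] = ("txt",)) -> List[str]:
    """Return an argv list for the tesseract CLI (hypothetical helper)."""
    unknown = [f for f in formats if f not in KNOWN_FORMATS]
    if unknown:
        raise ValueError(f"unsupported output format(s): {unknown}")
    # General layout: tesseract IMAGE OUTPUTBASE [options...] [configfile...]
    return ["tesseract", image, output_base, "-l", lang, *formats]
```

The resulting list can be handed to subprocess.run on a machine where Tesseract and the requested language data are installed; for example, build_ocr_command("scan.png", "result", "ukr", ["hocr", "pdf"]) asks for both an hOCR file and a PDF for a Ukrainian-language scan.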

Tesseract consists of a console utility and the libtesseract library for embedding OCR functionality in other applications. Third-party GUI front ends that support Tesseract include gImageReader, VietOCR and YAGF. Two recognition engines are provided: the classic engine, which recognizes text at the level of individual character patterns, and a newer engine based on a machine learning system that uses an LSTM recurrent neural network, is optimized for recognizing whole lines, and delivers a significant increase in accuracy. Ready-made trained models have been published for 123 languages. To improve performance, modules are provided that use OpenMP and the AVX2, AVX, AVX512F, NEON or SSE4.1 SIMD instructions.
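On the command line, the choice between the two engines is made with the --oem option. The following sketch maps the engine modes to that option; the names EngineMode and engine_args are hypothetical, and it is an assumption of this example that the installed model data contains the legacy component when mode 0 or 2 is requested.

```python
from enum import IntEnum
from typing import List


class EngineMode(IntEnum):
    """Values accepted by the tesseract --oem option."""
    LEGACY = 0            # classic character-pattern engine
    LSTM = 1              # LSTM neural network engine
    LEGACY_AND_LSTM = 2   # both engines combined
    DEFAULT = 3           # let Tesseract pick based on the available data


def engine_args(mode: EngineMode) -> List[str]:
    """Return the command-line arguments that select the given engine."""
    return ["--oem", str(int(mode))]
```

These arguments are meant to be appended to a tesseract invocation, for example ["tesseract", "scan.png", "result"] + engine_args(EngineMode.LSTM).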

Main improvements in Tesseract 5.2:

  • Added optimizations implemented with Intel AVX512F instructions.
  • The C API now provides a function for initializing tesseract with a machine learning model loaded from memory.
  • Added the invert_threshold parameter, which sets the threshold for inverting text lines. The default value is 0.7. To disable inversion, set the value to 0.
  • Improved handling of very large documents on 32-bit hosts.
  • Code has been migrated from std::regex functions to std::string.
  • Improved build scripts for Autotools, CMake and continuous integration systems.
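Parameters such as invert_threshold are normally passed to the console utility with the -c VAR=VALUE option. The sketch below formats that argument; the helper name invert_threshold_args is hypothetical, and it is an assumption of this example that invert_threshold can be set this way like other Tesseract parameters.

```python
from typing import List


def invert_threshold_args(threshold: float = 0.7) -> List[str]:
    """Return CLI arguments that set invert_threshold (hypothetical helper).

    The default of 0.7 mirrors the value given in the release notes;
    a value of 0 is described there as disabling inversion.
    """
    return ["-c", f"invert_threshold={threshold}"]
```

For instance, appending invert_threshold_args(0.0) to a tesseract command would request that line inversion be turned off.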

    source: budenet.ru
