Daim ntawv ceeb toom tias muaj ntau qhov blocky artifacts hauv daim duab ntawm sab xis. Qhov no yog ploj ntaub ntawv. Cov pixels nyob sib ze ntawm cov xim zoo sib xws yog compressed li ib cheeb tsam kom txuag tau qhov chaw, tab sis cov ntaub ntawv hais txog qhov tseeb pixels ploj. Tau kawg, cov algorithms siv nyob rau hauv JPEG, PNG, thiab lwm yam codecs yog ntau nyuaj, tab sis qhov no yog ib tug zoo intuitive piv txwv ntawm lossy compression. Lossless compression yog qhov zoo, tab sis lossless compressed cov ntaub ntawv siv ntau qhov chaw disk. Muaj ntau txoj hauv kev zoo dua los ua kom cov duab tsis poob ntau cov ntaub ntawv, tab sis lawv qeeb heev thiab siv ntau txoj hauv kev. Qhov no txhais tau hais tias lawv tsis tuaj yeem khiav ua ke ntawm ntau lub CPU lossis GPU cores. Qhov kev txwv no ua rau lawv ua tsis tiav hauv kev siv txhua hnub.
Convolutional Neural Network tswv yim
Yog hais tias ib yam dab tsi yuav tsum tau muab xam thiab cov kev xam yuav kwv yees, ntxiv neural network. Cov kws sau ntawv tau siv tus qauv convolutional neural network los txhim kho cov duab compression. Cov txheej txheem nthuav tawm tsis yog tsuas yog ua tau zoo ntawm cov kev daws teeb meem zoo tshaj plaws (yog tias tsis zoo), nws kuj tuaj yeem siv cov kev sib txuas sib txuas, uas ua rau muaj kev nrawm nrawm. Qhov laj thawj yog vim li cas Convolutional Neural Networks (CNNs) zoo heev ntawm kev rho tawm cov ntaub ntawv spatial los ntawm cov duab, uas tau nthuav tawm hauv daim ntawv ntau dua (piv txwv li, tsuas yog "tseem ceeb" ntawm daim duab tau khaws cia). Cov kws sau ntawv xav siv lub peev xwm CNN no los sawv cev rau cov duab zoo dua.
architecture
Cov kws sau ntawv tau thov kom muaj ob lub network. Thawj lub network yuav siv cov duab raws li cov tswv yim thiab tsim ib qho kev sib cog lus sawv cev (ComCNN). Cov zis ntawm lub network no ces ua tiav los ntawm tus qauv codec (xws li JPEG). Tom qab ua tiav los ntawm tus codec, cov duab tau dhau mus rau lub network thib ob, uas "kho" cov duab los ntawm codec hauv kev sim rov qab cov duab qub. Cov neeg sau npe hu ua lub network Reconstructive CNN (RecCNN). Zoo li GANs, ob lub network tau kawm rov ua dua.
ComCNN Compact sawv cev raug xa mus rau tus qauv codec
RecCNN. ComCNN cov zis tau nce thiab pub rau RecCNN, uas yuav sim kawm qhov seem
Cov zis codec tau nce thiab tom qab ntawd dhau mus rau RecCNN. RecCNN yuav sim ua cov duab kom ze rau qhov qub li sai tau.
Re() sawv cev rau RecCNN. Qhov kev sib npaug no tsuas yog qhia lub ntsiab lus ntawm kab zauv 1.1 rau RecCNN. ΞΈ qhia txog RecCNN cov kev qhia tsis tau (lub kaus mom saum toj txhais tau tias qhov tsis raug kho).
Intuitive txhais
Qhov sib npaug 1.0 yuav ua rau ComCNN hloov nws qhov hnyav kom thaum rov tsim dua nrog RecCNN, cov duab kawg zoo li zoo ib yam li cov duab nkag. Qhov thib ob RecCNN poob muaj nuj nqi yog txhais raws li hauv qab no:
Kev sib npaug 2.0
Lus piav qhia
Ib zaug ntxiv, qhov ua haujlwm yuav zoo li nyuaj, tab sis qhov no yog rau feem ntau tus qauv neural network poob haujlwm (MSE).
Kev sib npaug 2.1
Co() txhais tau tias codec tso zis, x nrog lub kaus mom saum toj txhais tau tias ComCNN tso zis. ΞΈ2 yog RecCNN kev qhia tsis tau, res() tsuas yog RecCNN qhov seem tso zis. Nws yog ib qho tsim nyog sau cia tias RecCNN tau kawm txog qhov sib txawv ntawm Co() thiab cov duab nkag, tab sis tsis yog ntawm cov duab nkag.
Intuitive txhais
Qhov sib npaug 2.0 yuav ua rau RecCNN hloov nws qhov hnyav kom cov zis zoo ib yam li qhov ua tau rau cov duab nkag.
Txoj kev kawm
Cov qauv raug cob qhia rov ua dua, zoo li GAN. Qhov hnyav ntawm thawj tus qauv raug kho thaum qhov hnyav ntawm tus qauv thib ob tab tom hloov kho, ces qhov hnyav ntawm tus qauv thib ob raug kho thaum thawj tus qauv raug cob qhia.
Kev sim
Cov kws sau ntawv piv lawv txoj kev nrog rau txoj kev uas twb muaj lawm, suav nrog cov codecs yooj yim. Lawv txoj kev ua tau zoo dua li lwm tus thaum tswj kev kub ceev ntawm cov khoom siv tsim nyog. Tsis tas li ntawd, cov kws sau ntawv tau sim siv ib qho ntawm ob lub network nkaus xwb thiab sau tseg qhov kev poob qis hauv kev ua haujlwm.
Kev sib piv ntawm cov qauv sib piv (SSIM). Cov txiaj ntsig siab qhia qhov zoo ib yam li qhov qub. Bold hom qhia qhov tshwm sim ntawm kev ua haujlwm ntawm cov kws sau ntawv
xaus
Peb tau saib txoj hauv kev tshiab los siv kev kawm tob rau cov duab compression, thiab tham txog qhov ua tau ntawm kev siv neural networks hauv kev ua haujlwm dhau ntawm "kev ua haujlwm" dav dav xws li kev faib cov duab thiab kev ua cov lus. Txoj kev no tsis yog tsuas yog tsis zoo rau cov kev cai niaj hnub, tab sis kuj tso cai rau koj ua cov duab nrawm dua.
Kev kawm neural tes hauj lwm tau yooj yim dua, vim peb tau ua tus lej tshaj tawm tshwj xeeb rau Habravchan HABR, muab 10% luv nqi ntxiv rau qhov luv nqi qhia rau ntawm daim ntawv lo.