Linear regression uye nzira dzekupora kwayo

Linear regression uye nzira dzekupora kwayo
Source: xkdc

Linear regression ndeimwe yeakakosha algorithms yenzvimbo dzakawanda dzine chekuita nekuongorora data. Chikonzero cheizvi chiri pachena. Iyi iri nyore uye inonzwisisika algorithm, iyo yakakonzera kushandiswa kwayo kwakapararira kwemakumi mazhinji, kana asiri mazana emakore. Pfungwa ndeyokuti isu tinotora mutsara wekutsamira kweimwe shanduko pane seti yezvimwe zvinosiyana, uye toedza kudzoreredza kutsamira uku.

Asi chinyorwa ichi hachisi chekushandisa mutsara kudzoreredza kugadzirisa matambudziko anoshanda. Pano isu tichatarisa zvinonakidza maficha ekushandiswa kweakagoverwa algorithms kupora kwayo, iyo yatakasangana nayo pakunyora muchina wekudzidza module mu. Apache Ignite. Idiki diki svomhu, kudzidza muchina, uye kugoverwa komputa kunogona kukubatsira iwe kufunga maitiro ekuita mutsara regression kunyangwe data rako richigoverwa muzviuru zvemanodhi.

Tiri kutaura nezvei?

Isu takatarisana nebasa rekudzoreredza kutsamira kwemutsara. Se data yekupinza, seti yemavheji ezvinonzi akazvimirira akasiyana anopihwa, rimwe nerimwe rine chekuita nehumwe kukosha kweiyo inotsamira shanduko. Iyi data inogona kumiririrwa muchimiro chematiriki maviri:

Linear regression uye nzira dzekupora kwayo

Zvino, sezvo kutsamira kuchifungidzirwa, uyezve, mutsara, isu tichanyora fungidziro yedu muchimiro chechigadzirwa chematrices (kurerutsa kurekodha, pano uye pazasi zvinofungidzirwa kuti nguva yemahara ye equation yakavanzwa kumashure. Linear regression uye nzira dzekupora kwayo, uye koramu yekupedzisira yematrix Linear regression uye nzira dzekupora kwayo ine mayunitsi):

Linear regression uye nzira dzekupora kwayo

Inonzwika zvakanyanya senge system yemutsara equation, handizvo? Zvinoita senge, asi kazhinji panenge pasisina mhinduro kuhurongwa hwakadaro hweequations. Chikonzero cheichi ruzha, chiripo munenge chero data chaiyo. Chimwe chikonzero chinogona kunge chiri kushaikwa kwekutsamira kwemutsara sekudaro, izvo zvinogona kurwiswa nekuunza mamwe machinjiro asina mutsara anoenderana neaya ekutanga. Chimbofunga muenzaniso unotevera:
Linear regression uye nzira dzekupora kwayo
Source: Wikipedia

Uyu muenzaniso wakapfava wekudzokororwa kwemutsara unoratidza hukama hweimwe shanduko (pamwe neaxis Linear regression uye nzira dzekupora kwayo) kubva kune imwe shanduko (pamwe neaxis Linear regression uye nzira dzekupora kwayo) Kuti hurongwa hwemitsara yemitsara inoenderana nemuenzaniso uyu ive nemhinduro, mapoinzi ese anofanirwa kunge ari pamutsetse wakatwasuka wakafanana. Asi ichocho hachisi chokwadi. Asi havanyepe pamutsetse wakatwasanuka chaizvo nekuda kweruzha (kana nekuti fungidziro yehukama hwemutsara yaive isiriyo). Nekudaro, kuitira kudzoreredza hukama hwemutsara kubva kune chaiyo data, zvinowanzodikanwa kuunza imwezve fungidziro: iyo data yekupinza ine ruzha uye ruzha urwu rune. normal distribution. Iwe unogona kuita fungidziro nezve mamwe marudzi ekuparadzira ruzha, asi muhuwandu hwakawanda hwezviitiko ndiko kugoverwa kwakajairika kunotariswa, izvo zvinozokurukurwa zvakare.

Maximum mukana nzira

Saka, takafungidzira kuvepo kweruzha runowanzo kugovaniswa. Chii chekuita mumamiriro ezvinhu akadaro? Panyaya iyi mumasvomhu pane uye anoshandiswa zvakanyanya yakanyanya mukana nzira. Muchidimbu, kukosha kwayo kuri mukusarudza mukana mabasa uye kuwedzera kwayo kunotevera.

Isu tinodzokera kudzoreredza hukama hwemutsara kubva kune data neruzha rwakajairika. Ziva kuti hukama hunofungidzirwa hwemutsara ndiyo tarisiro yemasvomhu Linear regression uye nzira dzekupora kwayo kugovera kwagara kuriko. Panguva imwecheteyo, mukana wekuti Linear regression uye nzira dzekupora kwayo inotora imwe kukosha kana imwe, zvichienderana nekuvapo kwezvinhu zvinoonekwa Linear regression uye nzira dzekupora kwayo, sezvinotevera:

Linear regression uye nzira dzekupora kwayo

Ngatichitsivai zvino Linear regression uye nzira dzekupora kwayo ΠΈ Linear regression uye nzira dzekupora kwayo Izvo zvakasiyana zvatinoda ndezvi:

Linear regression uye nzira dzekupora kwayo

Chasara kuwana vector Linear regression uye nzira dzekupora kwayo, apo mukana uyu mukuru. Kuti uwedzere basa rakadaro, zviri nyore kutanga kutora logarithm yayo (iyo logarithm yebasa ichasvika pakakwirira panzvimbo imwechete nebasa racho pacharo):

Linear regression uye nzira dzekupora kwayo

Izvo, zvakare, zvinouya pasi pakuderedza basa rinotevera:

Linear regression uye nzira dzekupora kwayo

Nenzira, iyi inonzi nzira zvikweya zvishoma. Kazhinji zvese zviri pamusoro apa zvinosiiwa uye nzira iyi inongoshandiswa.

QR kuparara

Hushoma hwebasa riri pamusoro rinowanikwa nekutsvaga poindi iyo gradient yebasa iri razero. Uye iyo gradient ichanyorwa sezvinotevera:

Linear regression uye nzira dzekupora kwayo

QR kuparara inzira yematrix yekugadzirisa dambudziko rekudzikisa rinoshandiswa mudiki diki nzira. Panyaya iyi, tinonyora zvakare equation mune matrix fomu:

Linear regression uye nzira dzekupora kwayo

Saka tinoparadza matrix Linear regression uye nzira dzekupora kwayo ku matrices Linear regression uye nzira dzekupora kwayo ΠΈ Linear regression uye nzira dzekupora kwayo uye ita shanduko dzakatevedzana (iyo QR decomposition algorithm pachayo haizotariswe pano, chete kushandiswa kwayo zvine chekuita nebasa riripo):

Linear regression uye nzira dzekupora kwayo

Matrix Linear regression uye nzira dzekupora kwayo iri orthogonal. Izvi zvinotibvumira kubvisa basa Linear regression uye nzira dzekupora kwayo:

Linear regression uye nzira dzekupora kwayo

Uye kana ukatsiva Linear regression uye nzira dzekupora kwayo pamusoro Linear regression uye nzira dzekupora kwayo, zvobva zvaita Linear regression uye nzira dzekupora kwayo. Tichifunga izvozvo Linear regression uye nzira dzekupora kwayo ndeyepamusoro petriangular matrix, inoita seizvi:

Linear regression uye nzira dzekupora kwayo

Izvi zvinogona kugadziriswa uchishandisa nzira yekutsiva. Element Linear regression uye nzira dzekupora kwayo iri se Linear regression uye nzira dzekupora kwayo, yapfuura element Linear regression uye nzira dzekupora kwayo iri se Linear regression uye nzira dzekupora kwayo uye zvichingodaro.

Zvakakosha kucherechedza pano kuti kuoma kweiyo algorithm inoguma nekuda kwekushandiswa kweQR decomposition yakaenzana Linear regression uye nzira dzekupora kwayo. Uyezve, zvisinei nekuti iyo matrix yekuwedzera mashandiro akanyatsoenderana, hazvigoneke kunyora inoshanda yakagoverwa shanduro yealgorithm iyi.

Gradient Descent

Paunenge uchitaura nezvekuderedza basa, zvinogara zvakakodzera kuyeuka nzira ye (stochastic) gradient descent. Iyi inzira yakapfava uye inoshanda yekudzikisa yakavakirwa pakudzokorora kuverengera gradient yebasa pane imwe nzvimbo wobva waichinjisa munzira yakatarisana negradient. Nhanho imwe neimwe yakadaro inounza mhinduro pedyo nekuderera. Iyo gradient ichiri kutaridzika zvakafanana:

Linear regression uye nzira dzekupora kwayo

Iyi nzira zvakare yakanyatso enzanirana uye yakagovaniswa nekuda kweiyo mutsara zvivakwa zve gradient opareta. Ziva kuti mufomula iri pamusoro, pasi pechiratidzo chehuwandu pane mazwi akazvimirira. Mune mamwe mazwi, tinogona kuverenga gradient takazvimiririra kune ese ma indices Linear regression uye nzira dzekupora kwayo kubva pakutanga kusvika Linear regression uye nzira dzekupora kwayo, mukufambirana neizvi, verenga gradient ye indices ne Linear regression uye nzira dzekupora kwayo up to Linear regression uye nzira dzekupora kwayo. Wobva wawedzera ma gradients anoguma. Mhedzisiro yekuwedzera ichave yakafanana kana isu takabva taverenga gradient ye indices kubva kune yekutanga kusvika Linear regression uye nzira dzekupora kwayo. Nekudaro, kana iyo data ikagoverwa pakati pezvimedu zvakati wandei zve data, gradient inogona kuverengerwa yakazvimirira pachidimbu chega chega, uyezve mhedzisiro yezviverengero izvi inogona kupfupikiswa kuti uwane yekupedzisira mhedzisiro:

Linear regression uye nzira dzekupora kwayo

Kubva pakuona kwekuita, izvi zvinoenderana neparadigm MepuDeredza. Panhanho imwe neimwe yekudzika kwe gradient, basa rinotumirwa kune yega data node kuverenga gradient, ipapo iwo akaverengerwa gradients anounganidzwa pamwechete, uye mhedzisiro yehuwandu hwavo inoshandiswa kugadzirisa mhedzisiro.

Zvisinei nekureruka kwekuita uye kugona kuita muMepuReduce paradigm, gradient descent inewo zvinokanganisa. Kunyanya, huwandu hwematanho anodiwa kuti uwane convergence yakanyanya kukwirira zvichienzaniswa nedzimwe nzira dzakanyanya hunyanzvi.

LSQR

LSQR ndiyo imwe nzira yekugadzirisa dambudziko, iyo inokodzera zvese kudzoreredza mutsara regression uye yekugadzirisa masisitimu emutsetse equation. Chinhu chayo chikuru ndechekuti inosanganisa zvakanakira matrix nzira uye nzira yekudzokorora. Kuitwa kweiyi nzira kunogona kuwanikwa mumaraibhurari ese ari maviri SciPyuye mukati MATLAB. Tsanangudzo yenzira iyi haizopiwi pano (inogona kuwanikwa muchinyorwa LSQR: Algorithm ye sparse linear equations uye sparse mashoma masikweya) Pane kudaro, nzira icharatidzwa kugadzirisa LSQR kuurayiwa munzvimbo yakagoverwa.

Iyo LSQR nzira yakavakirwa pa nzira yediagonalization. Iyi inzira yekudzokorora, imwe neimwe iteration ine matanho anotevera:
Linear regression uye nzira dzekupora kwayo

Asi kana tikafunga kuti matrix Linear regression uye nzira dzekupora kwayo yakaganhurwa yakatwasuka, ipapo imwe neimwe iteration inogona kumiririrwa sematanho maviri MepuReduce. Nenzira iyi, zvinokwanisika kudzikisa kufambiswa kwedata panguva yega yega iteration (mavheji chete ane hurefu hwakaenzana nenhamba yezvisingazivikanwe):

Linear regression uye nzira dzekupora kwayo

Iyi ndiyo nzira inoshandiswa pakuita linear regression in Apache Ignite ML.

mhedziso

Kune akawanda mutsara regression kudzoreredza algorithms, asi haasi ese anogona kuiswa mumamiriro ese. Saka QR decomposition yakanakira mhinduro chaiyo pane madiki data seti. Gradient descent iri nyore kushandisa uye inobvumidza iwe nekukurumidza kuwana mhinduro yekufungidzira. Uye LSQR inosanganisa akanakisa zvivakwa zveaviri algorithms, sezvo ichigona kugovaniswa, inochinjika nekukurumidza kana ichienzaniswa nekudzika kwegradient, uye zvakare inobvumira kumisa kwekutanga kweiyo algorithm, kusiyana neQR kuora, kuwana mhinduro.

Source: www.habr.com

Voeg