Nekuda kwekuvharirwa, vazhinji zvino vanopedza chikamu cheshumba chenguva yavo kumba, uye nguva ino vanogona, uye kunyangwe inofanirwa, kushandiswa zvine mutsindo.
Pakutanga kwekuvharirwa, ndakafunga kupedzisa mamwe mapurojekiti andakatanga mwedzi mishoma yapfuura. Imwe yemapurojekiti aya yaive kosi yevhidhiyo "R Mutauro weVashandisi veExcel". Nekosi iyi, ndaida kudzikisa chipingamupinyi chekupinda muR, uye zvishoma kuzadza kushomeka kuripo kwezvinhu zvekudzidzisa pane iyi nyaya muchiRussia.
Kana zvese zvichishanda nedata mukambani yaunoshandira ichiri kuitwa muExcel, saka ini ndinokurudzira kuti uzive nezvechizvino-zvino, uye panguva imwechete yakasununguka zvachose, data yekuongorora chishandiso.
Zviri mukati
Kana iwe uchifarira kuongorora data, unogona kunge uchifarira zvangu
nezvakanyorwa Nezve iyo kosi Kosi iyi ndeyaani? Kosi purogiramu
4.1.Chidzidzo 1: Kuisa R mutauro uye RStudio budiriro nharaunda
4.2.Chidzidzo 2: Basic Data Structures muR
4.3.Chidzidzo 3: Kuverenga data kubva kuTSV, CSV, Excel mafaera uye Google Sheets
4.4.Chidzidzo 4: Kusefa mitsara, kusarudza uye kutumidzazve makoramu, mapaipi muR
4.5.Chidzidzo 5: Kuwedzera Makoramu Akaverengerwa paTafura iri muR
4.6.Chidzidzo 6: Kuunganidza uye Kuunganidza Data muR
4.7.Chidzidzo 7: Yakatwasuka uye Yakachinjika Kujoinwa kwematafura muR
4.8.Chidzidzo 8: Window Mabasa muR
4.9.Chidzidzo 9: Matafura anotenderera kana analogue yematafura epivot muR
4.10.Chidzidzo 10: Kuisa Mafaira eJSON muR uye Kushandura Mazita kuMatafura
4.11.Chidzidzo 11: Kuronga Nekukurumidza Kushandisa qplot() Basa
4.12.Chidzidzo 12: Kuronga Mutsetse neChirongwa Zvirongwa Uchishandisa iyo ggplot2 Package. mhedziso
nezvakanyorwa
Nezve iyo kosi
Iyo kosi yakarongeka yakatenderedza zvivakwa tidyverse
, uye mapakeji akabatanidzwa mairi: readr
, vroom
, dplyr
, tidyr
, ggplot2
. Ehe, kune mamwe mapakeji akanaka muR anoita mashandiro akafanana, semuenzaniso data.table
, asi mashoko akanyorwa tidyverse
intuitive, iri nyore kuverenga kunyangwe kumushandisi asina kudzidziswa, saka ndinofunga zviri nani kutanga kudzidza mutauro weR tidyverse
.
Iyo kosi inokutungamira iwe kuburikidza ese ekuongorora data mashandiro, kubva pakurodha kusvika pakuona mhedzisiro yapera.
Sei R uye kwete Python? Nekuti R mutauro unoshanda, zviri nyore kune vashandisi veExcel kuchinjira kwairi, nekuti hapana chikonzero chekunyura mune yechinyakare-inotungamirwa hurongwa.
Parizvino, zvidzidzo zvevhidhiyo gumi nembiri zvakarongwa, kubva pa12 kusvika kumaminitsi makumi maviri imwe neimwe.
Zvidzidzo zvichavhurwa zvishoma nezvishoma. Muvhuro wega wega ndichavhura mukana kune chidzidzo chitsva pawebhusaiti yangu.
Kosi iyi ndeyaani?
Ndinofunga izvi zvakajeka kubva mumusoro, zvisinei, ini ndichazvitsanangura zvakadzama.
Kosi yacho yakanangana neavo vanoshingairira kushandisa Microsoft Excel mubasa ravo uye kuita basa ravo rose nedata ipapo. Kazhinji, kana iwe ukavhura iyo Microsoft Excel application kamwechete pasvondo, saka kosi yacho yakakunakira iwe.
Iwe haufanirwe kuve nehunyanzvi hwekuronga kuti upedze kosi, nekuti ... Kosi yacho yakanangana nevanotanga.
Asi, zvichida, kutanga kubva pachidzidzo 4, pachava nezvinhu zvinonakidza zvevashandisi veR, zvakare, nokuti ... basa guru remapakeji akadai se dplyr
ΠΈ tidyr
ichakurukurwa zvakadzama.
Kosi purogiramu
Chidzidzo 1: Kuisa R mutauro uye RStudio budiriro nharaunda
Zuva rekuburitswa: March 23 2020
Mareferensi:
Vhidhiyo:
Description:
Chidzidzo chekutanga panguva yatichadhawunirodha nekuisa iyo inodiwa software, uye nekuongorora muchidimbu kugona uye chimiro cheiyo RStudio budiriro nharaunda.
Chidzidzo 2: Basic Data Structures muR
Zuva rekuburitswa: March 30 2020
Mareferensi:
Vhidhiyo:
Description:
Ichi chidzidzo chichakubatsira kuti unzwisise kuti ndeapi maumbirwo edata anowanikwa mumutauro weR. Tichatarisa zvakadzama pamavekita, mafuremu emazuva uye zvinyorwa. Ngatidzidzei kuzvigadzira uye kuwana zvinhu zvawo zvega.
Chidzidzo 3: Kuverenga data kubva kuTSV, CSV, Excel mafaera uye Google Sheets
Zuva rekuburitswa: April 6 2020
Mareferensi:
Vhidhiyo:
Description:
Kushanda nedata, zvisinei nechishandiso, kunotanga nekubviswa kwayo. Mapakeji anoshandiswa panguva yechidzidzo vroom
, readxl
, googlesheets4
yekurodha data muR nharaunda kubva kucsv, tsv, Excel mafaera uye Google Sheets.
Chidzidzo 4: Kusefa mitsara, kusarudza uye kutumidzazve makoramu, mapaipi muR
Zuva rekuburitswa: April 13 2020
Mareferensi:
Vhidhiyo:
Description:
Ichi chidzidzo chiri pamusoro pepasuru dplyr
. Mariri isu tichaona nzira yekusefa dataframes, sarudza makoramu anodiwa uye woatumidza zita.
Tichadzidzawo kuti mapaipi ndeapi uye kuti anobatsira sei kuti R kodhi yako iverengeke.
Chidzidzo 5: Kuwedzera Makoramu Akaverengerwa paTafura iri muR
Zuva rekuburitswa: April 20 2020
Mareferensi:
Vhidhiyo:
Description:
Muvhidhiyo iyi tinoenderera mberi nekuziva kwedu raibhurari tidyverse
uye package dplyr
.
Ngatitarisei mhuri yemabasa mutate()
, uye isu tichadzidza kuti tingazvishandisa sei kuwedzera makoramu matsva akaverengerwa patafura.
Chidzidzo 6: Kuunganidza uye Kuunganidza Data muR
Zuva rekuburitswa: April 27 2020
Mareferensi:
Vhidhiyo:
Description:
Ichi chidzidzo chakatsaurirwa kune chimwe chekushanda kukuru kwekuongorora data, kuunganidza uye kuunganidza. Munguva yechidzidzo tichashandisa pasuru dplyr
uye maitiro group_by()
ΠΈ summarise()
.
Tichatarisa mhuri yese yemabasa summarise()
, i.e. summarise()
, summarise_if()
ΠΈ summarise_at()
.
Chidzidzo 7: Yakatwasuka uye Yakachinjika Kujoinwa kwematafura muR
Zuva rekuburitswa: 4 May 2020
Mareferensi:
Vhidhiyo:
Description:
Ichi chidzidzo chinokubatsira iwe kunzwisisa mashandiro ekubatanidza matafura.
Mubatanidzwa wakatwasuka wakafanana nekushanda kweUNION mumutauro wemubvunzo weSQL.
Kujoinha kwakatwasuka kunozivikanwa zvirinani kune vashandisi veExcel nekuda kweVLOOKUP basa; muSQL, mavhiya akadai anoitwa neJOIN opareta.
Munguva yechidzidzo tichagadzirisa dambudziko rinoshanda panguva yatichashandisa mapakeji dplyr
, readxl
, tidyr
ΠΈ stringr
.
Iwo makuru mabasa atichatarisa:
bind_rows()
- vertical kubatana kwematafuraleft_join()
- kujoinwa kwakatwasuka kwematafurasemi_join()
- kusanganisira matafura ekubatanidzaanti_join()
- yakasarudzika tafura kujoinha
Chidzidzo 8: Window Mabasa muR
Zuva rekuburitswa: 11 May 2020
Mareferensi:
Description:
Mahwindo emabasa akafanana muchirevo kune aggregating iwo; ivo zvakare vanotora akatevedzana ezvikoshero sekupinza uye kuita arithmetic mashandiro pavari, asi usachinje huwandu hwemitsara mune inobuda mhedzisiro.
Muchidzidzo ichi tinoenderera mberi nekudzidza pasuru dplyr
, uye mabasa group_by()
, mutate()
, uyewo itsva cumsum()
, lag()
, lead()
ΠΈ arrange()
.
Chidzidzo 9: Matafura anotenderera kana analogue yematafura epivot muR
Zuva rekuburitswa: 18 May 2020
Mareferensi:
Description:
Vazhinji vashandisi veExcel vanoshandisa matafura epivot; ichi chishandiso chiri nyore chaunogona kushandura dhata nyoro kuita mishumo inoverengwa mumasekondi.
Muchidzidzo ichi tichatarisa maitiro ekutenderedza matafura muR, uye nekuashandura kubva pahupamhi kuenda kureba fomati uye zvichipesana.
Zvizhinji zvechidzidzo zvakatsaurirwa pasuru tidyr
uye mashandiro pivot_longer()
ΠΈ pivot_wider()
.
Chidzidzo 10: Kuisa Mafaira eJSON muR uye Kushandura Mazita kuMatafura
Zuva rekuburitswa: 25 May 2020
Mareferensi:
Description:
JSON neXML mafomati akanyanya kufarirwa ekuchengetedza nekupanana ruzivo, kazhinji nekuda kwekubatana kwavo.
Asi zvakaoma kuongorora dhiyabhorosi inoratidzwa mumhando dzakadaro, saka tisati taongorora zvakakosha kuti tiuye nayo muchimiro chetabular, izvo ndizvo chaizvo zvatichadzidza muvhidhiyo iyi.
Chidzidzo chakatsaurirwa pasuru tidyr
, yakabatanidzwa pakati peraibhurari tidyverse
, uye mabasa unnest_longer()
, unnest_wider()
ΠΈ hoist()
.
Chidzidzo 11: Kuronga Nekukurumidza Kushandisa qplot() Basa
Zuva rekuburitswa: 1 2020 June
Mareferensi:
Description:
Package ggplot2
ndeimwe yeanonyanya kufarirwa data kuona maturusi kwete muR.
Muchidzidzo chino tichadzidza kugadzira magirafu akareruka tichishandisa basa racho qplot()
, uye ngatinzverei nharo dzake dzose.
Chidzidzo 12: Kuronga Mutsetse neChirongwa Zvirongwa Uchishandisa iyo ggplot2 Package.
Zuva rekuburitswa: 8 2020 June
Mareferensi:
Description:
Chidzidzo chinoratidza simba rakazara repasuru ggplot2
uye girama yekuvaka magirafu muzvikamu zvakabatanidzwa mairi.
Isu tichaongorora iwo makuru ma geometries aripo mupakeji uye tidzidze mashandisiro ekuisa mataira kuvaka girafu.
mhedziso
Ndakaedza kusvika pakuumbwa kwechirongwa chekosi muchidimbu sezvinobvira, kuratidza chete ruzivo rwakakosha rwauchazoda kuti utore matanho ekutanga mukudzidza chishandiso chine simba chekuongorora data semutauro weR.
Iyo kosi haisi iyo inopedza dhairekitori yekuongorora data uchishandisa R mutauro, asi ichakubatsira iwe kunzwisisa ese anodiwa matekiniki eizvi.
Nepo chirongwa chekosi chakagadzirirwa mavhiki e12, svondo rega rega neMuvhuro ndinovhura mukana kune zvidzidzo zvitsva, saka ndinokurudzira
Source: www.habr.com