R Language for Excel Users (free video course)

Because of self-isolation, many people now spend the lion's share of their time at home, and that time can, and should, be put to good use.

At the start of self-isolation I decided to finish some projects I had begun a few months earlier. One of them was the video course “R Language for Excel Users”. With this course I wanted to lower the barrier to entry into R and to fill, at least a little, the existing shortage of training materials on this topic in Russian.

If all work with data at your company is still done in Excel, I suggest you get acquainted with a more modern and, at the same time, completely free data analysis tool.


Contents

If you are interested in data analysis, you may also be interested in my telegram and youtube channels. Most of the content is devoted to the R language.

  1. Links
  2. About the course
  3. Who is this course for?
  4. Course program
    4.1. Lesson 1: Installing the R language and the RStudio development environment
    4.2. Lesson 2: Basic data structures in R
    4.3. Lesson 3: Reading data from TSV, CSV, Excel files and Google Sheets
    4.4. Lesson 4: Filtering rows, selecting and renaming columns, pipelines in R
    4.5. Lesson 5: Adding calculated columns to a table in R
    4.6. Lesson 6: Grouping and aggregating data in R
    4.7. Lesson 7: Vertical and horizontal joining of tables in R
    4.8. Lesson 8: Window functions in R
    4.9. Lesson 9: Pivoting tables, or the analogue of pivot tables in R
    4.10. Lesson 10: Loading JSON files in R and converting lists to tables
    4.11. Lesson 11: Quick plotting with the qplot() function
    4.12. Lesson 12: Layer-by-layer plotting with the ggplot2 package
  5. Conclusion

Links

About the course

The course is built around the tidyverse architecture and the packages it includes: readr, vroom, dplyr, tidyr, ggplot2. Of course, R has other good packages that perform similar operations, for example data.table, but tidyverse syntax is intuitive and easy to read even for an untrained user, so I think it is better to start learning R with tidyverse.

The course walks you through all the operations of data analysis, from loading data to visualizing the finished result.

Why R and not Python? Because R is a functional language, it is easier for Excel users to switch to it: there is no need to dive into classic object-oriented programming.

At the moment 12 video lessons are planned, each lasting from 5 to 20 minutes.

The lessons will open gradually. Every Monday I will open access to a new lesson on my YouTube channel, in a separate playlist.

Who is this course for?

I think that is clear from the title; nevertheless, I will describe it in more detail.

The course is aimed at those who actively use Microsoft Excel in their work and do all of their work with data there. In general, if you open Microsoft Excel at least once a week, the course is for you.

You do not need programming skills to complete the course, because it is aimed at beginners.

Still, starting from lesson 4 there may be material of interest to active R users as well, because the core functionality of packages such as dplyr and tidyr is covered in some detail.

Course program

Lesson 1: Installing the R language and the RStudio development environment

Publication date: March 23, 2020

Links:

Video:

Description:
An introductory lesson in which we download and install the necessary software and briefly look at the capabilities and interface of the RStudio development environment.

Lesson 2: Basic data structures in R

Publication date: March 30, 2020

Links:

Video:

Description:
This lesson will help you understand which data structures exist in the R language. We look in detail at vectors, data frames, and lists, and learn how to create them and how to access their individual elements.
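
As a rough illustration of these three structures, here is a minimal base R sketch. The variable names and values are invented for this example and are not taken from the lesson.

```r
# A vector holds values of a single type.
sales <- c(120, 85, 240)

# A data frame is a table whose columns are equal-length vectors.
orders <- data.frame(
  region = c("North", "South", "West"),
  sales  = sales,
  stringsAsFactors = FALSE  # keep text columns as plain character vectors
)

# A list can mix elements of different types and lengths.
report <- list(title = "Q1", data = orders, tags = c("draft", "internal"))

# Accessing individual elements.
second_sale   <- sales[2]                                  # by position
region_column <- orders$region                             # a column by name
west_sales    <- orders[orders$region == "West", "sales"]  # row filter + column
report_title  <- report[["title"]]                         # a list element by name
```

Single brackets on a vector select by position, `$` picks a data frame column, and double brackets pull one element out of a list.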

Lesson 3: Reading data from TSV, CSV, Excel files and Google Sheets

Publication date: April 6, 2020

Links:

Video:

Description:
Working with data, whatever the tool, begins with extracting it. In this lesson the packages vroom, readxl, and googlesheets4 are used to load data into the R environment from csv, tsv, and Excel files and from Google Sheets.
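
A minimal sketch of the loading step, assuming the vroom package is installed. To stay self-contained it first writes a small temporary csv file; the file contents, the file name `orders.xlsx`, and the sheet URL placeholder are hypothetical and not from the lesson.

```r
library(vroom)

# Create a small csv file to read (hypothetical data for this example).
csv_path <- tempfile(fileext = ".csv")
writeLines(c("region,amount", "North,120", "South,85"), csv_path)

# Read the delimited file; "cd" declares one character and one double column.
orders <- vroom(csv_path, delim = ",", col_types = "cd")

# The other sources named above are read with similar one-line calls
# (left commented out because they need a real file or an authorised account):
# excel_orders <- readxl::read_excel("orders.xlsx")
# sheet_orders <- googlesheets4::read_sheet("<sheet URL>")
```

For a tsv file the same call should work with `delim = "\t"`.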

Lesson 4: Filtering rows, selecting and renaming columns, pipelines in R

Publication date: April 13, 2020

Links:

Video:

Description:
This lesson is about the dplyr package. We look at how to filter data frames, select the columns you need, and rename them.

We also learn what pipelines are and how they help make your R code more readable.
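
The three operations and the pipeline can be sketched as follows, assuming dplyr is installed. The table and its column names are made up for illustration.

```r
library(dplyr)

# Hypothetical example data, not taken from the course.
orders <- tibble(
  order_id = 1:4,
  region   = c("North", "South", "North", "West"),
  amount   = c(120, 85, 240, 60),
  comment  = c("", "rush", "", "")
)

# The pipe %>% passes the result of each step to the next one,
# so the code reads top to bottom like a recipe.
north_orders <- orders %>%
  filter(region == "North") %>%   # keep only the matching rows
  select(order_id, amount) %>%    # keep only the needed columns
  rename(total = amount)          # new_name = old_name
```

Without the pipe the same steps would be written as nested calls, `rename(select(filter(orders, ...), ...), ...)`, which has to be read from the inside out.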

Lesson 5: Adding calculated columns to a table in R

Publication date: April 20, 2020

Links:

Video:

Description:
In this video we continue getting to know the tidyverse library and the dplyr package.
We look at the mutate() family of functions and learn how to use them to add new calculated columns to a table.
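
A minimal sketch of a calculated column, assuming dplyr is installed; the table and column names are invented for this example. It is roughly what a formula column does in Excel.

```r
library(dplyr)

# Hypothetical example data, not taken from the course.
orders <- tibble(
  product  = c("pen", "notebook", "stapler"),
  price    = c(2, 5, 12),
  quantity = c(10, 4, 1)
)

orders_calc <- orders %>%
  mutate(
    revenue  = price * quantity,  # new column computed row by row
    is_large = revenue >= 20      # may refer to a column created just above
  )
```

mutate() keeps all existing columns and appends the new ones; the number of rows does not change.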

Lesson 6: Grouping and aggregating data in R

Publication date: April 27, 2020

Links:

Video:

Description:
This lesson is devoted to one of the main operations of data analysis: grouping and aggregation. During the lesson we use the dplyr package and the functions group_by() and summarise().

We look at the whole summarise() family of functions, i.e. summarise(), summarise_if(), and summarise_at().
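
The three members of the family named above can be sketched like this, assuming dplyr is installed. The data is hypothetical.

```r
library(dplyr)

# Hypothetical example data, not taken from the course.
sales <- tibble(
  region = c("North", "North", "South", "South", "South"),
  amount = c(100, 150, 80, 20, 50),
  units  = c(1, 2, 1, 1, 3)
)

# summarise(): one result row per group, with explicitly named aggregates.
by_region <- sales %>%
  group_by(region) %>%
  summarise(total_amount = sum(amount), n_orders = n())

# summarise_if(): apply a function to every column that passes a test.
numeric_totals <- sales %>%
  group_by(region) %>%
  summarise_if(is.numeric, sum)

# summarise_at(): apply a function to the listed columns only.
unit_means <- sales %>%
  group_by(region) %>%
  summarise_at(vars(units), mean)
```

In recent dplyr releases the `_if` and `_at` variants are marked as superseded by `across()`; they still work, so the sketch keeps the functions the lesson names.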

Lesson 7: Vertical and horizontal joining of tables in R

Publication date: May 4, 2020

Links:

Video:

Description:
This lesson will help you understand the operations of vertical and horizontal joining of tables.

A vertical join is equivalent to the UNION operation in the SQL query language.

A horizontal join is better known to Excel users through the VLOOKUP function; in SQL such operations are performed with the JOIN operator.

During the lesson we solve a practical problem using the packages dplyr, readxl, tidyr, and stringr.

The main functions we will consider:

  • bind_rows() - vertical joining of tables
  • left_join() - horizontal joining of tables
  • semi_join() - inclusive join: keeps the rows that have a match in the other table
  • anti_join() - exclusive join: keeps the rows that have no match in the other table
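
The four functions listed above can be sketched on toy tables as follows, assuming dplyr is installed. The tables and their values are hypothetical and are not the practical problem solved in the lesson.

```r
library(dplyr)

# Hypothetical monthly order tables and a customer lookup table.
jan <- tibble(order_id = 1:2, customer_id = c(10, 20))
feb <- tibble(order_id = 3:4, customer_id = c(10, 30))
customers <- tibble(customer_id = c(10, 20), name = c("Ann", "Bob"))

# Vertical join: stack the rows of both tables (like UNION ALL in SQL).
all_orders <- bind_rows(jan, feb)

# Horizontal join: look up the customer name by key (like VLOOKUP).
# Customer 30 is missing from the lookup table, so its name is NA.
with_names <- left_join(all_orders, customers, by = "customer_id")

# Filtering joins: they add no columns, they only keep or drop rows.
known   <- semi_join(all_orders, customers, by = "customer_id")  # has a match
unknown <- anti_join(all_orders, customers, by = "customer_id")  # has no match
```

Unlike VLOOKUP, left_join() brings over every non-key column of the lookup table in one step.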

Lesson 8: Window functions in R

Publication date: May 11, 2020

Links:

Description:
Window functions are similar in meaning to aggregate functions: they also take an array of values as input and perform arithmetic operations on it, but they do not change the number of rows in the result.

In this lesson we continue to study the dplyr package and the functions group_by() and mutate(), along with the new cumsum(), lag(), lead(), and arrange().
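
A minimal sketch of the difference from aggregation, assuming dplyr is installed: the calculations run within each group, but every input row is kept. The data is hypothetical.

```r
library(dplyr)

# Hypothetical example data, deliberately not sorted by month.
sales <- tibble(
  region = c("North", "North", "North", "South", "South"),
  month  = c(2, 1, 3, 1, 2),
  amount = c(20, 10, 30, 5, 15)
)

windowed <- sales %>%
  arrange(region, month) %>%          # window results depend on row order
  group_by(region) %>%                # each region is its own window
  mutate(
    running_total = cumsum(amount),   # cumulative sum within the region
    prev_amount   = lag(amount),      # value from the previous row
    next_amount   = lead(amount)      # value from the next row
  ) %>%
  ungroup()
```

The first row of each region has no previous value and the last row has no next value, so lag() and lead() return NA there.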

Lesson 9: Pivoting tables, or the analogue of pivot tables in R

Publication date: May 18, 2020

Links:

Description:
Most Excel users use pivot tables; it is a convenient tool with which you can turn an array of raw data into readable reports in a matter of seconds.

In this lesson we look at how to pivot tables in R, converting them from wide to long format and back.

Most of the lesson is devoted to the tidyr package and the functions pivot_longer() and pivot_wider().
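
A minimal round trip between the two formats, assuming tidyr (and the tibble package it depends on) is installed. The table is made up for illustration.

```r
library(tidyr)

# Hypothetical wide table: one column per month.
wide <- tibble::tibble(
  region = c("North", "South"),
  jan    = c(10, 5),
  feb    = c(20, 15)
)

# Wide to long: the month columns become name/value pairs.
long <- pivot_longer(
  wide,
  cols      = c(jan, feb),
  names_to  = "month",
  values_to = "amount"
)

# Long to wide: the inverse operation restores one column per month.
wide_again <- pivot_wider(long, names_from = month, values_from = amount)
```

The long form is what group_by() and ggplot2 usually expect, while the wide form is closer to a report layout.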

Lesson 10: Loading JSON files in R and converting lists to tables

Publication date: May 25, 2020

Links:

Description:
JSON and XML are extremely popular formats for storing and exchanging information, usually because of their compactness.

But data presented in such formats is hard to analyze, so before analysis it has to be brought into tabular form, which is exactly what we learn in this video.

The lesson is devoted to the tidyr package, part of the core of the tidyverse library, and its functions unnest_longer(), unnest_wider(), and hoist().
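
A minimal sketch of turning nested JSON into a table with these three functions. It assumes tidyr, tibble, and jsonlite are installed; using jsonlite for parsing is an assumption of this example (the lesson description does not name a parser), and the JSON text is invented.

```r
library(tidyr)

# Hypothetical JSON: each record has a nested object and an array.
json_text <- '[
  {"name": "Ann", "contact": {"city": "Oslo"}, "tags": ["a", "b"]},
  {"name": "Bob", "contact": {"city": "Lima"}, "tags": ["c"]}
]'

# Parse into a plain nested list (no automatic simplification).
records <- jsonlite::fromJSON(json_text, simplifyVector = FALSE)

# Start from a one-column table where each cell holds one record.
nested <- tibble::tibble(record = records)

# unnest_wider(): each named element of a record becomes its own column.
wide <- unnest_wider(nested, record)

# hoist(): pull one deeply nested value up into a regular column.
with_city <- hoist(wide, contact, city = "city")

# unnest_longer(): each element of a list column becomes its own row.
long <- unnest_longer(with_city, tags)
```

Here `long` has one row per tag, so Ann appears twice and Bob once.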

Lesson 11: Quick plotting with the qplot() function

Publication date: June 1, 2020

Links:

Description:
The ggplot2 package is one of the most popular data visualization tools, and not only in R.

In this lesson we learn how to build simple graphs using the qplot() function and go through all of its arguments.

Lesson 12: Layer-by-layer plotting with the ggplot2 package

Publication date: June 8, 2020

Links:

Description:
The lesson demonstrates the full power of the ggplot2 package and the layered grammar of graphics built into it.

We go through the main geometries available in the package and learn how to add layers to build a graph.
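
The idea of building a graph layer by layer can be sketched as follows, assuming ggplot2 is installed. The example uses R's built-in mtcars dataset, which is a choice made for this sketch and not necessarily the data used in the lesson.

```r
library(ggplot2)

# Base layer: the data and the mapping of columns to axes.
p <- ggplot(mtcars, aes(x = wt, y = mpg)) +
  # Geometry layer 1: one point per car, coloured by cylinder count.
  geom_point(aes(colour = factor(cyl))) +
  # Geometry layer 2: a straight-line trend drawn over the points.
  geom_smooth(method = "lm", se = FALSE) +
  # Labels are added the same way, with "+".
  labs(x = "Weight", y = "Miles per gallon", colour = "Cylinders")

# print(p) draws the plot in an interactive session.
```

Each `+` adds one more component to the plot object, so a graph can be built up and adjusted one layer at a time.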

Conclusion

I tried to keep the course program as concise as possible, highlighting only the most essential information you will need to take your first steps in learning such a powerful data analysis tool as the R language.

The course is not an exhaustive guide to data analysis with R, but it will help you understand all the techniques you need for it.

Since the course program is designed for 12 weeks and I will open access to new lessons every Monday, I recommend subscribing to the YouTube channel so that you do not miss the publication of a new lesson.

Source: www.habr.com
