éé¢ã®ãããå€ãã®äººãèªå® ã§éããæéã®å€§éšåãå ããŠããŸããããã®æéãææ矩ã«éããããšãã§ããŸããããããã¹ãã§ãã
éé¢ç掻ãå§ãŸãã«ããã£ãŠãç§ã¯æ°ãæåã«å§ããããã€ãã®ãããžã§ã¯ããå®äºãããããšã«ããŸããã ãããã®ãããžã§ã¯ãã® XNUMX ã€ã¯ããã㪠ã³ãŒã¹ãExcel ãŠãŒã¶ãŒã®ããã® R èšèªãã§ããã ãã®ã³ãŒã¹ã§ã¯ãR ãžã®åå ¥éå£ãäžãããã®ãããã¯ã«é¢ãããã·ã¢èªã®ãã¬ãŒãã³ã°è³æã®æ¢åã®äžè¶³ãå°ãåããããšèããŸããã
ããªããåããŠããäŒç€Ÿã§ããŒã¿ãæ±ããã¹ãŠã®äœæ¥ãäŸç¶ãšã㊠Excel ã§è¡ãããŠããå Žåã¯ãããææ°ã§ãããªããå®å šã«ç¡æã®ããŒã¿åæããŒã«ã«æ £ããããšããå§ãããŸãã
ããŒãžå 容
ããŒã¿åæã«èå³ãããå Žåã¯ãç§ã®èšäºã«èå³ããããããããŸããã
ãªãã¡ã¬ã³ã¹ ã³ãŒã¹ã«ã€ã㊠ãã®ã³ãŒã¹ã¯èª°ã察象ãšããŠããŸãã? ã³ãŒã¹ããã°ã©ã
4.1.ã¬ãã¹ã³ 1: R èšèªãš RStudio éçºç°å¢ã®ã€ã³ã¹ããŒã«
4.2.ã¬ãã¹ã³ 2: R ã®åºæ¬çãªããŒã¿æ§é
4.3.ã¬ãã¹ã³ 3: TSVãCSVãExcel ãã¡ã€ã«ãGoogle ã¹ãã¬ããã·ãŒãããããŒã¿ãèªã¿åã
4.4.ã¬ãã¹ã³ 4: R ã§ã®è¡ã®ãã£ã«ã¿ãŒåŠçãåã®éžæãšååå€æŽããã€ãã©ã€ã³
4.5.ã¬ãã¹ã³ 5: R ã®ããŒãã«ã«èšç®åãè¿œå ãã
4.6.ã¬ãã¹ã³ 6: R ã§ã®ããŒã¿ã®ã°ã«ãŒãåãšéèš
4.7.ã¬ãã¹ã³ 7: R ã§ã®ããŒãã«ã®åçŽçµåãšæ°Žå¹³çµå
4.8.ã¬ãã¹ã³ 8: R ã®ãŠã£ã³ããŠé¢æ°
4.9.ã¬ãã¹ã³ 9: R ã®å転ããŒãã«ãŸãã¯ãããã ããŒãã«ã®é¡äŒŒç©
4.10.ã¬ãã¹ã³ 10: R ãžã® JSON ãã¡ã€ã«ã®ããŒããšãªã¹ãããããŒãã«ãžã®å€æ
4.11.ã¬ãã¹ã³ 11: qplot() é¢æ°ã䜿çšããè¿ éãªãããã
4.12.ã¬ãã¹ã³ 12: ggplot2 ããã±ãŒãžã䜿çšããã¬ã€ã€ãŒããšã®ããããã®ãããã ãŸãšã
ãªãã¡ã¬ã³ã¹
YouTube ãã£ã³ãã«ã賌èªãã YouTube ã®ã³ãŒã¹ã®ãã¬ã€ãªã¹ã ãããªã¢ã«ã®ãªããžã㪠ã³ãŒã¹ããŒãž
ã³ãŒã¹ã«ã€ããŠ
ã³ãŒã¹ã¯å»ºç¯ãäžå¿ã«æ§æãããŠããŸã tidyverse
ãããã³ããã«å«ãŸããããã±ãŒãž: readr
, vroom
, dplyr
, tidyr
, ggplot2
ã ãã¡ãããR ã«ã¯åæ§ã®æäœãå®è¡ããä»ã®åªããããã±ãŒãžããããŸããããšãã°ã data.table
ããããæ§æ tidyverse
çŽèŠ³çã§ãèšç·ŽãåããŠããªããŠãŒã¶ãŒã§ãèªã¿ãããã®ã§ãR èšèªã®åŠç¿ãå§ããã®ãè¯ããšæããŸãã tidyverse
.
ãã®ã³ãŒã¹ã§ã¯ãèªã¿èŸŒã¿ããæçµçµæã®èŠèŠåãŸã§ããã¹ãŠã®ããŒã¿åææäœã説æããŸãã
ãªã Python ã§ã¯ãªã R ãªã®ã§ãããã? R ã¯é¢æ°åèšèªã§ãããããExcel ãŠãŒã¶ãŒã¯ R ã«åãæ¿ããã®ãç°¡åã§ãã åŸæ¥ã®ãªããžã§ã¯ãæåããã°ã©ãã³ã°ãæ·±ãæãäžããå¿ èŠã¯ãããŸããã
çŸæç¹ã§ã¯ããããã 12 ïœ 5 åã®ãã㪠ã¬ãã¹ã³ã 20 åäºå®ãããŠããŸãã
åŸã
ã«ã¬ãã¹ã³ãåéããŠãããŸãã æ¯é±æææ¥ã«ãç§ã®ãŠã§ããµã€ãã§æ°ããã¬ãã¹ã³ãžã®ã¢ã¯ã»ã¹ãå
¬éããŸãã
ãã®ã³ãŒã¹ã¯èª°ã察象ãšããŠããŸãã?
ã¿ã€ãã«ãããåãããšæããŸãããããå°ã詳ãã説æããŸãã
ãã®ã³ãŒã¹ã¯ãæ¥å㧠Microsoft Excel ãç©æ¥µçã«äœ¿çšãããã¹ãŠã®äœæ¥ã Microsoft Excel ã®ããŒã¿ã䜿çšããŠå®è£ ãã人ã察象ãšããŠããŸãã äžè¬ã«ãMicrosoft Excel ã¢ããªã±ãŒã·ã§ã³ãå°ãªããšãé±ã« XNUMX åéãå Žåã¯ããã®ã³ãŒã¹ãé©ããŠããŸãã
ã³ãŒã¹ãå®äºããããã«ããã°ã©ãã³ã° ã¹ãã«ã¯å¿ èŠãããŸãããçç±ã¯... åå¿è åãã®ã³ãŒã¹ã§ãã
ãããããããããã¬ãã¹ã³ 4 ããã¯ãã¢ã¯ãã£ã㪠R ãŠãŒã¶ãŒã«ãšã£ãŠãèå³æ·±ãå
容ã«ãªãã§ãããã ãªã©ã®ããã±ãŒãžã®äž»ãªæ©èœ dplyr
О tidyr
ã«ã€ããŠã¯ããå°ã詳ãã説æããŸãã
ã³ãŒã¹ããã°ã©ã
ã¬ãã¹ã³ 1: R èšèªãš RStudio éçºç°å¢ã®ã€ã³ã¹ããŒã«
åºçæ¥ïŒ æ23 2020
ãªã³ã¯ïŒ
ãããªïŒ
説æïŒ
å¿
èŠãªãœãããŠã§ã¢ãããŠã³ããŒãããŠã€ã³ã¹ããŒã«ããRStudio éçºç°å¢ã®æ©èœãšã€ã³ã¿ãŒãã§ã€ã¹ãç°¡åã«èª¿ã¹ãå
¥éã¬ãã¹ã³ã§ãã
ã¬ãã¹ã³ 2: R ã®åºæ¬çãªããŒã¿æ§é
åºçæ¥ïŒ æ30 2020
ãªã³ã¯ïŒ
ãããªïŒ
説æïŒ
ãã®ã¬ãã¹ã³ã¯ãR èšèªã§äœ¿çšã§ããããŒã¿æ§é ãç解ããã®ã«åœ¹ç«ã¡ãŸãããã¯ãã«ãæ¥ä»ãã¬ãŒã ããªã¹ãã«ã€ããŠè©³ãã説æããŸãã ããããäœæããåã
ã®èŠçŽ ã«ã¢ã¯ã»ã¹ããæ¹æ³ãåŠã³ãŸãããã
ã¬ãã¹ã³ 3: TSVãCSVãExcel ãã¡ã€ã«ãGoogle ã¹ãã¬ããã·ãŒãããããŒã¿ãèªã¿åã
åºçæ¥ïŒ 4æ6 2020
ãªã³ã¯ïŒ
ãããªïŒ
説æïŒ
ããŒã¿ã®æäœã¯ãããŒã«ã«é¢ä¿ãªããããŒã¿ã®æœåºããå§ãŸããŸãã ããã±ãŒãžã¯ã¬ãã¹ã³äžã«äœ¿çšãããŸã vroom
, readxl
, googlesheets4
csvãtsvãExcel ãã¡ã€ã«ãGoogle ã¹ãã¬ããã·ãŒããã R ç°å¢ã«ããŒã¿ãããŒãããŸãã
ã¬ãã¹ã³ 4: R ã§ã®è¡ã®ãã£ã«ã¿ãŒåŠçãåã®éžæãšååå€æŽããã€ãã©ã€ã³
åºçæ¥ïŒ 4æ13 2020
ãªã³ã¯ïŒ
ãããªïŒ
説æïŒ
ãã®ã¬ãã¹ã³ã¯ããã±ãŒãžã«ã€ããŠã§ã dplyr
ã ãã®äžã§ãããŒã¿ãã¬ãŒã ããã£ã«ã¿ãŒããå¿
èŠãªåãéžæããŠååãå€æŽããæ¹æ³ãèŠã€ããŸãã
ãŸãããã€ãã©ã€ã³ãšã¯äœãããããŠããã R ã³ãŒããèªã¿ãããããã®ã«ã©ã®ããã«åœ¹ç«ã€ã®ãã«ã€ããŠãåŠã³ãŸãã
ã¬ãã¹ã³ 5: R ã®ããŒãã«ã«èšç®åãè¿œå ãã
åºçæ¥ïŒ 4æ20 2020
ãªã³ã¯ïŒ
ãããªïŒ
説æïŒ
ãã®ãããªã§ã¯ãå³æžé€šãšã®ä»ãåããç¶ããŸã tidyverse
ãããŠããã±ãŒãž dplyr
.
é¢æ°ãã¡ããªãŒãèŠãŠã¿ãŸããã mutate()
ãããŠããããã䜿çšããŠããŒãã«ã«æ°ããèšç®åãè¿œå ããæ¹æ³ãåŠã³ãŸãã
ã¬ãã¹ã³ 6: R ã§ã®ããŒã¿ã®ã°ã«ãŒãåãšéèš
åºçæ¥ïŒ 4æ27 2020
ãªã³ã¯ïŒ
ãããªïŒ
説æïŒ
ãã®ã¬ãã¹ã³ã§ã¯ãããŒã¿åæã®äž»èŠãªæäœã® XNUMX ã€ã§ããã°ã«ãŒãåãšéèšã«ã€ããŠèª¬æããŸãã ã¬ãã¹ã³äžã¯ããã±ãŒãžã䜿çšããŸã dplyr
ãšæ©èœ group_by()
О summarise()
.
é¢æ°ãã¡ããªãŒå
šäœãèŠãŠãããŸã summarise()
ããªãã¡ summarise()
, summarise_if()
О summarise_at()
.
ã¬ãã¹ã³ 7: R ã§ã®ããŒãã«ã®åçŽçµåãšæ°Žå¹³çµå
åºçæ¥ïŒ 4æ2020
ãªã³ã¯ïŒ
ãããªïŒ
説æïŒ
ãã®ã¬ãã¹ã³ã¯ãããŒãã«ã®åçŽçµåãšæ°Žå¹³çµåã®æäœãç解ããã®ã«åœ¹ç«ã¡ãŸãã
åçŽãŠããªã³ã¯ãSQL ã¯ãšãªèšèªã® UNION æŒç®ã«çžåœããŸãã
æ°Žå¹³çµåã¯ãVLOOKUP é¢æ°ã®ããã㧠Excel ãŠãŒã¶ãŒã«ããç¥ãããŠããŸãããSQL ã§ã¯ããã®ãããªæäœã¯ JOIN æŒç®åã«ãã£ãŠå®è¡ãããŸãã
ã¬ãã¹ã³ã§ã¯ãããã±ãŒãžã䜿çšããå®è·µçãªåé¡ã解決ããŸãã dplyr
, readxl
, tidyr
О stringr
.
èæ ®ããäž»ãªæ©èœã¯æ¬¡ã®ãšããã§ãã
bind_rows()
- ããŒãã«ã®åçŽçµåleft_join()
â ããŒãã«ã®æ°Žå¹³çµåsemi_join()
- ããŒãã«ã®çµåãå«ãanti_join()
- æä»çããŒãã«çµå
ã¬ãã¹ã³ 8: R ã®ãŠã£ã³ããŠé¢æ°
åºçæ¥ïŒ 11æ2020
ãªã³ã¯ïŒ
説æïŒ
ãŠã£ã³ããŠé¢æ°ã¯éèšé¢æ°ãšæå³ã䌌ãŠãããå€ã®é
åãå
¥åãšããŠåãåããç®è¡æŒç®ãå®è¡ããŸãããåºåçµæã®è¡æ°ã¯å€æŽããŸããã
ãã®ãã¥ãŒããªã¢ã«ã§ã¯ãããã±ãŒãžã®åŠç¿ãç¶ããŸãã dplyr
ãããã³é¢æ° group_by()
, mutate()
ãæ°ååæ§ã« cumsum()
, lag()
, lead()
О arrange()
.
ã¬ãã¹ã³ 9: R ã®å転ããŒãã«ãŸãã¯ãããã ããŒãã«ã®é¡äŒŒç©
åºçæ¥ïŒ 18æ2020
ãªã³ã¯ïŒ
説æïŒ
ã»ãšãã©ã® Excel ãŠãŒã¶ãŒã¯ãããã ããŒãã«ã䜿çšããŠããŸããããã¯ãçããŒã¿ã®é
åãæ°ç§ã§èªã¿ãããã¬ããŒãã«å€æã§ãã䟿å©ãªããŒã«ã§ãã
ãã®ãã¥ãŒããªã¢ã«ã§ã¯ãR ã§ããŒãã«ãå転ããããŒãã«ãã¯ã€ã圢åŒãããã³ã°åœ¢åŒã«ããŸãã¯ãã®éã«å€æããæ¹æ³ãèŠãŠãããŸãã
ã¬ãã¹ã³ã®å€§éšåã¯ããã±ãŒãžã«åœãŠãããŸã tidyr
ãšæ©èœ pivot_longer()
О pivot_wider()
.
ã¬ãã¹ã³ 10: R ãžã® JSON ãã¡ã€ã«ã®ããŒããšãªã¹ãããããŒãã«ãžã®å€æ
åºçæ¥ïŒ 25æ2020
ãªã³ã¯ïŒ
説æïŒ
JSON ãš XML ã¯ãéåžžãã®ã³ã³ãã¯ããã®ãããæ
å ±ã®ä¿åãšäº€æã«éåžžã«äººæ°ã®ãã圢åŒã§ãã
ãããããã®ãããªåœ¢åŒã§æ瀺ãããããŒã¿ãåæããã®ã¯é£ãããããåæããåã«ããŒã¿ã衚圢åŒã«ããå¿ èŠããããŸããããããŸãã«ãã®ãããªã§åŠç¿ããå 容ã§ãã
ã¬ãã¹ã³ã¯ããã±ãŒãžå°çšã§ã tidyr
ãã©ã€ãã©ãªã®ã³ã¢ã«å«ãŸããŠããŸã tidyverse
ãããã³é¢æ° unnest_longer()
, unnest_wider()
О hoist()
.
ã¬ãã¹ã³ 11: qplot() é¢æ°ã䜿çšããè¿ éãªãããã
åºçæ¥ïŒ 1 2020 6æ
ãªã³ã¯ïŒ
説æïŒ
ããã±ãŒãž ggplot2
ã¯ãR ã«éããæã人æ°ã®ããããŒã¿èŠèŠåããŒã«ã® XNUMX ã€ã§ãã
ãã®ã¬ãã¹ã³ã§ã¯ãé¢æ°ã䜿çšããŠç°¡åãªã°ã©ããäœæããæ¹æ³ãåŠã³ãŸãã qplot()
ããããŠåœŒå¥³ã®è°è«ããã¹ãŠåæããŠã¿ãŸãããã
ã¬ãã¹ã³ 12: ggplot2 ããã±ãŒãžã䜿çšããã¬ã€ã€ãŒããšã®ããããã®ãããã
åºçæ¥ïŒ 8 2020 6æ
ãªã³ã¯ïŒ
説æïŒ
ã¬ãã¹ã³ã§ã¯ããã±ãŒãžã®èœåãæ倧éã«çºæ®ããŸã ggplot2
ãããŠãããã«åã蟌ãŸããã¬ã€ã€ãŒã§ã°ã©ããæ§ç¯ããææ³ã
ããã±ãŒãžã«å«ãŸããäž»èŠãªãžãªã¡ããªãåæããã¬ã€ã€ãŒãé©çšããŠã°ã©ããæ§ç¯ããæ¹æ³ãåŠã³ãŸãã
ãŸãšã
R èšèªã®ãããªåŒ·åãªããŒã¿åæããŒã«ãåŠç¿ããæåã®äžæ©ãèžã¿åºãããã«å¿ èŠãªæãå¿ èŠãªæ å ±ã®ã¿ã匷調ããããã«ãã³ãŒã¹ ããã°ã©ã ã®æ§æãã§ããã ãç°¡æœã«ããããšããŸããã
ãã®ã³ãŒã¹ã¯ãR ã䜿çšããããŒã¿åæã®å®å šãªã¬ã€ãã§ã¯ãããŸããããåæã«å¿ èŠãªãã¹ãŠã®ãã¯ããã¯ãç解ããã®ã«åœ¹ç«ã¡ãŸãã
ã³ãŒã¹ ããã°ã©ã 㯠12 é±éã§èšèšãããŠããŸãããæ¯é±æææ¥ã«æ°ããã¬ãã¹ã³ãžã®ã¢ã¯ã»ã¹ãå
¬éãããã®ã§ããå§ãããŸãã
åºæïŒ habr.com