Amazon Redshift Concurrency Scaling Guide and Test Results
At Skyeng we use Amazon Redshift, including concurrency scaling, so we found this article by Stefan Gromoll, founder of dotgo.com, written for intermix.io, interesting. After the translation there is a short note on our own experience from data engineer Daniyar Belkhodzhaev.
The Amazon Redshift architecture allows scaling by adding new nodes to the cluster. The need to handle peak query volumes can lead to over-provisioning of nodes. Concurrency Scaling, as opposed to adding new nodes, increases compute capacity only as needed.
Amazon Redshift concurrency scaling gives Redshift clusters additional capacity to handle peak query loads. It works by moving queries to new "parallel" clusters in the background. Queries are routed according to the WLM configuration and rules.
Concurrency scaling pricing is based on a credit model with a free tier. Beyond the free credits, billing depends on how long a Concurrency Scaling cluster spends processing queries.
The author tested concurrency scaling on one of the internal clusters. In this post, he discusses the test results and gives tips on how to get started.
Cluster requirements
To use concurrency scaling, your Amazon Redshift cluster must meet the following requirements:
- platform: EC2-VPC;
- node type: dc2.8xlarge, ds2.8xlarge, dc2.large, or ds2.xlarge;
- number of nodes: from 2 to 32 (single-node clusters are not supported).
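The requirements above can be expressed as a small pre-flight check. This is only a sketch: the function and constant names are hypothetical and not part of any AWS API, and the values are taken directly from the list above.

```python
from typing import List

# Requirements as listed in this article; names are illustrative only.
SUPPORTED_PLATFORM = "EC2-VPC"
SUPPORTED_NODE_TYPES = {"dc2.8xlarge", "ds2.8xlarge", "dc2.large", "ds2.xlarge"}
MIN_NODES, MAX_NODES = 2, 32


def unmet_requirements(platform: str, node_type: str, node_count: int) -> List[str]:
    """Return the requirements a cluster fails (empty list if it qualifies)."""
    problems = []
    if platform != SUPPORTED_PLATFORM:
        problems.append(f"platform must be {SUPPORTED_PLATFORM}, got {platform}")
    if node_type not in SUPPORTED_NODE_TYPES:
        problems.append(f"node type {node_type} is not supported")
    if not MIN_NODES <= node_count <= MAX_NODES:
        problems.append(
            f"node count must be between {MIN_NODES} and {MAX_NODES}, got {node_count}"
        )
    return problems


if __name__ == "__main__":
    # A 4-node dc2.large cluster in a VPC meets every requirement.
    print(unmet_requirements("EC2-VPC", "dc2.large", 4))
    # A single-node cluster fails the node-count requirement.
    print(unmet_requirements("EC2-VPC", "dc2.large", 1))
```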
You are charged only when your Concurrency Scaling clusters use more than the free credits you have accumulated.
The charge is calculated at the per-second on-demand rate for the time a concurrency scaling cluster is used beyond the free credits. You pay only for the duration of your queries, with a minimum charge of one minute each time a Concurrency Scaling cluster is activated. The per-second on-demand rate follows general Amazon Redshift pricing, that is, it depends on the node type and the number of nodes in your cluster.
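As a rough illustration of the billing rules described above, the arithmetic might look like the sketch below. The hourly rate and the free-credit balance are made-up numbers, the function names are hypothetical, and applying the one-minute minimum before subtracting free credits is an assumption rather than something the article states.

```python
def billable_seconds(activation_seconds):
    """Apply the one-minute minimum to each activation of a scaling cluster."""
    return sum(max(60, seconds) for seconds in activation_seconds)


def concurrency_scaling_charge(activation_seconds, free_credit_seconds,
                               node_hourly_rate, node_count):
    """Estimate the charge for usage beyond the free credits.

    Assumption: the one-minute minimum is applied per activation first,
    and free credits are then subtracted from the total.
    """
    used = billable_seconds(activation_seconds)
    excess = max(0, used - free_credit_seconds)
    # Per-second on-demand rate depends on node type (rate) and node count.
    per_second_rate = node_hourly_rate * node_count / 3600
    return excess * per_second_rate


if __name__ == "__main__":
    # Hypothetical: three activations of 30 s, 10 min and 1 h on a 4-node
    # cluster at an invented rate of 0.25 per node-hour, with 1 h of credits.
    charge = concurrency_scaling_charge([30, 600, 3600], 3600, 0.25, 4)
    print(round(charge, 4))
```

In this example the 30-second activation is rounded up to 60 seconds, giving 4260 billable seconds, of which 660 exceed the free credit.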
Enabling Concurrency Scaling
Concurrency scaling is enabled per WLM queue. Go to the AWS Redshift console and select Workload Management from the left-hand navigation menu. Then select your cluster's WLM parameter group from the drop-down menu that follows.
You will see a new column called "Concurrency Scaling Mode" next to each queue. The default is "Disabled". Click "Edit" and you can change the setting for each queue.
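The same per-queue setting can also be expressed in the parameter group's `wlm_json_configuration` value, where each queue carries a `concurrency_scaling` property set to `auto` or `off`. The sketch below only builds that value; the queue layout and the `reports` query group are hypothetical, and actually applying the parameter (for example with the boto3 Redshift client's `modify_cluster_parameter_group`) is left out.

```python
import json


def build_wlm_parameter(queues):
    """Wrap a list of WLM queue definitions as a parameter-group entry."""
    return {
        "ParameterName": "wlm_json_configuration",
        "ParameterValue": json.dumps(queues),
    }


# Hypothetical two-queue layout: scaling enabled only for the first queue.
wlm_queues = [
    {
        "query_group": ["reports"],       # hypothetical query group name
        "query_concurrency": 5,
        "concurrency_scaling": "auto",    # scaling enabled for this queue
    },
    {
        "query_concurrency": 5,           # default queue
        "concurrency_scaling": "off",     # scaling stays disabled here
    },
]

if __name__ == "__main__":
    print(build_wlm_parameter(wlm_queues)["ParameterValue"])
```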
Configuration
Concurrency scaling works by routing eligible queries to new, dedicated clusters. The new clusters have the same size (node type and number of nodes) as the main cluster.
The default number of clusters used for concurrency scaling is one (1), and it can be configured up to a total of ten (10) clusters.
The total number of clusters used for concurrency scaling is set with the max_concurrency_scaling_clusters parameter. Increasing the value of this parameter makes additional standby clusters available.
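A minimal sketch of preparing this parameter, assuming the 1 to 10 range described above. The helper name is hypothetical; the returned dictionary has the shape of one entry in a parameter-group update, but sending it to AWS is not shown.

```python
def scaling_clusters_parameter(count: int) -> dict:
    """Build a max_concurrency_scaling_clusters entry, validating the range."""
    if not 1 <= count <= 10:
        raise ValueError(
            f"max_concurrency_scaling_clusters must be between 1 and 10, got {count}"
        )
    return {
        "ParameterName": "max_concurrency_scaling_clusters",
        "ParameterValue": str(count),
    }


if __name__ == "__main__":
    print(scaling_clusters_parameter(3))
```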
Monitoring
Several additional charts are available in the AWS Redshift console. The Max Configured Concurrency Scaling Clusters chart shows the value of max_concurrency_scaling_clusters over time.
The number of active scaling clusters is shown in the user interface in the "Concurrency Scaling Activity" section:
In the Queries tab, there is a column showing whether a query ran on the main cluster or on a concurrency scaling cluster:
Whether a particular query ran on the main cluster or on a concurrency scaling cluster is recorded in stl_query.concurrency_scaling_status.
A value of 1 indicates that the query ran on a concurrency scaling cluster, while any other value indicates that it ran on the main cluster.
Example:
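A check along these lines could be scripted as follows. This is a sketch, not the author's query: the SQL selects only columns of `stl_query`, the `%s` placeholder assumes a psycopg2-style driver, and the helper names and sample rows are hypothetical. No database connection is made here; the counting function works on rows that have already been fetched.

```python
# SQL to list recent queries with their scaling status (driver-side
# parameter style "%s" is an assumption about the client library).
STATUS_QUERY = """
SELECT query, starttime, concurrency_scaling_status
FROM stl_query
WHERE starttime >= %s
ORDER BY starttime;
"""


def ran_on_scaling_cluster(status) -> bool:
    """Status 1 means a concurrency scaling cluster; anything else is main."""
    return status == 1


def count_by_cluster(rows):
    """Count fetched (query, starttime, status) rows by where they ran."""
    counts = {"scaling": 0, "main": 0}
    for _query_id, _starttime, status in rows:
        key = "scaling" if ran_on_scaling_cluster(status) else "main"
        counts[key] += 1
    return counts


if __name__ == "__main__":
    # Invented sample rows standing in for a fetched result set.
    sample_rows = [
        (101, "2019-03-29 18:31:00", 1),
        (102, "2019-03-29 18:32:00", 0),
        (103, "2019-03-29 18:33:00", 6),
    ]
    print(count_by_cluster(sample_rows))
```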
Concurrency scaling information is also stored in some other tables and views, such as SVCS_CONCURRENCY_SCALING_USAGE. In addition, there are several catalog tables that store information about concurrency scaling.
Results
The authors enabled concurrency scaling for one queue on an internal cluster at approximately 18:30:00 GMT on 29.03.2019. The max_concurrency_scaling_clusters parameter was changed to 3 at approximately 20:30:00 on 29.03.2019.
To simulate query queuing, we reduced the number of slots for this queue from 15 to 5.
Below is an intermix.io dashboard chart showing the number of running and queued queries after the number of slots was reduced.
We can see that the queue wait time for queries increased, with a maximum of more than 5 minutes.
Here is the corresponding information from the AWS console about what happened during this period:
Redshift launched three (3) concurrency scaling clusters, as configured. It appears that these clusters were underutilized, even though many queries on our cluster were queued.
A few hours later, the authors checked the queue, and it appeared that 6 queries had run with concurrency scaling. We also spot-checked two queries through the user interface. We did not check how these values should be interpreted when several concurrency scaling clusters are active at once.