Ukwehliswa kwenkathi Yedatha Enkulu

Ababhali abaningi bakwamanye amazwe bayavuma ukuthi inkathi ye-Big Data isifike esiphethweni. Futhi kulokhu, igama elithi Big Data libhekisela kubuchwepheshe obusekelwe ku-Hadoop. Ababhali abaningi bangakwazi ngisho nokusho ngokuzethemba usuku okwashiya ngalo i-Big Data kulo mhlaba futhi lolu suku umhlaka-05.06.2019/XNUMX/XNUMX.

Kwenzekani ngalolu suku olubalulekile?

Ngalolu suku, i-MAPR ithembise ukumisa umsebenzi wayo uma ingakwazi ukuthola imali yokuqhubeka nokusebenza. I-MAPR kamuva yatholwa yi-HP ngo-Agasti 2019. Kodwa uma sibuyela ngoJuni, umuntu akanakuzibamba kodwa aqaphele usizi lwalesi sikhathi emakethe yeDatha Enkulu. Le nyanga ibone ukwehla kwezintengo zesitoko ze-CLOUDERA, umdlali ohamba phambili emakethe, ohlanganiswe ne-HORTOWORKS engenanzuzo engapheli ngoJanuwari wonyaka ofanayo. Ukuwa bekubaluleke kakhulu futhi kwafinyelela ku-43%; ekugcineni, imali ye-CLOUDERA yehla isuka ku-4,1 yaya kumadola ayizigidi eziyizinkulungwane eziyi-1,4.

Akunakwenzeka ukusho ukuthi amahemuhemu e-bubble emkhakheni wobuchwepheshe obusekelwe ku-Hadoop asakazwa kusukela ngoDisemba 2014, kodwa abambelele ngesibindi cishe iminyaka emihlanu. Lawa mahemuhemu ayesekelwe ekwenqabeni kwe-Google, inkampani lapho ubuchwepheshe be-Hadoop baqala khona, kusukela ekusungulweni kwayo. Kodwa ubuchwepheshe bagxila ngesikhathi sokushintshwa kwezinkampani kumathuluzi okucubungula amafu kanye nokuthuthukiswa okusheshayo kobuhlakani bokwenziwa. Ngakho-ke, uma sibheka emuva, singasho ngokuqiniseka ukuthi ukufa kwakulindelekile.

Ngakho-ke, inkathi ye-Big Data isifike esiphethweni, kodwa ohlelweni lokusebenza ku-Big Data, izinkampani ziye zaqaphela wonke ama-nuances okusebenza kuyo, izinzuzo ezingalethwa yi-Big Data ebhizinisini, futhi zafunda ukusebenzisa i-artificial. ubuhlakani bokukhipha inani kudatha eluhlaza.

Okuthakazelisa kakhulu kuba umbuzo wokuthi yini ezothatha isikhundla salobu buchwepheshe nokuthi ubuchwepheshe be-analytics buzothuthuka kanjani.

Izibalo ezithuthukisiwe

Phakathi nemicimbi echazwe, izinkampani ezisebenza emkhakheni wokuhlaziya idatha azizange zihlale. Yini engahlulelwa ngokusekelwe olwazini mayelana nokuthengiselana okwenzeka ngo-2019. Kulo nyaka, ukuthengiselana okukhulu kunazo zonke emakethe kwenziwa - ukutholwa kwe-platform yokuhlaziya i-Tableau yi-Salesforce ngamaRandi ayizigidi eziyizinkulungwane ezingu-15,7. Idili elincane lenzekile phakathi kwe-Google ne-Locker. Futhi-ke, umuntu akanakuhluleka ukuqaphela ukutholwa yi-Qlik ye-platform yedatha enkulu i-Attunity.

Abaholi bemakethe ye-BI kanye nochwepheshe bakwaGartner bamemezela ushintsho olukhulu ezindleleni zokuhlaziya idatha; lolu shintsho luzobhubhisa ngokuphelele imakethe ye-BI futhi luholele ekuthatheni esikhundleni se-BI nge-AI. Kulo mongo, kufanele kuqashelwe ukuthi isifinyezo esithi AI akuwona "ubuhlakani bokwenziwa" kodwa "Ubuhlakani Obungeziwe". Ake sibhekisise ukuthi yini ebangela amagama athi "Izibalo Ezithuthukisiwe."

Ukuhlaziya okungathandwa kwabathelisi esikubona, njengeqiniso elingathandwa kwabathelisi esikubona, kusekelwe kokuthunyelwe okujwayelekile okuningana:

  • ikhono lokuxhumana kusetshenziswa i-NLP (Natural Language Processing), i.e. ngolimi lwabantu;
  • ukusetshenziswa kobuhlakani bokwenziwa, lokhu kusho ukuthi idatha izocutshungulwa kusengaphambili ubuhlakani bomshini;
  • futhi kunjalo, izincomo ezitholakala kumsebenzisi wesistimu, ezakhiwe ubuhlakani bokwenziwa.

Ngokusho kwabakhiqizi bezinkundla zokuhlaziya, ukusetshenziswa kwabo kuzotholakala kubasebenzisi abangenawo amakhono akhethekile, njengolwazi lwe-SQL noma ulimi olufanayo lokubhala, abangenalo ukuqeqeshwa kwezibalo noma izibalo, abangenalo ulwazi lwezilimi ezidumile. ikakhulukazi ekucutshungulweni kwedatha kanye nemitapo yolwazi ehambisanayo. Abantu abanjalo, ababizwa ngokuthi "Citizen Data Scientists", kumele babe neziqu zebhizinisi ezivelele kuphela. Umsebenzi wabo uwukuthwebula imininingwane yebhizinisi kumathiphu nezibikezelo ezizonikezwa ubuhlakani bokwenziwa, futhi bangacwengisa ukuqagela kwabo besebenzisa i-NLP.

Echaza inqubo yabasebenzisi abasebenza nezinhlelo zaleli klasi, umuntu angacabanga ngesithombe esilandelayo. Umuntu, oza emsebenzini futhi wethula uhlelo lokusebenza oluhambisanayo, ngaphezu kwesethi evamile yemibiko namadeshibhodi angahlaziywa kusetshenziswa izindlela ezijwayelekile (ukuhlunga, ukuqoqa, ukwenza imisebenzi yezibalo), ubona amathiphu nezincomo ezithile, into efana nokuthi: ukuze uzuze i-KPI, inombolo yokuthengisa, kufanele usebenzise isaphulelo emikhiqizweni evela kusigaba "sokwenza ingadi". Ngaphezu kwalokho, umuntu angathinta isithunywa senkampani: Skype, Slack, njll. Angabuza imibuzo irobhothi, ngombhalo noma ngezwi: β€œNginike amakhasimende amahlanu anenzuzo kakhulu.” Ngemva kokuthola impendulo efanele, kufanele enze isinqumo esingcono kakhulu esisekelwe ekuhlangenwe nakho kwakhe kwebhizinisi futhi alethe inzuzo enkampanini.

Uma uthatha isinyathelo emuva futhi ubheka ukwakheka kolwazi oluhlaziywayo, futhi kulesi sigaba, imikhiqizo yokuhlaziya ethuthukisiwe ingenza izimpilo zabantu zibe lula. Ngokufanelekile, kucatshangwa ukuthi umsebenzisi uzodinga kuphela ukukhomba umkhiqizo wokuhlaziya emithonjeni yolwazi olufunayo, futhi uhlelo ngokwalo luzonakekela ukudala imodeli yedatha, amatafula wokuxhumanisa kanye nemisebenzi efanayo.

Konke lokhu kufanele, okokuqala, kuqinisekise "idemokhrasi" yedatha, i.e. Noma yimuphi umuntu angakwazi ukuhlaziya lonke uhlu lolwazi olutholakala enkampanini. Inqubo yokuthatha izinqumo kufanele isekelwe izindlela zokuhlaziya izibalo. Isikhathi sokufinyelela idatha kufanele sibe sincane, ngakho-ke asikho isidingo sokubhala imibhalo kanye nemibuzo ye-SQL. Futhi-ke, ungonga imali kochwepheshe abakhokhelwa kakhulu be-Data Science.

Ngokucabangela, ubuchwepheshe bunikeza amathemba aqhakazile kakhulu ebhizinisi.

Yini emiselela Idatha Enkulu?

Kodwa, eqinisweni, ngaqala isihloko sami nge-Big Data. Futhi angikwazanga ukuthuthukisa lesi sihloko ngaphandle kohambo olufushane kumathuluzi esimanje e-BI, isisekelo esivame ukuba yi-Big Data. Isiphetho sedatha enkulu manje sinqunywe ngokucacile, futhi ubuchwepheshe befu. Ngigxile ekuthengiseni okwenziwe nabathengisi be-BI ukuze ngibonise ukuthi manje zonke izinhlelo zokuhlaziya zinesitoreji samafu ngemuva kwayo, futhi izinsizakalo zamafu zine-BI njengesiphetho sangaphambili.

Singakhohlwa ngezinsika ezinjalo emkhakheni wolwazi njenge-ORACLE ne-Microsoft, kubalulekile ukuqaphela isiqondiso sabo esikhethiwe sokuthuthukiswa kwebhizinisi futhi leli yifu. Zonke izinkonzo ezinikeziwe zingatholakala efwini, kodwa ezinye izinsiza zamafu azisatholakali endaweni. Benze umsebenzi obalulekile ekusetshenzisweni kwamamodeli okufunda omshini, bakha amalabhulali atholakalayo kubasebenzisi, futhi balungisa izindawo zokusebenzelana ukuze kube lula ukusebenza namamodeli kusukela ekuwakhetheni kuya ekusetheni isikhathi sokuqala.

Enye inzuzo ebalulekile yokusebenzisa izinsizakalo zamafu, evezwa abakhiqizi, ukutholakala kwamasethi wedatha cishe angenamkhawulo kunoma yisiphi isihloko samamodeli okuqeqesha.

Kodwa-ke, umbuzo uphakama: ngabe ubuchwepheshe bamafu buzogxila kangakanani ezweni lethu?

Source: www.habr.com

Engeza amazwana