Txhua yam uas sau ntawm no tsuas yog ib qho kev paub ntawm tus kheej xwb, dab tsi yog qhov tseem ceeb, nrog rau cov lus piav qhia ntawm cov xwm txheej thiab "vim li cas nws thiaj li xav tau" - txhua tus tuaj yeem sim seb qhov xwm txheej zoo li cas tuaj yeem tuaj rau koj thiab sim npaj rau lawv ua ntej los ntawm kev nkag siab cov lus no ua ntej koj yuav tsum tau thov (kho) ntawm ib qhov project lossis txawm mus rau qhov project uas lawv xav tau.
"Standard" lus nug
Standard query languages yog precisely nyob rau hauv lub siab hais tias peb feem ntau xav txog lawv thaum peb tham txog queries.
Relational algebra
Vim li cas kev sib raug zoo algebra niaj hnub no? Txhawm rau kom muaj kev nkag siab zoo txog vim li cas cov lus nug tau teeb tsa hauv ib txoj hauv kev thiab siv lawv lub siab, koj yuav tsum nkag siab cov ntsiab lus hauv qab lawv.
Relational algebra yog dab tsi?
Cov ntsiab lus tseem ceeb yog raws li hauv qab no: kev sib raug zoo algebra yog qhov kaw kaw ntawm kev ua haujlwm ntawm kev sib raug zoo hauv cov ntaub ntawv sib raug zoo. Txhawm rau muab nws me ntsis ntxiv rau tib neeg, qhov no yog qhov kev ua haujlwm ntawm cov ntxhuav xws li qhov tshwm sim yog ib lub rooj.
Saib tag nrho cov haujlwm kev sib raug zoo hauv qhov no tsab xov xwm los ntawm Habr - ntawm no peb piav qhia vim li cas koj yuav tsum paub thiab qhov twg nws tuaj yeem siv tau.
Vim li cas?
Pib nkag siab txog cov lus nug dab tsi yog txhua yam hais txog thiab kev ua haujlwm dab tsi tom qab cov lus qhia hauv cov lus nug tshwj xeeb feem ntau muab kev nkag siab tob txog qhov ua haujlwm hauv cov lus nug thiab yuav ua li cas.
Sau los ntawm qhov no cov ntawv. Ib qho piv txwv ntawm kev ua haujlwm: koom, uas koom nrog cov rooj.
Relational DBMSs: Oracle, Postgres, SQL Server, thiab lwm yam tseem muaj nyob txhua qhov chaw thiab muaj lub caij nyoog siab kawg uas koj yuav tau cuam tshuam nrog lawv, uas txhais tau hais tias koj yuav tau nyeem SQL (uas yog qhov yuav muaj) lossis sau nws ( tsis zoo li ib yam).
Nyeem thiab kawm dab tsi
Raws li tib cov kev sib txuas saum toj no (hais txog kev sib raug zoo algebra), muaj cov khoom siv tsis txaus ntseeg, piv txwv li, qhov no.
Los ntawm txoj kev, NoSQL yog dab tsi?
"Nws tsim nyog hais dua ib zaug ntxiv tias lo lus "NoSQL" muaj lub hauv paus chiv keeb kiag li thiab tsis muaj cov ntsiab lus lees paub lossis lub koom haum tshawb fawb tom qab nws." Sib raug ib tsab xov xwm ntawm Habr.
Qhov tseeb, tib neeg pom tau hais tias tus qauv kev sib raug zoo tsis tas yuav tsum tau daws ntau yam teeb meem, tshwj xeeb tshaj yog rau cov uas, piv txwv li, kev ua tau zoo yog qhov tseem ceeb thiab qee cov lus nug yooj yim nrog kev sib sau ua ke - qhov tseem ceeb yog qhov tseem ceeb rau kev ntsuas kev ntsuas sai sai thiab sau rau hauv database, thiab feem ntau cov yam ntxwv yog kev sib raug zoo muab tawm tsis tau tsuas yog tsis tsim nyog, tab sis kuj muaj teeb meem - yog vim li cas normalize ib yam dab tsi yog hais tias nws yuav lwj qhov tseem ceeb tshaj plaws rau peb (rau tej hauj lwm tshwj xeeb) - productivity?
Tsis tas li ntawd, cov kev hloov pauv hloov tau feem ntau xav tau hloov pauv cov lej ruaj khov ntawm cov qauv kev sib raug zoo ntawm cov qauv qub - thiab qhov no ua kom yooj yim rau kev txhim kho daim ntawv thov thaum nws tseem ceeb heev rau kev xa cov kab ke thiab pib ua haujlwm sai, ua cov txiaj ntsig - lossis schema thiab hom khaws cia cov ntaub ntawv. tsis tseem ceeb li.
Ntawm no, theej, koj tsuas yog yuav tsum tau ua tib zoo txheeb xyuas koj txoj haujlwm, nws muaj dab tsi thiab cov NoSQL systems muaj dab tsi uas yuav haum rau cov lus piav qhia - thiab tom qab ntawd pib kawm cov kab ke no.
Scripting Query Languages
Thaum xub thawj, nws zoo li, dab tsi Python yuav tsum ua nrog nws feem ntau - nws yog lus programming, thiab tsis yog hais txog cov lus nug.
Pandas yog ib tug Swiss Army riam ntawm Data Science; ib tug loj npaum li cas ntawm cov ntaub ntawv transformation, aggregation, thiab lwm yam tshwm sim nyob rau hauv nws.
Numpy - vector xam, matrices thiab linear algebra muaj.
Scipy - muaj ntau cov lej hauv cov pob no, tshwj xeeb tshaj yog cov stats.
Jupyter lab - ntau qhov kev tshawb nrhiav cov ntaub ntawv txheeb xyuas haum zoo rau hauv lub laptops - pab tau kom paub.
Kev thov - ua haujlwm nrog lub network.
Pyspark yog nrov heev ntawm cov ntaub ntawv engineers, feem ntau koj yuav tau cuam tshuam nrog qhov no los yog Spark, tsuas yog vim lawv qhov chaw.
* Selenium - muaj txiaj ntsig zoo rau kev sau cov ntaub ntawv los ntawm cov chaw thiab cov peev txheej, qee zaum tsis muaj lwm txoj hauv kev kom tau txais cov ntaub ntawv.
Cov qauv kev txheeb cais tshwj xeeb thiab cov qauv kev kawm tshuab (uas feem ntau tuaj yeem pom nyob rau hauv tus sau version raws li pob R).
Vim li cas nws yeej yog lus nug? Nyob rau hauv daim ntawv nyob rau hauv uas nws yog feem ntau pom, nws yog tiag tiag ib tug thov los tsim ib tug qauv, nrog rau kev nyeem cov ntaub ntawv thiab kho cov lus nug (qauv) tsis, raws li zoo raws li pom cov ntaub ntawv nyob rau hauv tej pob khoom xws li ggplot2 - qhov no kuj yog ib daim ntawv ntawm kev sau queries. .
Piv txwv queries rau visualization
ggplot(data = beav,
aes(x = id, y = temp,
group = activ, color = activ)) +
geom_line() +
geom_point() +
scale_color_manual(values = c("red", "blue"))
Feem ntau, ntau lub tswv yim los ntawm R tau tsiv mus rau hauv cov pob python xws li pandas, numpy lossis scipy, zoo li dataframes thiab cov ntaub ntawv vectorization - yog li feem ntau ntau yam hauv R yuav zoo li paub thiab yooj yim rau koj.
Ntawm no kuv muaj kev paub txawv me ntsis, vim tias kuv feem ntau yuav tsum tau ua haujlwm nrog kev paub graphs thiab cov lus nug rau graphs. Yog li ntawd, cia peb luv luv mus dhau lub hauv paus, vim qhov no yog qhov txawv me ntsis ntxiv.
Xav txog tias koj tab tom ua qauv rau ib tug neeg thiab xav piav txog tej yam tseem ceeb, piv txwv li, cia peb coj ib tus neeg tshwj xeeb, Douglas Adams, thiab siv cov lus piav qhia no ua lub hauv paus.
Feem ntau, kuv yuav tsum tau nyeem SPARQL es tsis sau nws - nyob rau hauv qhov xwm txheej ntawd, nws yuav yog ib qho txuj ci tseem ceeb kom nkag siab cov lus yam tsawg kawg ntawm theem pib kom nkag siab raws nraim li cas cov ntaub ntawv rov qab tau.
Muaj ntau yam ntaub ntawv los kawm online: piv txwv li, ntawm no qhov no и qhov no. Kuv feem ntau google tshwj xeeb tsim thiab piv txwv thiab qhov txaus rau tam sim no.
Lus nug lus nug
Koj tuaj yeem nyeem ntxiv txog lub ntsiab lus hauv kuv tsab xov xwm S, SѓS,. Thiab ntawm no, peb tsuas yog tshawb xyuas luv luv vim li cas cov lus muaj tswv yim zoo tsim nyog rau kev sau cov lus nug. Qhov tseem ceeb, RDF tsuas yog ib txheej ntawm cov ntsiab lus ntawm daim ntawv p(X) thiab h(X,Y), thiab cov lus nug txog kev xav muaj hauv qab no:
output(X) :- country(X), member_of(X,“EU”).
Ntawm no peb tab tom tham txog kev tsim cov khoom lag luam tawm tshiab / 1 (/ 1 txhais tau tias tsis muaj npe), muab tias rau X nws yog qhov tseeb tias lub teb chaws (X) - piv txwv li, X yog lub teb chaws thiab kuj member_of(X,"EU").
Uas yog, nyob rau hauv cov ntaub ntawv no, ob qho tib si cov ntaub ntawv thiab cov kev cai yog nthuav nyob rau hauv tib txoj kev, uas ua rau nws muaj peev xwm ua qauv teeb meem yooj yim heev thiab zoo.
Koj ntsib qhov twg hauv kev lag luam?: tag nrho cov phiaj xwm loj nrog lub tuam txhab uas sau cov lus nug hauv cov lus zoo li no, nrog rau cov phiaj xwm tam sim no hauv lub hauv paus ntawm lub kaw lus - nws yuav zoo li qhov no yog qhov txawv txawv, tab sis qee zaum nws tshwm sim.