Kuburitswa kweTileDB 2.0 yekuchengetedza injini

Yakabudiswa pa kuchengetedza TileDB 2.0, yakagadziridzwa kuchengetedza multidimensional arrays uye data rinoshandiswa mukuverenga kwesainzi. Masisitimu akasiyana-siyana ekugadzirisa ruzivo rwemajini, nzvimbo uye data rezvemari dzinotaurwa senzvimbo dzekushandisa kweTileDB, i.e. masisitimu anoshanda sparse kana kuramba kuzadzwa multidimensional arrays. TileDB inopa raibhurari yeC ++ yekuburitsa pachena kuwana data uye metadata mumashandisirwo, kutarisira basa rose repasi-pasi rekuchengetedza kwakanaka. Iyo kodhi yeprojekiti yakanyorwa muC ++ uye inoparadzirwa ne pasi peMIT rezinesi. Inotsigira basa paLinux, macOS uye Windows.

Zvinonyanya kukosha zveTileDB:

  • Nzira dzinoshanda dzekuchengeta sparse arrays, iyo data mairi isingaenderere mberi; iyo dhizaini izere nezvimedu uye mazhinji ezvinhu zvinoramba zvisina chinhu kana kutora kukosha kwakafanana.
  • Kugona kuwana data mune kiyi-kukosha fomati kana makoramu seti (DataFrame);

    Kuburitswa kweTileDB 2.0 yekuchengetedza injini

  • Inotsigira kubatanidzwa nekuchengetedza gore AWS S3, Google Cloud Storage uye Azure Blob Storage;
  • Tsigiro yemataira (block) arrays;
  • Kugona kushandisa akasiyana data compression uye encryption algorithms;
  • Tsigiro yekutarisa kuvimbika uchishandisa checksums;
  • Shanda mu-multi-threaded mode ine parallel input/output;
  • Tsigiro yekushandura data yakachengetwa, kusanganisira yekudzoreredza nyika pane imwe nguva munguva yakapfuura kana maatomu anogadziridza emaseti makuru.
  • Kugona kubatanidza metadata;
  • Tsigiro yekuunganidza data;
  • Kubatanidza ma modules ekushandisa sejini yakaderera yekuchengetedza injini muSpark, Dask, MariaDB, GDAL, PDAL, Rasterio, gVCF uye PrestoDB;
  • Kusunga maraibhurari eC++ API yePython, R, Java uye Go.

Kuburitswa 2.0 kunozivikanwa nerutsigiro rwayo rwe "DataFrame" pfungwa, iyo inobvumira data kuti ichengetwe mumhando yemakoramu ehukoshi hwehurefu hwekupokana, hwakasungirirwa kune humwe hunhu. Iyo yekuchengetera yakagadziridzwa zvakare kugadzirisa sparse arrays eheterogeneous saizi (masero anogona kuchengeta data remhando dzakasiyana uye anogona kuita mashandiro ekubatanidza pamakoramu emhando dzakasiyana, semuenzaniso, iwo anochengeta zita, nguva uye mutengo). Yakawedzera tsigiro yemakoramu ane tambo data. Akawedzera mamodule ekubatanidzwa neGoogle Cloud Storage uye Azure Blob Storage. Iyo API yemutauro weR yakagadziridzwa patsva.

Source: opennet.ru

Voeg