Ua hauj lwm nyob rau hauv multi-threaded hom nrog parallel input / output;
Kev them nyiaj yug rau kev hloov kho cov ntaub ntawv khaws tseg, suav nrog rau kev rov qab los ntawm lub xeev ntawm qee qhov taw qhia yav dhau los lossis kev hloov kho atomic ntawm tag nrho cov teeb tsa loj.
Muaj peev xwm txuas metadata;
Kev them nyiaj yug rau cov ntaub ntawv pab pawg;
Kev koom ua ke modules siv los ua lub cav qis qis hauv Spark, Dask, MariaDB, GDAL, PDAL, Rasterio, gVCF thiab PrestoDB;
Kev khi cov tsev qiv ntawv rau C ++ API rau Python, R, Java thiab Go.
Tso tawm 2.0 yog qhov tseem ceeb rau nws txoj kev txhawb nqa rau "DataFrame" lub tswv yim, uas tso cai rau cov ntaub ntawv khaws cia rau hauv daim ntawv ntawm kab ntawm qhov tseem ceeb ntawm qhov ntev ntev, khi rau qee yam cwj pwm. Qhov chaw cia kuj tseem ua kom zoo rau kev ua cov khoom sib txawv ntawm ntau qhov sib txawv (cov cell tuaj yeem khaws cov ntaub ntawv ntawm ntau hom thiab tuaj yeem ua haujlwm sib koom ua ke ntawm txhua hom sib txawv, piv txwv li, cov npe khaws cia, sijhawm thiab nqi). Ntxiv kev txhawb nqa rau kab nrog cov ntaub ntawv hlua. Ntxiv cov qauv rau kev koom ua ke nrog Google Cloud Storage thiab Azure Blob Storage. API rau R hom tau raug kho dua tshiab.