What Pandas 1.0 Brought Us

On January 9, Pandas 1.0.0rc was released. The previous version of the library was 0.25.

The first major release brings many welcome new features, including improved automatic dataframe summarization, more output formats, new data types, and even a new documentation site.

The full list of changes is in the release notes; this article is limited to a short, not very technical overview of the most important ones.

You can install the library as usual with pip, but since Pandas 1.0 is still a release candidate at the time of writing, you have to specify the version explicitly:

pip install --upgrade pandas==1.0.0rc0

Careful: since this is a major release, upgrading may break old code!

Note also that Python 2 is not supported at all in this version (which may well be a good reason to upgrade; translator's note). Pandas 1.0 requires Python 3.6 or later, so if you are not sure, check which versions you have installed:

$ pip --version
pip 19.3.1 from /usr/local/lib/python3.7/site-packages/pip (python 3.7)

$ python --version
Python 3.7.5

The simplest way to check the Pandas version is this:

>>> import pandas as pd
>>> pd.__version__
'1.0.0rc0'
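The two checks above can also be combined in a script. The following is only a minimal sketch: the helper name `meets_requirements` and its parameters are made up for illustration, and it assumes the version string starts with a plain integer, as in `1.0.0rc0`.

```python
import sys

import pandas as pd


def meets_requirements(min_python=(3, 6), min_pandas_major=1):
    """Return True if both the interpreter and pandas are new enough.

    Hypothetical helper, not part of pandas.
    """
    python_ok = sys.version_info[:2] >= min_python
    # pd.__version__ looks like "1.0.0rc0"; the first component is the major version.
    pandas_major = int(pd.__version__.split(".")[0])
    return python_ok and pandas_major >= min_pandas_major


print(sys.version_info[:2], pd.__version__, meets_requirements())
```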

Improved auto-summarization with DataFrame.info

My favorite new feature is the update to the DataFrame.info method. Its output has become much more readable, which makes exploring data easier:

>>> df = pd.DataFrame({
...     'A': [1, 2, 3],
...     'B': ["goodbye", "cruel", "world"],
...     'C': [False, True, False]
... })
>>> df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 3 entries, 0 to 2
Data columns (total 3 columns):
 #   Column  Non-Null Count  Dtype
---  ------  --------------  -----
 0   A       3 non-null      int64
 1   B       3 non-null      object
 2   C       3 non-null      bool
dtypes: bool(1), int64(1), object(1)
memory usage: 179.0+ bytes
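If the summary is needed as text, for example for a log or a report, it can be captured instead of printed. This is a minimal sketch using the `buf` and `memory_usage` parameters of `DataFrame.info`; the variable names are only illustrative.

```python
import io

import pandas as pd

df = pd.DataFrame({
    "A": [1, 2, 3],
    "B": ["goodbye", "cruel", "world"],
    "C": [False, True, False],
})

# info() writes to stdout by default; passing a buffer captures the text instead.
buffer = io.StringIO()
# memory_usage="deep" also counts the Python string objects in column B.
df.info(buf=buffer, memory_usage="deep")
summary = buffer.getvalue()
print(summary)
```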

Outputting tables in Markdown format

A similarly welcome addition is the ability to export dataframes to Markdown tables with DataFrame.to_markdown (this relies on the optional tabulate package).

>>> print(df.to_markdown())
|    |   A | B       | C     |
|---:|----:|:--------|:------|
|  0 |   1 | goodbye | False |
|  1 |   2 | cruel   | True  |
|  2 |   3 | world   | False |

This makes it much easier to publish tables on sites such as Medium using GitHub gists.
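Since `to_markdown` depends on the optional `tabulate` package, a script may want to handle its absence. A minimal sketch, assuming a fallback message is acceptable when the dependency is missing:

```python
import pandas as pd

df = pd.DataFrame({
    "A": [1, 2, 3],
    "B": ["goodbye", "cruel", "world"],
    "C": [False, True, False],
})

try:
    # Returns the table as a single string when no buffer is given.
    markdown_table = df.to_markdown()
except ImportError:
    # pandas raises ImportError when tabulate is not installed.
    markdown_table = None
    print("to_markdown needs the 'tabulate' package: pip install tabulate")
else:
    print(markdown_table)
```

The resulting string can then be pasted into a gist or written to a `.md` file.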

New types for strings and booleans

The Pandas 1.0 release also adds new experimental types. Their API may still change, so use them with care. In general, though, Pandas recommends using the new types wherever it makes sense.

For now, the conversion has to be done explicitly:

>>> B = pd.Series(["goodbye", "cruel", "world"], dtype="string")
>>> C = pd.Series([False, True, False], dtype="bool")
>>> df["B"] = B
>>> df["C"] = C
>>> df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 3 entries, 0 to 2
Data columns (total 3 columns):
 #   Column  Non-Null Count  Dtype
---  ------  --------------  -----
 0   A       3 non-null      int64
 1   B       3 non-null      string
 2   C       3 non-null      bool
dtypes: bool(1), int64(1), string(1)
memory usage: 179.0 bytes

Note how the Dtype column now shows string and bool.
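One detail worth spelling out: the alias "bool" is the long-standing NumPy boolean type, while the experimental nullable type added in 1.0 is requested with the alias "boolean". The sketch below converts both columns in one `astype` call and shows that the new types can hold missing values; the variable names are only illustrative.

```python
import pandas as pd

df = pd.DataFrame({
    "A": [1, 2, 3],
    "B": ["goodbye", "cruel", "world"],
    "C": [False, True, False],
})

# Explicitly convert to the experimental extension types.
converted = df.astype({"B": "string", "C": "boolean"})
print(converted.dtypes)

# Unlike plain "bool", the nullable "boolean" type can represent a missing value.
flags = pd.Series([True, None, False], dtype="boolean")
missing_count = int(flags.isna().sum())
print(flags)
print(missing_count)
```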

The most useful benefit of the new string type is the ability to select only the string columns of a dataframe. This can make parsing text data easier:

df.select_dtypes("string")

Previously, string columns could not be selected without explicitly listing their names.

You can read more about the new types in the Pandas documentation.

Thanks for reading! The full list of changes, as mentioned earlier, is available in the release notes.
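To illustrate the difference, here is a minimal sketch with a dataframe that has one real string column and one mixed-content object column. The column name `mixed` and the data are made up for the example.

```python
import pandas as pd

df = pd.DataFrame({
    "A": [1, 2, 3],
    "B": pd.Series(["goodbye", "cruel", "world"], dtype="string"),
    # An object column can hold anything, not just text.
    "mixed": pd.Series(["x", 1, None], dtype="object"),
})

# Selecting by the new dtype picks up only the dedicated string column.
string_columns = df.select_dtypes("string").columns.tolist()
# Selecting "object" returns the mixed column, whether or not it contains text.
object_columns = df.select_dtypes("object").columns.tolist()

print(string_columns)
print(object_columns)
```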

Source: www.hab.com
