Mall Customers dataset - store visitor data: id, gender, age, income, spending score. (Application example: Customer Segmentation Project with Machine Learning)
Iris Dataset - a dataset for beginners containing the sizes of the sepals and petals of different flowers.
MNIST dataset - a dataset of handwritten digits: 60,000 training images and 10,000 test images.
Boston Housing dataset - a popular dataset for pattern recognition. It contains information about houses in Boston: number of dwellings, rental prices, crime index.
Fake News Detection dataset - contains 7796 entries, each labeled as real or fake news. (Application example with source code in Python: Fake News Detection Python Project)
Wine Quality dataset - contains information about wine: 4898 records with 14 parameters.
SOCR Data - Heights and Weights dataset - a good option to start with. It contains 25,000 records of the height and weight of 18-year-olds.
This article was translated with the support of EDISON Software, which fulfills orders from South China "excellently" and also develops web applications and websites.
Parkinson Dataset - 195 records of patients with Parkinson's disease, with 25 parameters for analysis. It can be used for a first assessment of the differences between sick and healthy people. (Application example with source code in Python: Machine Learning Project on Detecting Parkinson's Disease)
Titanic dataset - contains information about the passengers (age, sex, relatives on board, etc.): 891 in the training set and 418 in the test set.
Uber Pickups Dataset - information about 4.5 million Uber trips in 2014 and 14 million in 2015. (Application example with source code in R: Uber Data Analysis Project in R)
Chars74k dataset - contains images of English and Kannada characters in 64 classes: 0-9, A-Z, a-z. About 7,700 natural images, 3,400 handwritten images, and 62,000 computer-generated font images.
Credit Card Fraud Detection dataset - contains information about credit card transactions. (Application example with source code: Credit Card Fraud Detection Machine Learning Project)
Chatbot Intents dataset - a JSON file containing different tags: greeting, goodbye, hospital_search, pharmacy_search, etc. It contains a set of question and response templates.
(Application example with source code in Python: Chatbot Project in Python)
Enron Email dataset - contains half a million emails from 150 Enron managers.
Yelp dataset - contains 1.2 million tips from 1.6 million users about 1.2 million businesses.
Jeopardy dataset - more than 200,000 questions and answers recorded from the popular TV quiz show.
Recommender Systems Dataset - a portal with a collection of datasets from UCSD. It contains review records from popular sites (Goodreads, Amazon). Great for building recommender systems. (Application example with source code in R: Movie Recommendation System Project in R)
UCI Spambase Dataset - a training dataset for spam detection. It contains 4601 emails with 57 metadata parameters.
Flickr 30k Dataset - more than 30,000 images with captions. (Flickr 8k Dataset - 8000 images. Python source project: Image Caption Generator Python Project)
IMDB Reviews - 25,000 movie reviews in the training set and 25,000 in the test set. (Application example with source code in R: Sentiment Analysis Data Science Project)
MS COCO dataset - about 1.5 million labeled object instances.
CIFAR-10 and CIFAR-100 datasets - CIFAR-10 contains 60,000 small 32 * 32 pixel images in 10 classes (labels 0-9). CIFAR-100 correspondingly has 100 classes.
GTSRB (German Traffic Sign Recognition Benchmark) dataset - about 50,000 images of 43 traffic signs. (Application example with source code in Python: Traffic Signs Recognition Python Project)
ImageNet dataset - contains more than 100,000 phrases and about 1,000 images per phrase.
Breast Histopathology Images dataset - contains images of breast cancer samples.
(Application example with source code: Breast Cancer Classification Python Project)
Cityscapes dataset - contains high-quality annotations of video sequences of streets in different cities.
Kinetics dataset - contains URL links to about 6.5 million high-quality videos.
MPII Human Pose dataset - contains about 25,000 images of human poses with joint annotations.
20BN-something-something dataset v2 - a set of high-quality videos showing how a person performs a certain action.
Objects365 Dataset - a dataset of high-quality images with object bounding boxes.
Photo Sketching dataset - contains more than 1000 images and their contour drawings.
CQ500 dataset - contains 491 head CT scans with 193,317 slices.
IMDB-Wiki dataset - a dataset of more than 5 million face images labeled with gender and age. (Application example with source code: Gender and Age Detection Python Project)
Youtube 8M Dataset - a labeled video dataset containing 6.1 million YouTube video IDs.
Urban Sound 8K dataset - a dataset of urban sounds (contains 8732 urban sounds from 10 classes).
LSUN dataset - a dataset of millions of color images of scenes and objects (about 59 million images, 10 scene categories, and 20 object categories).
RAVDESS dataset - an audiovisual database of emotional speech.
(Application example with source code: Speech Emotion Recognition Python Project)
Librispeech dataset - contains 1000 hours of English speech with different accents.
Baidu Apolloscape dataset - a dataset for developing self-driving technology.
Quandl Data Portal - a repository of economic and financial data (some content is free, some is paid).
World Bank Open Data Portal - information on loans issued by the World Bank to developing countries.
IMF Data Portal - the International Monetary Fund publishes data on international finances, debt rates, investments, foreign exchange reserves, and commodities.
American Economic Association (AEA) Data Portal - a resource for finding US macroeconomic data.
Google Trends Data Portal - Google trends data can be used to visualize and analyze data.
Financial Times Market Data Portal - a resource for up-to-date information on financial markets around the world.
Data.gov Portal - the US government's open data portal (agriculture, health, climate, education, energy, finance, science and research, etc.).
Data Portal: Open Government Data (India) - the Indian government's open data platform.
Food Environment Atlas Data Portal - contains research data on nutrition in the United States.
Health Data Portal - the portal of the US Department of Health and Human Services.
Centers for Disease Control and Prevention Data Portal - contains a wide range of health-related data.
London Datastore Portal - data about the lives of people in London.
Canada Government Open Data Portal - an open data portal about Canadians (agriculture, art, music, education, government, health care, etc.).
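The Chatbot Intents entry in the list describes a JSON file of tags with question and response templates. As a rough illustration of how such a file might be organized and used, here is a minimal Python sketch. The key names (`intents`, `tag`, `patterns`, `responses`) and all sample phrases are assumptions made for this example, not taken from the actual dataset, and the exact-match lookup is only a stand-in for a trained classifier.

```python
import json
import random

# Hypothetical miniature of an intents file. The tag names follow the list
# above; the key names, patterns, and responses are invented for illustration.
INTENTS_JSON = """
{
  "intents": [
    {"tag": "greeting",
     "patterns": ["Hi", "Hello", "Good day"],
     "responses": ["Hello, how can I help?"]},
    {"tag": "goodbye",
     "patterns": ["Bye", "See you later"],
     "responses": ["Goodbye!"]},
    {"tag": "pharmacy_search",
     "patterns": ["Find a pharmacy", "Pharmacy near me"],
     "responses": ["Please tell me your location."]}
  ]
}
"""


def load_intents(text):
    """Parse the JSON text and index the intents by their tag."""
    data = json.loads(text)
    return {intent["tag"]: intent for intent in data["intents"]}


def match_tag(intents, message):
    """Return the tag whose pattern equals the message (case-insensitive), or None."""
    normalized = message.strip().lower()
    for tag, intent in intents.items():
        if any(normalized == pattern.lower() for pattern in intent["patterns"]):
            return tag
    return None


def respond(intents, message):
    """Pick one response template for the matched tag, or a fallback reply."""
    tag = match_tag(intents, message)
    if tag is None:
        return "Sorry, I did not understand that."
    return random.choice(intents[tag]["responses"])


intents = load_intents(INTENTS_JSON)
print(respond(intents, "hello"))
```

A real chatbot project would typically replace `match_tag` with a model trained on the patterns, while the response templates would still be looked up by tag in the same way.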
Source: www.habr.com