Ngama-52 amaxwebhu edatha yeeprojekthi zoqeqesho

  1. Iseti yedatha yabaThengi baseMall -Idatha yeendwendwe zevenkile: id, isini, iminyaka, umvuzo, ukulinganisa inkcitho. (Ukhetho lwesicelo: Iprojekthi yoLwahlulo lwabaThengi ngokuFunda koomatshini)
  2. Iris Dataset - isethi yedatha yabaqalayo, equlethe ubungakanani be-sepals kunye neepetali zeentyatyambo ezahlukeneyo.
  3. Iseti yedatha ye-MNIST β€” iseti yedatha yamanani abhalwe ngesandla. Imifanekiso ye-60 yoqeqesho kunye nemifanekiso yovavanyo ye-000.
  4. Iseti yedatha yeZindlu yaseBoston yiseti yedatha edumileyo yoqwalaselo lwepateni. Iqulethe ulwazi malunga nezindlu eBoston: inani lezindlu, amaxabiso okurenta, isalathiso solwaphulo-mthetho.
  5. Iseti yedatha yokuFumana iindaba ezingeyonyani - iqulethe amangeno angama-7796 anophawu lweendaba: yinyani okanye bubuxoki. (Inketho yesicelo ngekhowudi yomthombo kwiPython: Iprojekthi yePython yokufunyanwa kweendaba ezingeyonyani )
  6. Idatha yomgangatho wewayini - iqulethe ulwazi malunga newayini: iirekhodi ze-4898 ezineeparamitha ezili-14.
  7. Idatha ye-SOCR-Ubude kunye neseti yedatha yobunzima -inketho elungileyo ukuqala ngayo. Iqulethe iirekhodi ze-25 zobude kunye nobunzima babantu abaneminyaka eyi-000 ubudala.

    Ngama-52 amaxwebhu edatha yeeprojekthi zoqeqesho

    Eli nqaku liguqulelwe ngenkxaso ye-EDISON Software, leyo uzalisekisa iiodolo ezivela kuMazantsi eTshayina "ngokugqwesileyo", kwakunye iphuhlisa usetyenziso lwewebhu kunye neewebhusayithi.

  8. Parkinson Dataset - Iirekhodi ze-195 zezigulane ezine-Parkinson's disease, kunye neeparitha ze-25 zokuhlalutya. Ingasetyenziselwa uvavanyo lokuqala lomahluko phakathi kwabantu abagulayo kunye nabantu abasempilweni. (Inketho yesicelo ngekhowudi yomthombo kwiPython: Iprojekthi yokuFunda ngoomatshini ekuFundeni isifo sikaParkinson)
  9. Iseti yedatha yeTitanic - iqulethe ulwazi malunga nabagibeli (ubudala, isini, izihlobo ebhodini, njl.) I-891 kwisethi yoqeqesho kunye ne-418 kwisethi yovavanyo.
  10. Uber Pickups Dataset - ulwazi malunga ne-4.5 yezigidi zeehambo ku-Uber ngo-2014 kunye ne-14 yezigidi ngo-2015. (Ukhetho lwesicelo ngekhowudi yemvelaphi kwi-R: Iprojekthi yoHlahlo lweDatha ka-Uber kwi-R)
  11. Chars74k isethi yedatha - iqulethe imifanekiso yaseBrithani kunye neCanada yeempawu ze-64 iiklasi: 0-9, A-Z, a-z. 7700 7.7k imifanekiso yendalo, 3400k ebhalwe ngesandla, 62000 iifonti ezenziwe ngekhompyutha.
  12. Iseti yedatha yokubona ubuqhophololo kwikhadi letyala - inolwazi malunga nokuthengiselana kwamakhadi okuthenga ngetyala. (Ukhetho losetyenziso olunomthombo: IProjekthi yokuFundisa uMatshini woBuqhetseba beKhadi leTyala)
  13. Iseti yedatha yeeNjongo zeChatbot - ifayile ye-JSON equlethe iithegi ezahlukeneyo: imibuliso, sala kakuhle, isibhedlele_ukukhangela, ikhemesti_uphendlo, njl. Iqulethe uluhlu lweetemplate zempendulo yemibuzo. (Inketho yesicelo ngekhowudi yomthombo kwiPython: Iprojekthi yeChatbot kwiPython)
  14. Iseti yedatha ye-imeyile ye-Enron - iqulethe isiqingatha sesigidi seeleta ezivela kubaphathi be-150 Enron.
  15. Iseti yedatha yeYelp - iqulethe i-1,2 yezigidi zeengcebiso ezivela kubasebenzisi be-1,6 yezigidi malunga nemibutho ye-1,2 yezigidi.
  16. Iseti yedatha yeNgcipheko β€” ngaphezu kwe-200 yemibuzo neempendulo ezirekhodwe kumdlalo kamabonakude odumileyo.
  17. Umcebisi weSeti yedatha yeeNkqubo - i-portal enengqokelela yedatha evela kwiYunivesithi yase-UCSD. Iqulethe iirekhodi zokuphononongwa kwiindawo ezidumileyo (i-Goodreads, i-Amazon). Inkulu ekudaleni iinkqubo zokuncoma. (Ukhetho lwesicelo ngekhowudi yemvelaphi kwi-R: IProjekthi yeNkqubo yeeNgcono zeMovie kwi-R )
  18. UCI Spambase Dataset - isethi yedatha yoqeqesho lokukhangela ugaxekile. Iqulethe iileta ezingama-4601 ezineeparamitha zemethadatha ezingama-57.
  19. Flickr 30k Dataset - ngaphezu kwe-30 yemifanekiso kunye nezihloko. (Flickr 8k Dataset - 8000 imifanekiso. Iprojekthi yomthombo wePython: Iprojekthi yePython yeProjekthi yeNgcaciso yoMfanekiso)
  20. Uphononongo lwe-IMDB - Uphononongo lwe-movie lwe-25 kwiseti yoqeqesho kunye ne-000 kwisethi yovavanyo. (Ukhetho lwesicelo ngekhowudi yemvelaphi kwi-R: Iprojekthi yeNzululwazi yoHlalutyo lweNzelo)
  21. Iseti yedatha ye-MS COCO - 1,5 yezigidi zemifanekiso ephawulweyo.
  22. CIFAR-10 kunye ne-CIFAR-100 iseti yedatha - CIFAR-10 iqulethe 60,000 imifanekiso encinane 32 * 32 pixels amanani 0-9. CIFAR-100 - ngokulandelanayo, 0-100.
  23. I-GTSRB (ibenchmark yophawu lwetrafikhi yaseJamani) Iseti yedatha β€” Imifanekiso engama-50 yemiqondiso yendlela engama-000. (Inketho yesicelo ngekhowudi yomthombo kwiPython: IProjekthi yePython yokuQatshelwa kweempawu zeTrafikhi)
  24. ImageNet iseti yedatha - iqulethe ngaphezu kwe-100 yamabinzana kunye nemifanekiso emalunga ne-000 kwibinzana ngalinye.
  25. Iseti yedatha yeMifanekiso ye-Breastathology - isethi yedatha iqulethe imifanekiso yeesampulu zomhlaza wamabele. (Ukhetho lwesicelo ngekhowudi yemvelaphi IPython yoHlelo loMhlaza wamabele)
  26. Iseti yedatha yeCityscapes -Iqulethe amanqaku aphezulu olandelelwano lwevidiyo kwizitrato kwizixeko ezahlukeneyo.
  27. Iseti yedatha yeKinetics -iqulethe ikhonkco ye-URL malunga ne-6,5 yezigidi zeevidiyo ezikumgangatho ophezulu.
  28. Iseti yedatha yeMPII yabantu - i-dataset iqulethe imifanekiso engama-25 yokuma komntu kunye neenkcazo ezidibeneyo.
  29. 20BN-into-into iseti yedatha v2 -Iseti yeevidiyo ezikumgangatho ophezulu ezibonisa indlela umntu enza ngayo isenzo esithile.
  30. Into 365 Dataset - isethi yedatha yemifanekiso ekumgangatho ophezulu kunye neebhokisi zokubopha izinto.
  31. Iseti yedatha yokuzoba ifoto -inemifanekiso engaphezu kwe-1000 kunye nemizobo yayo.
  32. Idatha ye-CQ500 - i-dataset iqulethe i-491 CT scans zentloko kunye nezilayi ze-193.
  33. I-IMDB-Wiki iseti yedatha - isethi yedatha engaphezulu kwezigidi ezi-5 zemifanekiso yobuso ephawulwe ngokwesini kunye nobudala. (Ukhetho lwesicelo ngekhowudi yemvelaphi Isini kunye neProjekthi yePython yokuFumana iminyaka)
  34. Youtube 8M Dataset -Iseti yevidiyo ebhalwe igama equlathe izigidi ezi-6,1 zee-ID zevidiyo ze-Youtube
  35. Isandi sedolophu 8K iseti yedatha - isethi yedatha yesandi sasedolophini (iqulethe i-8732 izandi zasedolophini ezivela kwiiklasi ezili-10).
  36. Iseti yedatha ye-LSUN - isethi yedatha yezigidi zemifanekiso yemibala yemifanekiso kunye nezinto (malunga nezigidi ezingama-59 zemifanekiso, i-10 iindidi zeendawo ezahlukeneyo kunye ne-20 iindidi zezinto ezahlukeneyo).
  37. Iseti yedatha yeRAVDESS -idathabheyisi ye-audiovisual yentetho yeemvakalelo. (Ukhetho lwesicelo ngekhowudi yemvelaphi IProjekthi yePython yeNtetho yokuQatshelwa kweMvakalelo)
  38. Iseti yedatha yeLibrispeech β€” iseti yedatha iqulethe iiyure ezili-1000 zentetho yesiNgesi eneempawu ezahlukeneyo.
  39. Iseti yedatha ye-Baidu Apolloscape - isethi yedatha yophuhliso lobuchwepheshe bokuziqhuba.
  40. Quandl Data Portal - indawo yokugcina idatha yezoqoqosho kunye nezezimali (kukho umxholo wamahhala kunye nohlawulelwayo).
  41. IWorld Bank Open Data Portal β€” ulwazi lwemali-mboleko ekhutshwe yiBhanki yeHlabathi kumazwe asakhasayo.
  42. I-IMF Data Portal yingxowa-mali yemali yezizwe ngezizwe epapasha iinkcukacha ngezemali zamazwe ngamazwe, amaxabiso etyala, utyalo-mali, oovimba botshintshiselwano lwangaphandle kunye neempahla zorhwebo.
  43. Umbutho wezoQoqosho waseMelika (AEA) Data Portal -Isixhobo sokukhangela idatha ye-US macroeconomic.
  44. Google Trends Data Portal -Idatha yendlela kaGoogle ingasetyenziselwa ukujonga nokuhlalutya idatha.
  45. I-Financial Times Market Data Portal sisixhobo solwazi lwangoku kwiimalike zezemali ezivela kwihlabathi jikelele.
  46. Idatha.gov Portal - Urhulumente wase-US uvule i-data portal (ezolimo, impilo, imozulu, imfundo, amandla, imali, isayensi kunye nophando, njl.).
  47. Data Portal: Vula idatha karhulumente (India) liqonga ledatha likarhulumente waseIndiya elivulekileyo.
  48. Indawo yokutya I-Atlas Data Portal -iqulethe idatha yophando kwisondlo e-United States.
  49. Health Data Portal yi-portal yeSebe lezeMpilo lase-US kunye neeNkonzo zoLuntu.
  50. Amaziko oLawulo lweSifo kunye noThintelo lweDatha yeDatha - iqulethe uluhlu olubanzi lwedatha enxulumene nempilo.
  51. London Datastore Portal - data malunga nobomi babantu eLondon.
  52. Canada Government Open Data Portal -I-portal yedatha evulekileyo malunga nabantu baseCanada (ezolimo, ubugcisa, umculo, imfundo, urhulumente, ukhathalelo lwezempilo, njl.)

Funda ngokugqithisileyo

umthombo: www.habr.com

Yongeza izimvo