52 datasets emapurojekiti ekudzidzisa

  1. Mall Customers Dataset - data yevashanyi vezvitoro: id, murume kana mukadzi, zera, mari, chiyero chekushandisa. (Sarudzo yekushandisa: Mutengi Segmentation Project ine Machine Kudzidza)
  2. Iris Dataset - dataset yevanotanga, ine hukuru hwema sepals uye petals emaruva akasiyana-siyana.
  3. MNIST Dataset - dataset yenhamba dzakanyorwa nemaoko. 60 mifananidzo yekudzidziswa uye zviuru gumi zvebvunzo mifananidzo.
  4. Iyo Boston Housing Dataset dhatabheti rakakurumbira rekuzivikanwa kwepateni. Ine ruzivo nezve dzimba muBoston: nhamba yedzimba, mitengo yekurenda, indekisi yemhosva.
  5. Fake News Detection Dataset - ine 7796 zvinyorwa zvine maratidziro enhau: chokwadi kana nhema. (Sarudzo yekushandisa ine kodhi kodhi muPython: Fake News Detection Python Project )
  6. Waini quality dataset - ine ruzivo nezvewaini: 4898 zvinyorwa zvine gumi nemana paramita.
  7. SOCR data - Hurefu uye Weights Dataset - sarudzo yakanaka yekutanga nayo. Iine zvinyorwa zve25 zvehurefu uye huremu hwevanhu vane makore gumi nemasere.

    52 datasets emapurojekiti ekudzidzisa

    Chinyorwa chakaturikirwa nerutsigiro rweEDISON Software, iyo anozadzisa mirairo kubva kuSouthern China "zvakanakisa", pamwe chete inovandudza mawebhusaiti uye mawebhusaiti.

  8. Parkinson Dataset - 195 zvinyorwa zvevarwere vane chirwere cheParkinson, vane 25 kuongororwa parameters. Inogona kushandiswa pakuongorora kwekutanga kwemutsauko pakati pevanhu vanorwara nevanhu vane hutano. (Sarudzo yekushandisa ine kodhi kodhi muPython: Machine Kudzidza Project paKuona chirwere cheParkinson)
  9. Titanic Dataset - ine ruzivo rwevafambi (zera, murume kana mukadzi, hama dziri mubhodhi, nezvimwewo) 891 mugadziriro yekudzidziswa uye 418 muyedzo seti.
  10. Uber Pickups Dataset - ruzivo rwenzendo dzinosvika mamirioni mana neshanu paUber muna 4.5 uye mamirioni gumi nemana muna 2014. (Sarudzo yekushandisa ine kodhi kodhi muR: Uber Data Analysis Project muR)
  11. Chars74k Dataset - ine mifananidzo yeBritish neCanada zviratidzo zvemakirasi makumi matanhatu nemana: 64-0, AZ, az. 9 7700k mifananidzo yakasikwa, 7.7k yakanyorwa nemaoko, 3400 mafonti akagadzirwa nekombuta.
  12. Chikwereti Kadhi Kubiridzira Detection Dataset - ine ruzivo rwezvekutengeserana kwemakadhi echikwereti akakanganiswa. (Sarudzo yekushandisa ine source: Chikwereti Kadhi Kubiridzira Kuona Machine Kudzidza Project)
  13. Chatbot Chinangwa Dataset - faira reJSON rine ma tag akasiyana siyana: kwaziso, kwaziwai, hospital_search, pharmacy_search, nezvimwe. Ine seti yemibvunzo-mhinduro matemplate. (Sarudzo yekushandisa ine kodhi kodhi muPython: Chatbot Project muPython)
  14. Enron Email Dataset - ine hafu yemiriyoni mavara kubva ku150 Enron maneja.
  15. Iyo Yelp Dataset - ine 1,2 miriyoni kurudziro kubva kune 1,6 miriyoni vashandisi vangangoita mamirioni 1,2 masangano.
  16. Jeopardy Dataset β€” zvinopfuura 200 zvemibvunzo nemhinduro zvakarekodhwa kubva mumutambo weterevhizheni une mukurumbira.
  17. Recommender Systems Dataset - portal ine muunganidzwa wedatasets kubva kuUCSD University. Iine marekodhi eongororo panzvimbo dzakakurumbira (Goodreads, Amazon). Yakakura yekugadzira recommender masisitimu. (Sarudzo yekushandisa ine kodhi kodhi muR: Movie Recommendation System Project muR )
  18. UCI Spambase Dataset - dhata rekudzidzisa rekuona spam. Iine mavara 4601 ane 57 metadata paramita.
  19. Flickr 30k Dataset - anopfuura 30 mifananidzo uye zvinyorwa. (Flickr 8k Dataset - 8000 mifananidzo. Python source project: Mufananidzo Caption Jenareta Python Project)
  20. IMDB wongororo - 25 ongororo yemabhaisikopo mune yekudzidziswa seti uye 000 muyedzo seti. (Sarudzo yekushandisa ine kodhi kodhi muR: Sentiment Analysis Data Science Project)
  21. Nhoroondo ye MS COCO - 1,5 miriyoni tagged mifananidzo.
  22. CIFAR-10 uye CIFAR-100 dataset - CIFAR-10 ine 60,000 mifananidzo midiki ye32 * 32 pixels nhamba 0-9. CIFAR-100 - maererano, 0-100.
  23. GTSRB (German traffic sign recognition benchmark) Dataset - 50 mifananidzo ye000 zviratidzo zvemugwagwa. (Sarudzo yekushandisa ine kodhi kodhi muPython: Traffic Signs Recognition Python Project)
  24. ImageNet dataset - ine anopfuura zviuru zana mitsara uye ingangoita zviuru zvemifananidzo pamutsara wega wega.
  25. Zamu Histopathology Images Dataset - iyo dataset ine mifananidzo yemasampuli egomarara rezamu. (Sarudzo yekushandisa ine source code pa Breast Cancer Classification Python Project)
  26. Cityscapes Dataset -Ine zvemhando yepamusoro zvirevo zvevhidhiyo kutevedzana kwemigwagwa mumaguta akasiyana.
  27. Kinetics Dataset - ine URL link kune angangoita 6,5 miriyoni emhando yepamusoro mavhidhiyo.
  28. MPII yemunhu pose dataset - iyo dataset ine 25 mifananidzo yezvimiro zvevanhu zvine majoini zvirevo.
  29. 20BN-chimwe chinhu-chimwe chinhu dataset v2 - seti yemavhidhiyo emhando yepamusoro anoratidza maitiro anoita munhu chimwe chiitiko.
  30. Chinhu 365 Dataset - dataset yemifananidzo yemhando yepamusoro ine chinhu chinosungirirwa mabhokisi.
  31. Mufananidzo wekutora dataset - ine anopfuura zviuru zana emifananidzo ine yavo yekudhirowa.
  32. Nhoroondo ye CQ500 - iyo dataset ine 491 CT scans yemusoro ine 193 zvimedu.
  33. IMDB-Wiki dataset - dhatabheti rine anopfuura mamirioni mashanu emifananidzo yezviso inocherechedzwa nemurume kana zera. (Sarudzo yekushandisa ine source code pa Gender & Age Detection Python Project)
  34. Youtube 8M Dataset -Yakanyorwa vhidhiyo dataset ine 6,1 miriyoni Youtube vhidhiyo ID
  35. Urban Sound 8K dataset - seti yemadhorobha enzwi data (ine 8732 kurira kwemadhorobha kubva kumakirasi gumi).
  36. LSUN Dataset - dhatabheti yemamirioni emifananidzo yemavara ezviratidziro uye zvinhu (angangoita 59 miriyoni mifananidzo, gumi akasiyana ezviitiko zvikamu uye makumi maviri akasiyana ezvinhu).
  37. RAVDESS Dataset - audiovisual dhatabhesi yekutaura kwemanzwiro. (Sarudzo yekushandisa ine source code pa Kutaura Emotion Recognition Python Project)
  38. Librispeech Dataset - iyo dataset ine 1000 maawa ekutaura Chirungu ane maredhiyo akasiyana.
  39. Baidu Apolloscape Dataset - dataset yekuvandudza tekinoroji yekuzvityaira.
  40. Quandl Data Portal - repository yehupfumi uye data data (pane zvemahara uye zvakabhadharwa zvemukati).
  41. Iyo World Bank Open Data Portal - ruzivo rwezvikwereti zvakapihwa neWorld Bank kunyika dzichiri kusimukira.
  42. IMF Data Portal inzvimbo yepasi rose yehomwe yemari inoburitsa data pamusoro pemari yepasi rose, zvikwereti, mari, mari yekunze uye zvigadzirwa.
  43. American Economic Association (AEA) Data Portal -Chishandiso chekutsvaga US macroeconomic data.
  44. Google Trends Data Portal -Google maitiro data inogona kushandiswa kuongorora nekuongorora uye kuongorora data.
  45. Financial Times Market Data Portal chitubu cheruzivo rwechizvino-zvino pamisika yemari kubva pasirese.
  46. Data.gov Portal - Hurumende yeUS yakavhura data portal (kurima, hutano, mamiriro ekunze, dzidzo, simba, mari, sainzi netsvagiridzo, nezvimwewo).
  47. Data Portal: Vhura data yehurumende (India) ipuratifomu yehurumende yakavhurika yeIndia.
  48. Nzvimbo yekudya Atlas Data Portal - ine tsvakurudzo data pamusoro pezvokudya muUnited States.
  49. Health Data Portal ndiyo portal yeUS Department of Health and Human Services.
  50. Centers for Disease Control and Prevention Data Portal - ine ruzivo rwakasiyana-siyana rwehutano hwehutano.
  51. London Datastore Portal - data nezvehupenyu hwevanhu muLondon.
  52. Canada Government Open Data Portal - portal yedata rakavhurika nezvevaCanada (kurima, hunyanzvi, mimhanzi, dzidzo, hurumende, hutano hwehutano, nezvimwewo)

Verenga zvimwe

Source: www.habr.com

Voeg