52 datasets for training projects

  1. Mall Customers Dataset - data on mall visitors: id, gender, age, income, spending score. (Sample application: Customer Segmentation Project with Machine Learning)
  2. Iris Dataset - a dataset for beginners containing sepal and petal measurements for different flowers.
  3. MNIST Dataset - a collection of handwritten digits: 60,000 training images and 10,000 test images.
  4. Boston Housing Dataset - a well-known dataset for pattern recognition. It contains information about housing in Boston: number of dwellings, rental prices, crime index.
  5. Fake News Detection Dataset - contains 7796 entries with news labels: real or fake. (Sample application with Python source code: Fake News Detection Python Project)
  6. Wine Quality Dataset - contains information about wines: 4898 records with 14 parameters.
  7. SOCR Data - Heights and Weights Dataset - a good option to start with. It contains 25,000 records of the height and weight of 18-year-olds.

    The article was translated with the support of EDISON Software, which fulfills orders from South China "with excellence" and develops web applications and websites.

  8. Parkinson Dataset - 195 records of patients with Parkinson's disease, with 25 analysis parameters. It can be used for a preliminary assessment of the difference between sick and healthy people. (Sample application with Python source code: Machine Learning Project on Detecting Parkinson's Disease)
  9. Titanic Dataset - contains information about the passengers (age, sex, relatives on board, etc.): 891 records in the training set and 418 in the test set.
  10. Uber Pickups Dataset - information about 4.5 million Uber trips in 2014 and 14 million in 2015. (Sample application with R source code: Uber Data Analysis Project in R)
  11. Chars74k Dataset - contains images of English and Kannada characters in 64 classes: 0-9, A-Z, a-z. About 7,700 characters from natural images, 3,400 hand-drawn, and 62,000 synthesized from computer fonts.
  12. Credit Card Fraud Detection Dataset - contains information about fraudulent credit card transactions. (Sample application with source code: Credit Card Fraud Detection Machine Learning Project)
  13. Chatbot Intents Dataset - a JSON file containing various tags: greeting, goodbye, hospital_search, pharmacy_search, and so on. Each tag contains a set of question and answer patterns. (Sample application with Python source code: Chatbot Project in Python)
  14. Enron Email Dataset - contains half a million emails from 150 Enron managers.
  15. Yelp Dataset - contains 1.2 million tips from 1.6 million users about 1.2 million businesses.
  16. Jeopardy Dataset - more than 200,000 question-and-answer records from the popular TV quiz show.
  17. Recommender Systems Dataset - a portal with a large collection of datasets from UCSD. It contains review data from popular sites (Goodreads, Amazon). Great for building recommender systems. (Sample application with R source code: Movie Recommendation System Project in R)
  18. UCI Spambase Dataset - training data for spam detection. It contains 4601 emails with 57 metadata parameters.
  19. Flickr 30k Dataset - more than 30,000 images with captions. (Flickr 8k Dataset - 8000 images. Python-based project: Image Caption Generator Python Project)
  20. IMDB reviews - 25,000 movie reviews in the training set and 25,000 in the test set. (Sample application with R source code: Sentiment Analysis Project in R)
  21. MS COCO dataset - 1.5 million labeled object instances.
  22. CIFAR-10 and CIFAR-100 - CIFAR-10 contains 60,000 small 32x32-pixel color images in 10 classes (labels 0-9). CIFAR-100 has 100 classes (labels 0-99).
  23. GTSRB (German Traffic Sign Recognition Benchmark) Dataset - 50,000 images of 43 road signs. (Sample application with Python source code: Traffic Signs Recognition Python Project)
  24. ImageNet dataset - contains more than 100,000 phrases and about 1,000 images per phrase.
  25. Breast Histopathology Images Dataset - contains images of breast cancer tissue samples. (Sample application with Python source code: Breast Cancer Classification Python Project)
  26. Cityscapes Dataset - contains high-quality annotations of video sequences of streets in different cities.
  27. Kinetics Dataset - contains URL links to about 650,000 high-quality videos.
  28. MPII Human Pose Dataset - contains 25,000 images of human poses with joint annotations.
  29. 20BN-Something-Something Dataset V2 - a set of high-quality videos showing how a person performs certain actions.
  30. Objects365 Dataset - a collection of high-quality images with object bounding boxes.
  31. Photo Sketching Dataset - contains more than 1,000 photos with their sketches.
  32. CQ500 Dataset - contains 491 head CT scans with 193,317 slices.
  33. IMDB-Wiki dataset - a collection of more than 500,000 face images labeled with gender and age. (Sample application with Python source code: Gender & Age Detection Python Project)
  34. YouTube-8M Dataset - a labeled video dataset containing 6.1 million YouTube video IDs.
  35. Urban Sound 8K Dataset - a set of urban sound recordings (contains 8732 urban sounds from 10 classes).
  36. LSUN Dataset - millions of color images of scenes and objects (about 59 million images, 10 scene categories, and 20 object categories).
  37. RAVDESS Dataset - an audiovisual database of emotional speech. (Sample application with Python source code: Speech Emotion Recognition Python Project)
  38. Librispeech Dataset - contains 1000 hours of English speech in various accents.
  39. Baidu Apolloscape Dataset - a dataset for developing self-driving technology.
  40. Quandl Data Portal - a repository of economic and financial data (free and paid content is available).
  41. World Bank Open Data Portal - data on loans issued by the World Bank to developing countries.
  42. IMF Data Portal - a portal of the International Monetary Fund that publishes data on international finance, debt rates, investment, foreign exchange reserves, and commodities.
  43. American Economic Association (AEA) Data Portal - a resource for finding US macroeconomic data.
  44. Google Trends Data Portal - Google Trends data can be used to visualize and analyze data.
  45. Market Data Portal - a resource for up-to-date information on financial markets around the world.
  46. Data.gov Portal - the US government's open data portal (agriculture, health, climate, education, energy, finance, science and research, etc.).
  47. Data Portal: Open Government Data (India) - the Indian government's open data platform.
  48. Food Environment Atlas Data Portal - contains research data on nutrition in the US.
  49. Health Data Portal - a portal of the US Department of Health and Human Services.
  50. Centers for Disease Control and Prevention - contains a large amount of health-related data.
  51. London Datastore Portal - data about the lives of people in London.
  52. Government of Canada Open Data Portal - an open data portal about Canadians (agriculture, art, music, education, government, health, etc.)
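
Some entries in the list (MNIST, Titanic, IMDB reviews) come with a ready-made training/test split; most of the others do not. As a rough illustration of making such a split by hand, here is a minimal sketch using only the Python standard library. The function name `split_records`, the 20% test fraction, and the toy rows are assumptions made for this example and are not taken from any of the datasets above.

```python
import random


def split_records(records, test_fraction=0.2, seed=0):
    """Shuffle a copy of `records` and return (train, test) lists."""
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(records)               # copy, so the input is not modified
    random.Random(seed).shuffle(shuffled)  # seeded for a reproducible split
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical toy rows standing in for real records (not real data).
rows = [(i, i * 2) for i in range(10)]
train, test = split_records(rows, test_fraction=0.2)
print(len(train), len(test))  # 8 2
```

Fixing the seed keeps the split identical between runs, which makes results on a practice project easier to compare.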


source: www.habr.com
