52 datasets bakeng sa merero ea koetliso

  1. Mall Customers Dataset - Lintlha tsa baeti ba mabenkeleng: id, bong, lilemo, chelete, tekanyetso ea tšebeliso. (Khetho ea kopo: Morero oa ho Arola Bareki ka ho Ithuta ka Mochini)
  2. Iris Dataset - pokello ea data bakeng sa ba qalang, e nang le boholo ba li-sepals le lipalesa tsa lipalesa tse fapaneng.
  3. Lethathamo la lintlha tsa MNIST — pokello ea lintlha tsa linomoro tse ngotsoeng ka letsoho. Litšoantšo tse 60 tsa koetliso le litšoantšo tse 000 tsa liteko.
  4. Boston Housing Dataset ke pokello ea data e tsebahalang bakeng sa temoho ea paterone. E na le leseli mabapi le matlo a Boston: palo ea lifolete, litheko tsa khiriso, index ea botlokotsebe.
  5. Phatlalatso ea Phakello ea Litaba - E na le likenyelletso tse 7796 tse nang le matšoao a litaba: 'nete kapa bohata. (Khetho ea kopo e nang le khoutu ea mohloli ho Python: Morero oa Python oa ho lemoha litaba tse fosahetseng )
  6. Lethathamo la lintlha tsa boleng ba veine - e na le tlhahisoleseling mabapi le veine: lirekoto tse 4898 tse nang le liparamente tse 14.
  7. Lintlha tsa SOCR - Heights le Weights Dataset - khetho e ntle ea ho qala. E na le lirekoto tse 25 tsa bolelele le boima ba batho ba lilemo li 000.

    52 datasets bakeng sa merero ea koetliso

    Sengoloa se fetoletsoe ka tšehetso ea EDISON Software, e e phethahatsa litaelo tse tsoang Southern China "hantle", hammoho le e nts'etsapele lits'ebetso tsa webo le liwebsaete.

  8. Parkinson Dataset - Litlaleho tse 195 tsa bakuli ba nang le lefu la Parkinson, tse nang le litekanyetso tse 25 tsa tlhahlobo. E ka sebelisoa bakeng sa tlhahlobo ea pele ea phapang lipakeng tsa batho ba kulang le ba phetseng hantle. (Khetho ea kopo e nang le khoutu ea mohloli ho Python: Morero oa ho Ithuta ka Mochini mabapi le ho Fumana Lefu la Parkinson)
  9. Titanic Dataset - e na le tlhahisoleseding e mabapi le bapalami (lilemo, bong, beng ka sekepeng, joalo-joalo) 891 sete ea koetliso le 418 ka sete ea teko.
  10. Uber Pickups Dataset - leseli mabapi le maeto a limilione tse 4.5 ho Uber ka 2014 le limilione tse 14 ka 2015. (Khetho ea kopo e nang le khoutu ea mohloli ho R: Morero oa Tlhahlobo ea Lintlha tsa Uber ho R)
  11. Lethathamo la lintlha tsa Chars74k - e na le litšoantšo tsa matšoao a Brithani le a Canada a lihlopha tse 64: 0-9, AZ, az. 7700 7.7k litšoantšo tsa tlhaho, 3400k tse ngotsoeng ka letsoho, 62000 lifonti tse entsoeng ka khomphutha.
  12. Dataset ea ho Fumana Bomenemene ba Karete ea Mokitlane - e na le tlhahisoleseling mabapi le transaction ea likarete tsa mekoloto tse senyehileng. (Khetho ea kopo e nang le mohloli: Morero oa ho Ithuta oa Mochini oa ho Fumana Bomenemene ba Karete ea Mokitlane)
  13. Chatbot Intents Dataset - faele ea JSON e nang le li-tag tse fapaneng: litumeliso, sala hantle, sepetlele_search, pharmacy_search, joalo-joalo. E na le lithempleite tsa likarabo tsa lipotso. (Khetho ea kopo e nang le khoutu ea mohloli ho Python: Morero oa Chatbot ho Python)
  14. Enron Email Dataset - e na le litlhaku tse halofo ea milione tse tsoang ho batsamaisi ba 150 ba Enron.
  15. Yelp Dataset - e na le likhothaletso tse limilione tse 1,2 tse tsoang ho basebelisi ba limilione tse 1,6 tse ka bang limilione tse 1,2.
  16. Jeopardy Dataset — lirekoto tse fetang 200 tsa lipotso le likarabo ho tsoa papaling e tsebahalang ea thelevishene.
  17. Recommender Systems Dataset - sebaka sa marang-rang se nang le pokello ea lintlha tse tsoang Univesithing ea UCSD. E na le lirekoto tsa litlhahlobo libakeng tse tsebahalang (Goodreads, Amazon). E ntle bakeng sa ho theha litsamaiso tsa khothaletso. (Khetho ea kopo e nang le khoutu ea mohloli ho R: Morero oa Sistimi ea Keletso ea Lifilimi ho R )
  18. UCI Spambase Dataset - pokello ea lintlha tsa koetliso bakeng sa ho lemoha spam. E na le litlhaku tse 4601 tse nang le lintlha tse 57 tsa metadata.
  19. Flickr 30k Dataset - litšoantšo le litlhaloso tse fetang 30. (Flickr 8k Dataset - 8000 litšoantšo. Morero oa mohloli oa Python: Morero oa Python oa Tlhahiso ea Litšoantšo)
  20. Litlhahlobo tsa IMDB - Litlhahlobo tsa lifilimi tse 25 sehlopheng sa koetliso le 000 sehlopheng sa liteko. (Khetho ea kopo e nang le khoutu ea mohloli ho R: Sentiment Analysis Data Science Project)
  21. Lethathamo la lintlha tsa MS COCO - Litšoantšo tse tšoailoeng tse limilione tse 1,5.
  22. CIFAR-10 le CIFAR-100 dataset - CIFAR-10 e na le litšoantšo tse nyane tse 60,000 tsa linomoro tsa 32 * 32 pixels 0-9. CIFAR-100 - ka ho latellana, 0-100.
  23. GTSRB (Letšoao la tlhokomeliso ea lets'oao la sephethephethe la Jeremane) Seteishene sa Boitsebiso - Litšoantšo tse 50 tsa matšoao a litsela a 000. (Khetho ea kopo e nang le khoutu ea mohloli ho Python: Morero oa Python oa Tlhokomelo ea Matšoao a Sephethephethe)
  24. ImageNet dataset - E na le mantsoe a fetang 100 le litšoantšo tse ka bang 000 poleloaneng ka 'ngoe.
  25. Lenane la Boitsebiso ba Litšoantšo tsa Histopathology ea Matsoele - lethathamo la lintlha le na le litšoantšo tsa lisampole tsa mofetše oa matsoele. (Khetho ea kopo e nang le khoutu ea mohloli Morero oa Python oa Kankere ea Matsoele)
  26. Cityscapes Dataset - e na le litlhaloso tsa boleng bo holimo tsa tatellano ea livideo tsa literata metseng e fapaneng.
  27. Kinetics Dataset - e na le sehokelo sa URL ho livideo tsa boleng bo holimo tse ka bang limilione tse 6,5.
  28. Lethathamo la lintlha tsa MPII tsa motho - dataset e na le litšoantšo tse 25 tsa popeho ea batho e nang le litlhaloso tse kopanetsoeng.
  29. 20BN-ntho-ntho e itseng dataset v2 - sete ea livideo tsa boleng bo holimo tse bontšang kamoo motho a etsang ketso e itseng.
  30. Object 365 Dataset - pokello ea data ea litšoantšo tsa boleng bo holimo tse nang le mabokose a tlamang lintho.
  31. Sekhechana sa data sa ho sekeha linepe - E na le litšoantšo tse fetang 1000 tse nang le litšoantšo tsa tsona.
  32. Lethathamo la lintlha tsa CQ500 - dataset e na le 491 CT scans ea hlooho e nang le lilae tse 193.
  33. Sehlopha sa data sa IMDB-Wiki - pokello ea data e nang le litšoantšo tse fetang limilione tse 5 tsa lifahleho tse tšoailoeng ho ea ka bong le lilemo. (Khetho ea kopo e nang le khoutu ea mohloli Morero oa Python oa Tekano le Lilemo)
  34. Youtube 8M Dataset - Lethathamo la video le ngotsoeng le nang le li-ID tsa video tsa Youtube tse limilione tse 6,1
  35. Urban Sound 8K dataset - sete sa data ea melumo ea litoropong (e na le melumo ea 8732 ea litoropong ho tsoa lihlopheng tse 10).
  36. Lethathamo la lintlha tsa LSUN - pokello ea lintlha tsa limilione tsa litšoantšo tsa mebala ea liketsahalo le lintho (litšoantšo tse ka bang limilione tse 59, mekhahlelo e 10 e fapaneng ea liketsahalo le mekhahlelo e 20 e fapaneng ea lintho).
  37. RAVDESS Dataset - database ea audiovisual ea puo ea maikutlo. (Khetho ea kopo e nang le khoutu ea mohloli Puo Emotion Recognition Python Project)
  38. Librispeech Dataset - dataset e na le lihora tse 1000 tsa puo ea Senyesemane e nang le li-accents tse fapaneng.
  39. Baidu Apolloscape Dataset - pokello ea lintlha bakeng sa nts'etsopele ea mahlale a ho khanna.
  40. Quandl Data Portal - polokelo ea data ea moruo le ea lichelete (ho na le litaba tsa mahala le tse lefelloang).
  41. Banka ea Lefatše ea Open Data Portal - leseli la likalimo tse fanoeng ke Banka ea Lefatše ho ea linaheng tse tsoelang pele.
  42. IMF Data Portal ke portal ea machaba ea lichelete ea lichelete e phatlalatsang lintlha tse mabapi le lichelete tsa machaba, litekanyetso tsa likoloto, matsete, lichelete tsa lichelete tsa kantle ho naha le thepa.
  43. Mokhatlo oa Amerika oa Moruo (AEA) Data Portal - Sesebelisoa sa ho batla data ea macroeconomic ea US.
  44. Google Trends Data Portal - Lintlha tsa mekhoa ea Google li ka sebelisoa ho hlahloba le ho sekaseka lintlha.
  45. Financial Times Market Data Portal ke sesebelisoa sa tlhahisoleseling ea morao-rao mabapi le mebaraka ea lichelete ho tsoa lefats'eng ka bophara.
  46. Data.gov Portal - Mmuso oa US o bulehileng oa data portal (temo, bophelo bo botle, boemo ba leholimo, thuto, matla, lichelete, saense le lipatlisiso, joalo-joalo).
  47. Data Portal: Bula data ea mmuso (India) ke sethala sa litaba sa mmuso se bulehileng sa India.
  48. Tikoloho ea lijo Atlas Data Portal - e na le lintlha tsa lipatlisiso mabapi le phepo e nepahetseng United States.
  49. Bophelo bo Botle Data Portal ke portal ea US Department of Health and Human Services.
  50. Centers for Disease Control and Prevention Data Portal - e na le lintlha tse ngata tse amanang le bophelo bo botle.
  51. London Datastore Portal - lintlha tse mabapi le bophelo ba batho London.
  52. Canada Government Open Data Portal - portal ea data e bulehileng mabapi le batho ba Canada (temo, bonono, mmino, thuto, mmuso, tlhokomelo ea bophelo bo botle, jj.)

Bala haholoanyane

Source: www.habr.com

Eketsa ka tlhaloso