Iris Dataset - a beginner dataset containing sepal and petal measurements for different flowers.
MNIST Dataset - a collection of handwritten digits: 60,000 training images and 10,000 test images.
Boston Housing Dataset - a well-known classic dataset. It contains information about housing in Boston: number of dwellings, rental prices, crime index.
Titanic Dataset - contains information about passengers (age, sex, relatives aboard, and so on): 891 records in the training set and 418 in the test set.
Chars74K Dataset - contains images of English and Kannada characters in 64 classes: 0-9, A-Z, a-z. About 7.7k characters from natural images, 3.4k handwritten characters, and 62k computer-generated characters.
Chatbot Intents Dataset - a JSON file containing various tags: greeting, goodbye, hospital_search, pharmacy_search, and so on. It also contains response templates for the questions. (Usage example and source code in Python: Chatbot Project in Python)
Enron Email Dataset - contains half a million emails from 150 Enron managers.
Yelp Dataset - contains 1.2 million tips from 1.6 million users about 1.2 million businesses.
Jeopardy Dataset - more than 200,000 question-and-answer records from the popular television game show.
Recommender Systems Dataset - a portal with a collection of datasets from UCSD. It contains review histories from popular sites (Goodreads, Amazon). Well suited to building recommender systems. (Usage example and source code in R: Movie Recommendation System Project in R)
UCI Spambase Dataset - a dataset for training spam detection. It contains 4601 emails with 57 metadata attributes.
IMF Data Portal - the International Monetary Fund's portal, which publishes data on international finance, debt rates, investment, foreign exchange reserves, and commodities.
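As an illustration of the Chatbot Intents Dataset entry above, here is a minimal sketch of how a JSON file of tags and response templates might be read in Python. The sample data and the key names ("intents", "tag", "patterns", "responses") are assumptions made for this example and may not match the real file; the helper functions are hypothetical too.

```python
import json
import random

# Hypothetical sample in the spirit of the intents file described above.
# The key names ("intents", "tag", "patterns", "responses") are assumptions.
SAMPLE = """
{
  "intents": [
    {"tag": "greeting",
     "patterns": ["Hi", "Hello"],
     "responses": ["Hello!", "Hi there, how can I help?"]},
    {"tag": "goodbye",
     "patterns": ["Bye", "See you later"],
     "responses": ["Goodbye!", "See you soon."]},
    {"tag": "pharmacy_search",
     "patterns": ["Find a pharmacy"],
     "responses": ["Please tell me the pharmacy name."]}
  ]
}
"""


def load_responses(raw):
    """Map each tag to its list of response templates."""
    data = json.loads(raw)
    return {intent["tag"]: intent["responses"] for intent in data["intents"]}


def reply(tag, responses, rng=random):
    """Pick one response template for a tag, or a fallback for unknown tags."""
    options = responses.get(tag)
    if not options:
        return "Sorry, I did not understand that."
    return rng.choice(options)


responses = load_responses(SAMPLE)
print(sorted(responses))             # the tags found in the sample
print(reply("greeting", responses))  # one of the greeting templates
```

In a real chatbot a classifier would predict the tag from the user's message; this sketch only covers the lookup from a tag to a response template.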