Datasets

Here you can download the datasets used in the course.

DatasetFormatDescriptionLink
WikipediaTextLarger set of Wikipedia articles (396 about Programming and 245 about Games) to be used for assignment A3Download
Wikipedia 300ARFF and CSVWikipedia articles (150 about Programming and 150 about Games) to be used for project P3Download
Wikipedia clusteringTextWikipedia articles (90 about Programming and 90 about Games) to be used for project P2Download
Blog dataTextThe blog dataset with 99 blogs to be used for assignment A2Download
Movies exampleCSVThe movie ratings example dataset to be used for assignment A1Download
Movies largeCSVThe larger movie rating dataset to be used for assignment A1Download
MovieLens 100kCSVThe MovieLens 100k ratings dataset to be used for project P1Download
IrisARFF and CSVIris dataset to be used for assignment A4Download
BanknoteCSVBanknote authentication dataset to be used for assignment A4Download