Datasets
Here you can download the datasets used in the course.
Dataset | Format | Description | Link |
---|---|---|---|
Wikipedia | Text | Larger set of Wikipedia articles (396 about Programming and 245 about Games) to be used for assignment A3 | Download |
Wikipedia 300 | ARFF and CSV | Wikipedia articles (150 about Programming and 150 about Games) to be used for project P3 | Download |
Wikipedia clustering | Text | Wikipedia articles (90 about Programming and 90 about Games) to be used for project P2 | Download |
Blog data | Text | The blog dataset with 99 blogs to be used for assignment A2 | Download |
Movies example | CSV | The movie ratings example dataset to be used for assignment A1 | Download |
Movies large | CSV | The larger movie rating dataset to be used for assignment A1 | Download |
MovieLens 100k | CSV | The MovieLens 100k ratings dataset to be used for project P1 | Download |
Iris | ARFF and CSV | Iris dataset to be used for assignment A4 | Download |
Banknote | CSV | Banknote authentication dataset to be used for assignment A4 | Download |