Data sets

Here you can download the data sets used in the course.

Dataset Format Description Link
Wikipedia Text Larger set of Wikipedia articles (400 about Programming and 250 about Games) to be used for the search engine assignment Download
Wikipedia 300 ARFF and CSV Wikipedia articles (150 about Programming and 150 about Games) to be used for the P3 project Download
Wikipedia clustering Text Wikipedia articles (90 about Programming and 90 about Games) to be used for the P2 project Download
Spiral ARFF and CSV A two-dimensional dataset with three spiral arms to be used in the machine learning assignment Download
Blog data Text The blog data set you shall use for the clustering assignments Download
Movie ratings CSV The movie ratings data set you shall use for the recommendation system assignment Download
MovieLens 100k CSV The MovieLens 100k ratings dataset to be used for the P1 project Download

Welcome to CoursePress

en utav Linnéuniversitets lärplattformar. Som inloggad student kan du kommunicera, hålla koll på dina kurser och mycket mer. Du som är gäst kan nå de flesta kurser och dess innehåll utan att logga in.

Läs mer lärplattformar vid Linnéuniversitetet

Student account

To log in you need a student account at Linnaeus University.

Read more about collecting your account

Log in LNU