A3 – Search Engine

In Assignment 3 you shall implement a basic search engine for Wikipedia articles in any programming language you like. You can work alone or in a group of two students. You shall present your application and code at an oral examination.

Grade Requirements
E
  • Implement a basic search engine that indexes all pages in the Wikipedia dataset (see Datasets page)
  • Search queries shall only contain single words
  • Results shall be ranked using the word frequency metric
  • Implement the search engine using a RESTful web service as back-end, and a browser client GUI as front-end
  • The user shall enter search queries in the web client, which shall display the search results returned from the server
  • Display the top 5 search results with page and rank score
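
As an illustration of the word frequency ranking for grade E, here is a minimal sketch in Python. The `pages` dictionary, the function names and the whitespace tokenization are assumptions made for this example; the real dataset layout and your own design may differ. The score is the raw number of times the query word occurs on the page.

```python
from collections import Counter


def build_index(pages):
    """Map each page name to a Counter of its lowercased words.

    `pages` is a hypothetical dict {page_name: page_text}.
    """
    return {name: Counter(text.lower().split()) for name, text in pages.items()}


def search_single_word(index, word, top_n=5):
    """Rank pages by the raw count of `word` and return the top results."""
    word = word.lower()
    scored = [
        (name, counts[word])
        for name, counts in index.items()
        if counts[word] > 0
    ]
    # Highest count first; ties are broken by page name for stable output.
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:top_n]


# Tiny made-up dataset, only for demonstration.
pages = {
    "Java": "java is a language java runs on the jvm",
    "Python": "python is a language",
    "Coffee": "java is also coffee",
}
index = build_index(pages)
results = search_single_word(index, "java")
print(results)
```

In a full solution the index would be built once when the server starts, and the REST endpoint would only call the search function.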
C-D
  • It shall be possible to use search queries of more than one word
  • Results shall be ranked using:
    score = word_frequency + 0.8 * document_location
  • Display the top 5 search results with page and rank score
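
The grade C-D formula can be sketched as follows. Several details here are assumptions, not part of the requirements above: document location is taken as the sum of the 1-based positions of the first occurrence of each query word, a missing word gets a large penalty value, and both metrics are normalized to the range 0 to 1 before they are combined (frequency divided by the highest value, location as lowest value divided by the page's value, since a lower location is better). Check the lecture slides for the exact definitions you are expected to use.

```python
def word_frequency(words, query):
    """Total number of occurrences of all query words on the page."""
    return sum(words.count(q) for q in query)


def document_location(words, query, missing_penalty=100_000):
    """Sum of 1-based first positions of the query words (lower is better).

    `missing_penalty` is an assumed value for words not on the page.
    """
    total = 0
    for q in query:
        total += words.index(q) + 1 if q in words else missing_penalty
    return total


def normalize(scores, small_is_better):
    """Scale scores to (0, 1], where 1 is always the best value."""
    if small_is_better:
        lowest = min(scores)
        return [lowest / max(s, 1e-5) for s in scores]
    highest = max(max(scores), 1e-5)
    return [s / highest for s in scores]


def search(pages, query_text, top_n=5):
    """Rank pages with score = word_frequency + 0.8 * document_location."""
    query = query_text.lower().split()
    names, freq, loc = [], [], []
    for name, text in pages.items():
        words = text.lower().split()
        f = word_frequency(words, query)
        if f == 0:
            continue  # skip pages that contain none of the query words
        names.append(name)
        freq.append(f)
        loc.append(document_location(words, query))
    if not names:
        return []
    freq_n = normalize(freq, small_is_better=False)
    loc_n = normalize(loc, small_is_better=True)
    scored = [(n, f + 0.8 * l) for n, f, l in zip(names, freq_n, loc_n)]
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:top_n]


# Tiny made-up dataset, only for demonstration.
pages = {
    "A": "super mario is a game",
    "B": "a game called super mario",
    "C": "nothing here",
}
results = search(pages, "super mario")
print(results)
```

Both pages contain each query word once, so page A ranks first only because the words appear earlier in its text.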
A-B
  • Implement the PageRank algorithm and use it to rank the search results
  • Run the algorithm for 20 iterations
  • Results shall be ranked using:
    score = word_frequency + 0.8 * document_location + 0.5 * pagerank
  • Display the top 5 search results with page and rank score
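
A possible sketch of the PageRank part for grade A-B is shown below. The `links` structure, the damping factor of 0.85, the start value of 1.0 and the final normalization by the highest rank are assumptions for this example; only the 20 iterations and the weights in the score formula come from the requirements above.

```python
def pagerank(links, iterations=20, damping=0.85):
    """Iteratively compute PageRank.

    `links` is a hypothetical dict {page: set of pages it links to}.
    Each iteration uses PR(p) = (1 - d) + d * sum(PR(q) / L(q)) over all
    pages q that link to p, where L(q) is the number of outgoing links of q.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    ranks = {p: 1.0 for p in pages}
    for _ in range(iterations):
        new_ranks = {}
        for page in pages:
            incoming = sum(
                ranks[src] / len(targets)
                for src, targets in links.items()
                if page in targets
            )
            new_ranks[page] = (1 - damping) + damping * incoming
        ranks = new_ranks
    return ranks


def normalize_ranks(ranks):
    """Scale ranks so that the highest-ranked page gets 1.0."""
    highest = max(ranks.values())
    return {p: r / highest for p, r in ranks.items()}


def combined_score(word_frequency, document_location, pagerank_score):
    """Combine three already normalized metrics as in the A-B formula."""
    return word_frequency + 0.8 * document_location + 0.5 * pagerank_score


# Tiny made-up link graph, only for demonstration.
links = {"A": {"B"}, "B": {"A"}, "C": {"A"}}
ranks = pagerank(links)
normalized = normalize_ranks(ranks)
print(normalized)
```

Since PageRank does not depend on the query, it can be computed once when the index is built and stored with each page.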

The last part of the lecture shows some search query results that you can use to verify that your application works. Note that the dataset has since been updated, so the results shown in the last part of the recording are no longer accurate. The slides PDF has been updated to match the new dataset.
