L04 - Search Engines

Contents

  • Introduction to Search Engines
  • Indexing
  • Content-Based Ranking
  • Inbound-Link Ranking
  • PageRank Algorithm

Programming Collective Intelligence

Mining of Massive Datasets

Lecture slides

Available for download here.

Recording

Note! The Pearson pseudocode in the recording contains a small error: its function declaration takes a blog and a user, but Pearson similarity should be calculated between two blogs. The slides have been updated with the correct function declaration.
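For reference, a minimal Python sketch of a Pearson similarity function that takes two blogs as arguments. It assumes each blog is represented as a list of word counts over the same vocabulary; that representation and the names `pearson`, `blog_a` and `blog_b` are illustrative assumptions, not taken from the slides.

```python
from math import sqrt


def pearson(blog_a, blog_b):
    """Pearson correlation between two blogs.

    Assumption: each blog is a list of word counts, and both lists
    cover the same words in the same order.
    """
    n = len(blog_a)
    if n == 0 or n != len(blog_b):
        raise ValueError("both blogs need the same, non-zero number of word counts")

    mean_a = sum(blog_a) / n
    mean_b = sum(blog_b) / n

    # Sum of products of deviations from the mean (unnormalised covariance).
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(blog_a, blog_b))
    # Sums of squared deviations (unnormalised variances).
    var_a = sum((a - mean_a) ** 2 for a in blog_a)
    var_b = sum((b - mean_b) ** 2 for b in blog_b)

    denom = sqrt(var_a * var_b)
    if denom == 0:
        # One blog has identical counts for every word, so the
        # correlation is undefined; this sketch returns 0.0 in that case.
        return 0.0
    return cov / denom
```

The result lies between -1 and 1, with values near 1 meaning the two blogs use words in similar proportions. If a distance is needed rather than a similarity, one common choice is `1.0 - pearson(blog_a, blog_b)`.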