Web Information Retrieval

Xiannong Meng
Computer Science Department
Bucknell University
Lewisburg, PA 17837
May 2014

This page supports a short, intensive course on web information retrieval with about 36 hours of meeting time.

Syllabus

Syllabus

Tentative schedule, along with project and homework due dates.

Textbooks

See the syllabus for the main textbook and a list of references.

The main textbook is Introduction to Information Retrieval by Manning, Raghavan, and Schütze, Cambridge University Press, 2008.
The textbook is available online at http://nlp.stanford.edu/IR-book/information-retrieval-book.html

Lecture Notes

Follow this link for course notes in PowerPoint and this link for course notes in PDF.

Programming Project

Project Description

Other Assignments

Exercises

  1. Homework 1
  2. Homework 2
  3. Homework 3
  4. Homework 4
  5. Homework 5

Project Presentation

Submission Instructions

Please follow these instructions when submitting your work.

Learning Outcomes

After completing the course, students will be able to

  1. Explain basic information retrieval models such as the vector space model and the probabilistic model
  2. Evaluate the performance of information retrieval systems
  3. Analyze information retrieval systems such as web search engines using the principles of IR
  4. Design and implement a simple web search engine
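
To make the first two outcomes concrete, here is a minimal sketch in Python of vector space ranking with TF-IDF weights and cosine similarity, followed by a precision and recall computation. It is not taken from the course materials: the function names (tokenize, build_index, search, precision_recall), the whitespace tokenizer, the log-scaled term frequency, and the three sample documents are all assumptions chosen for illustration.

```python
import math
from collections import Counter

# Hypothetical toy collection, used only for illustration.
DOCS = [
    "web search engines crawl and index web pages",
    "the vector space model ranks documents by cosine similarity",
    "probabilistic models estimate the probability of relevance",
]

def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()

def weight(tokens, idf):
    """Build a sparse TF-IDF vector; terms unknown to the collection are dropped."""
    tf = Counter(tokens)
    return {t: (1 + math.log(c)) * idf[t] for t, c in tf.items() if t in idf}

def build_index(docs):
    """Return (idf, vectors): inverse document frequencies and one vector per document."""
    tokenized = [tokenize(d) for d in docs]
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))  # count each term once per document
    idf = {t: math.log(len(docs) / n) for t, n in df.items()}
    return idf, [weight(tokens, idf) for tokens in tokenized]

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def search(query, idf, vectors):
    """Return indices of documents with a positive score, best match first."""
    q = weight(tokenize(query), idf)
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    ranked = sorted(scored, key=lambda pair: (-pair[0], pair[1]))
    return [i for score, i in ranked if score > 0]

def precision_recall(retrieved, relevant):
    """Set-based precision and recall for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

if __name__ == "__main__":
    idf, vectors = build_index(DOCS)
    results = search("vector space model", idf, vectors)
    print(results)
    # Assume, for the sake of the example, that documents 1 and 2 were judged relevant.
    print(precision_recall(results, [1, 2]))
```

A real system would add stemming, stop-word handling, and an inverted index rather than scoring every document; the sketch only shows how the ranking and the evaluation measures fit together.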

Relevant Courses at Other Universities

Here is a partial list of information retrieval and web search courses at various universities.

List of Textbooks

A partial list of IR-related textbooks.

Code Examples

This directory contains code examples that may be useful.

If you'd like to parse and search Chinese text, the files in this directory can get you started.