CSCI 379 Information Retrieval and Web Search (Fall 03)
Exercise One

Assigned: Monday September 8th
Due: Monday September 15th

  1. Choose any three of your favorite commercial search engines, issue the following one-term query and answer the questions.
    Queries Questions to answer

  2. Visit the site http://www.isc.org/ds and www.netcraft.com/Survey, summarize how these two surveys work (a paragraph or two each).

  3. Consider the description text of the above two exercises as two separate documents. After removing all the stopwords (by common sense or use an existing list), what is the word collection? If these two documents are represented as a word frequency vector, how does it look? If they are represented as a binary vector, how does it look?