CSCI 379 Information Retrieval and Web Search (Fall 03)
Exercise One
Assigned: Monday, September 8th
Due: Monday, September 15th
- Choose any three of your favorite commercial search engines,
issue the following one-term query to each, and answer the questions below.
Queries
Questions to answer
- Which three search engines did you choose? Why?
- For each of the three search engines, how many of the first 20
returned results are relevant to what you searched for?
- Do any of these three search engines support clustering of results?
Does the clustering work well? What do you think of it?
- Visit the sites http://www.isc.org/ds and www.netcraft.com/Survey, and
summarize how these two surveys work (a paragraph or two each).
- Consider the description text of the above two exercises as two
separate documents. After removing all the stopwords (by common sense or
by using an existing list), what is the word collection? If these two
documents are represented as word frequency vectors, what do they look
like? If they are represented as binary vectors, what do they look like?
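
As an illustration of the last exercise, here is a minimal Python sketch of one way to build the word collection and both kinds of vectors. It uses two made-up sentences instead of the exercise text. The stopword list, the tokenization rule (lowercase, letters only), and the names tokenize and build_vectors are all assumptions made for this example and are not part of the assignment.

```python
import re
from collections import Counter

# Hypothetical stopword list, chosen only for this illustration.
STOPWORDS = {"the", "a", "of", "and", "to", "in", "is", "how"}


def tokenize(text):
    """Lowercase the text, keep runs of letters, and drop stopwords."""
    words = re.findall(r"[a-z]+", text.lower())
    return [w for w in words if w not in STOPWORDS]


def build_vectors(docs):
    """Return (vocabulary, frequency vectors, binary vectors) for docs."""
    token_lists = [tokenize(d) for d in docs]

    # The word collection: every distinct non-stopword term, sorted so
    # that each vector position always refers to the same word.
    vocabulary = sorted({w for tokens in token_lists for w in tokens})

    # Frequency vector: how often each vocabulary word occurs in a document.
    freq_vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        freq_vectors.append([counts[w] for w in vocabulary])

    # Binary vector: 1 if the word occurs at all, 0 otherwise.
    binary_vectors = [[1 if c > 0 else 0 for c in vec] for vec in freq_vectors]
    return vocabulary, freq_vectors, binary_vectors


# Two toy documents (not the exercise text).
docs = [
    "Search engines rank the search results.",
    "Surveys count the hosts in a domain.",
]
vocabulary, freq_vectors, binary_vectors = build_vectors(docs)

print(vocabulary)
for freq, binary in zip(freq_vectors, binary_vectors):
    print(freq, binary)
```

In the first toy document the word "search" occurs twice, so its entry is 2 in the frequency vector but 1 in the binary vector. That is the only position where the two representations differ in this example.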