Homework 3
Due Tuesday, November 22, 2005
Each of the problems should be solved on a separate sheet of paper to facilitate grading. Limit the solution of each problem to one sheet of paper unless otherwise stated. Many of these problems are deliberately open-ended and vague; do the best you can on them but don't make yourself crazy. Please don't wait until the last minute to look at the problems.
You must do this assignment in groups of 2 to 3 people.
Experiment with the clustering commands in your environment.
In particular, compare the results of average, complete, and
single-link clustering methods.
A dissimilarity matrix and
instructions on how to use the clustering commands will be posted.
at http://www.cs.sunysb.edu/
skiena/549 .
Write a one page report on your experiences.
Your job is to identify an appropriate distance function to measure similarly/dissimilarity among the objects, and then cluster them.
Experiment with different clustering algorithms and different cost functions. Do you get clusters that reflect some sense of reality? How does changing parameters and algorithms effect the quality of the clusters?
Write a three page paper on your experiments.