Pub. Date: 2009 | Online Pub. Date: October 05, 2009 | DOI: 10.4135/9780857020994 | Print ISBN: 9781412930918 | Online ISBN: 9780857020994 | Publisher: SAGE Publications Ltd
Chapter 20: Cluster Analysis: A Toolbox for MATLAB
Lawrence J. Hubert & Hans-Friedrich Köhn & Douglas L. Steinley
A broad definition of clustering can be given as the search for homogeneous groupings of objects based on some type of available data. Two such tasks are now discussed in (almost) all multivariate analysis texts and implemented in the commercially available behavioral and social science statistical software suites: hierarchical clustering and the K-means partitioning of some set of objects. This chapter begins with a brief review of these topics using two data sets that are carried through the chapter for numerical illustration. Later sections develop hierarchical clustering through least-squares and the characterizing notion of an ultrametric; K-means partitioning is generalized by rephrasing it as an optimization problem of subdividing a given proximity matrix. In all instances, the MATLAB computational environment is relied on to effect our analyses, using the Statistics Toolbox, for example, to carry out the common ...