Clusters, outliers, and regression: fixed point clusters
Fixed point clustering is a new stochastic approach to cluster analysis. The definition of a single fixed point cluster (FPC) is based on a simple parametric model, but there is no parametric assumption for the whole dataset as opposed to mixture modeling and other approaches. An FPC is defined as a data subset that is exactly the set of non-outliers with respect to its own parameter estimators. This paper concentrates upon the theoretical foundation of FPC analysis as a method for clusterwise linear regression, i.e., the single clusters are modeled as linear regressions with normal errors. In this setup, fixed point clustering is based on an iteratively reweighted estimation with zero weight for all outliers. FPCs are non-hierarchical, but they may overlap and include each other. A specification of the number of clusters is not needed. Consistency results are given for certain mixture models of interest in cluster analysis. Convergence of a fixed point algorithm is shown. Application to a real dataset shows that fixed point clustering can highlight some other interesting features of datasets compared to maximum likelihood methods in the presence of deviations from the usual assumptions of model based cluster analysis.
Year of publication: |
2003
|
---|---|
Authors: | Hennig, Christian |
Published in: |
Journal of Multivariate Analysis. - Elsevier, ISSN 0047-259X. - Vol. 86.2003, 1, p. 183-212
|
Publisher: |
Elsevier |
Keywords: | Fixed point clusters Clusterwise linear regression Mixture model Outlier identification Redescending M-estimators |
Saved in:
Online Resource
Saved in favorites
Similar items by person
-
Effect of web page menu orientation on retrieving information by people with learning disabilities
Williams, Peter, (2015)
-
A smoothing principle for the Huber and other location M-estimators
Hampel, Frank, (2011)
-
A simulation study to compare robust clustering methods based on mixtures
Coretto, Pietro, (2010)
- More ...