A new approach to cluster analysis: the clustering-function-based method
The purpose of the paper is to present a new statistical approach to hierarchical cluster analysis with "n" objects measured on "p" variables. Motivated by the model of multivariate analysis of variance and the method of maximum likelihood, a clustering problem is formulated as a least squares optimization problem, simultaneously solving for both an "n"-vector of "unknown" group membership of objects and a linear clustering function. This formulation is shown to be linked to linear regression analysis and Fisher linear discriminant analysis and includes principal component regression for tackling multicollinearity or rank deficiency, polynomial or "B"-splines regression for handling non-linearity and various variable selection methods to eliminate irrelevant variables from data analysis. Algorithmic issues are investigated by using sign eigenanalysis. Copyright 2006 Royal Statistical Society.
Year of publication: |
2006
|
---|---|
Authors: | Li, Baibing |
Published in: |
Journal of the Royal Statistical Society Series B. - Royal Statistical Society - RSS, ISSN 1369-7412. - Vol. 68.2006, 3, p. 457-476
|
Publisher: |
Royal Statistical Society - RSS |
Saved in:
Saved in favorites
Similar items by person
-
On hedge fund inceptions in a competitive market
Ma, Tianyi, (2023)
-
Information and capital asset pricing
Li, Baibing, (2011)
-
Online pricing dynamics in internet retailing : the case of the DVD market
Li, Baibing, (2011)
- More ...