Serial and parallel implementations of model-based clustering via parsimonious Gaussian mixture models
Model-based clustering using a family of Gaussian mixture models, with parsimonious factor-analysis-like covariance structure, is described and an efficient algorithm for its implementation is presented. This algorithm uses the alternating expectation-conditional maximization (AECM) variant of the expectation-maximization (EM) algorithm. Two central issues around the implementation of this family of models, namely model selection and convergence criteria, are discussed. These issues also have implications for other model-based clustering techniques and, more generally, for the implementation of techniques like the EM algorithm. The Bayesian information criterion (BIC) is used for model selection, and Aitken's acceleration, which is shown to outperform the lack-of-progress criterion, is used to determine convergence. A brief introduction to parallel computing is then given, and the algorithm is implemented in parallel within the master-slave paradigm. A simulation study is carried out to confirm the effectiveness of this parallelization. The resulting software is applied to two datasets to demonstrate its effectiveness when compared to existing software.
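
The contrast between the two convergence criteria named in the abstract can be illustrated with a minimal sketch. It is not taken from the paper's software: the function names, the tolerance `eps`, and the exact form of the stopping rule (asymptotic estimate minus the previous log-likelihood falling in `[0, eps)`) are assumptions made for illustration. Only the Aitken estimate of the limiting log-likelihood from three successive values is the standard formula.

```python
import math


def aitken_asymptote(l_prev, l_curr, l_next):
    """Aitken estimate of the limiting log-likelihood from three successive values."""
    prev_step = l_curr - l_prev
    next_step = l_next - l_curr
    if prev_step == 0.0:
        return l_next  # no movement between iterations, nothing to extrapolate
    a = next_step / prev_step  # Aitken acceleration: ratio of successive increments
    if a >= 1.0:
        return math.inf  # increments are not shrinking, so no finite limit is estimated
    return l_curr + next_step / (1.0 - a)


def aitken_converged(loglik, eps=1e-3):
    """Assumed stopping rule: estimated limit is within eps of the previous value."""
    if len(loglik) < 3:
        return False
    l_inf = aitken_asymptote(loglik[-3], loglik[-2], loglik[-1])
    return 0.0 <= l_inf - loglik[-2] < eps


def lack_of_progress(loglik, eps=1e-3):
    """Lack-of-progress rule: stop when the last increment is smaller than eps."""
    return len(loglik) >= 2 and (loglik[-1] - loglik[-2]) < eps


# Hypothetical slowly converging trace: l_k = -(0.99 ** k), with limit 0.
slow = [-(0.99 ** k) for k in range(240)]
print(lack_of_progress(slow))   # increments are tiny although the limit is still far off
print(aitken_converged(slow))   # the Aitken-based rule compares against the estimated limit
```

On a slowly converging sequence such as this one, the last increment can fall below the tolerance while the log-likelihood is still well short of its limit, which is the situation in which a lack-of-progress rule stops too early.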
| Year of publication: | 2010 |
|---|---|
| Authors: | McNicholas, P.D. ; Murphy, T.B. ; McDaid, A.F. ; Frost, D. |
| Published in: | Computational Statistics & Data Analysis. - Elsevier, ISSN 0167-9473. - Vol. 54.2010, 3, p. 711-723 |
| Publisher: | Elsevier |
Similar items by person
- Standardising the lift of an association rule. McNicholas, P.D., (2008)
- The assessment of patients' distress in Genito-Urinary Medicine Clinics. Fitzpatrick, R., (1987)
- Aspects of occupational change in the Irish economy : recent trends and future prospects. Sexton, J. J., (1998)