On a resampling approach for tests on the number of clusters with mixture model-based clustering of tissue samples
We consider the problem of assessing the number of clusters in a limited number of tissue samples containing gene expressions for possibly several thousands of genes. It is proposed to use a normal mixture model-based approach to the clustering of the tissue samples. One advantage of this approach is that the question on the number of clusters in the data can be formulated in terms of a test on the smallest number of components in the mixture model compatible with the data. This test can be carried out on the basis of the likelihood ratio test statistic, using resampling to assess its null distribution. The effectiveness of this approach is demonstrated on simulated data and on some microarray datasets, as considered previously in the bioinformatics literature.
Year of publication: |
2004
|
---|---|
Authors: | McLachlan, G. J. ; Khan, N. |
Published in: |
Journal of Multivariate Analysis. - Elsevier, ISSN 0047-259X. - Vol. 90.2004, 1, p. 90-105
|
Publisher: |
Elsevier |
Keywords: | Microarray gene expression data Mixture models Clustering of tissue samples Tests on number of clusters Likelihood ratio statistic Resampling approach |
Saved in:
Saved in favorites
Similar items by person
-
The classification and mixture maximum likelihood approaches to cluster analysis
McLachlan, G. J., (1982)
-
Comparative study of energy saving light sources
Khan, N., (2011)
-
Public Policy Reforms on Development : Challenges Before Emerging Economies: Introduction
Khan, N., (2013)
- More ...