Biclustering of Gene Expression Data by an Extension of Mixtures of Factor Analyzers
A challenge in microarray data analysis concerns discovering local structures composed by sets of genes that show homogeneous expression patterns across subsets of conditions. We present an extension of the mixture of factor analyzers model (MFA) allowing for simultaneous clustering of genes and conditions. The proposed model is rather flexible since it models the density of high-dimensional data assuming a mixture of Gaussian distributions with a particular omponent-specific covariance structure. Specifically, a binary and row stochastic matrix representing tissue membership is used to cluster tissues (experimental conditions), whereas the traditional mixture approach is used to define the gene clustering. An alternating expectation conditional maximization (AECM) algorithm is proposed for parameter estimation; experiments on simulated and real data show the efficiency of our method as a general approach to biclustering. The Matlab code of the algorithm is available upon request from authors.
Year of publication: |
2008
|
---|---|
Authors: | Francesca, Martella ; Marco, Alfò ; Maurizio, Vichi |
Published in: |
The International Journal of Biostatistics. - De Gruyter, ISSN 1557-4679. - Vol. 4.2008, 1, p. 1-19
|
Publisher: |
De Gruyter |
Saved in:
Saved in favorites
Similar items by subject
-
Find similar items by using search terms and synonyms from our Thesaurus for Economics (STW).