Finding predictive gene groups from microarray data
Microarray experiments generate large datasets with expression values for thousands of genes, but not more than a few dozens of samples. A challenging task with these data is to reveal groups of genes which act together and whose collective expression is strongly associated with an outcome variable of interest. To find these groups, we suggest the use of supervised algorithms: these are procedures which use external information about the response variable for grouping the genes. We present Pelora, an algorithm based on penalized logistic regression analysis, that combines gene selection, gene grouping and sample classification in a supervised, simultaneous way. With an empirical study on six different microarray datasets, we show that Pelora identifies gene groups whose expression centroids have very good predictive potential and yield results that can keep up with state-of-the-art classification methods based on single genes. Thus, our gene groups can be beneficial in medical diagnostics and prognostics, but they may also provide more biological insights into gene function and regulation.
Year of publication: |
2004
|
---|---|
Authors: | Dettling, Marcel ; Bühlmann, Peter |
Published in: |
Journal of Multivariate Analysis. - Elsevier, ISSN 0047-259X. - Vol. 90.2004, 1, p. 106-131
|
Publisher: |
Elsevier |
Keywords: | Gene expression Penalized logistic regression Dimension reduction Sample classification |
Saved in:
Saved in favorites
Similar items by person
-
Volatility and risk estimation with linear and nonlinear methods based on high frequency data
Dettling, Marcel, (2004)
-
Volatility and risk estimation with linear and nonlinear methods based on high frequency data
Dettling, Marcel, (2004)
-
Volatility and risk estimation with linear and nonlinear methods based on high frequency data
Dettling, Marcel, (2004)
- More ...