A flexible approximate likelihood ratio test for detecting differential expression in microarray data
Identifying differentially expressed genes in microarray data has been studied extensively and several methods have been proposed. Most popular methods in the study of gene expression microarray data analysis rely on normal distribution assumption and are based on a Wald statistic. These methods may be inefficient when expression levels follow a skewed distribution. To deal with possible violations of the normality assumption, we propose a method based on Generalized Logistic Distribution of Type II (GLDII). The motivation behind this distributional assumption is to allow longer tails than normal distribution. This is important in analyzing gene expression data since extreme values are common in such experiments. The shape parameter for GLDII allows flexibility in modeling a wide range of distributions. To simplify the computational complexity involved in carrying out Likelihood Ratio (LR) tests for several thousands of genes, an Approximate LR Test (ALRT) is proposed. We also generalize the two-class ALRT method to multi-class microarray data. The performance of the ALRT method under the GLDII assumption is compared to methods based on Wald-type statistics using simulation. The results from the simulations show that our method performs quite well compared to the significance analysis of microarrays (SAM) approach using standardized Wilcoxon rank statistics and the empirical Bayes (E-B) t-statistics. Our method is also less sensitive to extreme values. We illustrate our method using two publicly available gene expression data sets.
Year of publication: |
2009
|
---|---|
Authors: | Hossain, Ahmed ; Beyene, Joseph ; Willan, Andrew R. ; Hu, Pingzhao |
Published in: |
Computational Statistics & Data Analysis. - Elsevier, ISSN 0167-9473. - Vol. 53.2009, 10, p. 3685-3695
|
Publisher: |
Elsevier |
Saved in:
Online Resource
Saved in favorites
Similar items by person
-
Application of skew-normal distribution for detecting differential expression to microRNA data
Hossain, Ahmed, (2015)
-
Weighted kernel Fisher discriminant analysis for integrating heterogeneous data
Hamid, Jemila S., (2012)
-
A Multivariate Growth Curve Model for Ranking Genes in Replicated Time Course Microarray Data
Hamid, Jemila, (2009)
- More ...