Correcting the Estimated Level of Differential Expression for Gene Selection Bias: Application to a Microarray Study
The level of differential gene expression may be defined as a fold change, a frequency of upregulation, or some other measure of the degree or extent of a difference in expression across groups of interest. On the basis of expression data for hundreds or thousands of genes, inferring which genes are differentially expressed or ranking genes in order of priority introduces a bias in estimates of their differential expression levels. A previous correction of this feature selection bias suffers from a lack of generality in the method of ranking genes, from requiring many biological replicates, and from unnecessarily overcompensating for the bias. For any method of ranking genes on the basis of gene expression measured for as few as three biological replicates, a simple leave-one-out algorithm corrects, with less overcompensation, the bias in estimates of the level of differential gene expression. In a microarray data set, the bias correction reduces estimates of the probability of upregulation or downregulation from 100% to as low as 60%, even for genes with estimated local false discovery rates close to 0. A simulation study quantifies both the advantage of smoothing estimates of bias before correction and the degree of overcompensation.
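The selection bias described in the abstract can be illustrated with a minimal sketch. This is not the algorithm of the paper, whose details this record does not give; it is a generic leave-one-out estimate under stated assumptions: each gene has one log ratio per biological replicate, genes are ranked by mean log ratio, and the level of differential expression is the fraction of replicates showing upregulation. All function names (`rank_genes`, `naive_upregulation`, `loo_upregulation`, `simulate_null`) are hypothetical, and the sketch omits the smoothing step the abstract mentions.

```python
import random
from statistics import mean


def rank_genes(data, replicates):
    """Order gene indices by decreasing mean log ratio over the given replicates.

    Assumption: ranking by mean log ratio stands in for "any method of ranking".
    """
    scores = [mean(row[j] for j in replicates) for row in data]
    return sorted(range(len(data)), key=lambda g: scores[g], reverse=True)


def naive_upregulation(data, top_k):
    """Estimate the upregulation frequency of the top_k genes from the same
    replicates that were used to rank them (subject to selection bias)."""
    n = len(data[0])
    order = rank_genes(data, range(n))
    return [mean(1.0 if data[g][j] > 0 else 0.0 for j in range(n))
            for g in order[:top_k]]


def loo_upregulation(data, top_k):
    """Leave-one-out estimate per rank: rank genes without replicate j, then
    score the gene found at each rank on the held-out replicate j only."""
    n = len(data[0])
    held_out = [[] for _ in range(top_k)]
    for j in range(n):
        train = [i for i in range(n) if i != j]
        order = rank_genes(data, train)
        for r, g in enumerate(order[:top_k]):
            held_out[r].append(1.0 if data[g][j] > 0 else 0.0)
    return [mean(values) for values in held_out]


def simulate_null(n_genes, n_replicates, seed):
    """Simulated log ratios with no true differential expression (hypothetical data)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(n_replicates)]
            for _ in range(n_genes)]


if __name__ == "__main__":
    data = simulate_null(n_genes=2000, n_replicates=3, seed=0)
    print("naive mean:", mean(naive_upregulation(data, 50)))
    print("leave-one-out mean:", mean(loo_upregulation(data, 50)))
```

On simulated data with no true differential expression, the naive estimates for the top-ranked genes tend to sit near 100%, while the held-out estimates tend to sit near 50%, which is the kind of gap the abstract attributes to selection bias. With only three replicates each held-out estimate is an average of three indicators, so it is noisy, which is presumably why the abstract discusses smoothing.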
Year of publication: | 2008 |
---|---|
Authors: | Bickel David R. |
Published in: | Statistical Applications in Genetics and Molecular Biology. - De Gruyter, ISSN 1544-6115. - Vol. 7.2008, 1, p. 1-27 |
Publisher: | De Gruyter |
Similar items by person
- Bickel David R., (2015)
- Bickel David R., (2013)
- Estimators of the local false discovery rate designed for small numbers of tests
  Marta, Padilla, (2012)