On the Dependence Structure of Sequence Alignment Scores Calculated with Multiple Scoring Matrices
A common practice in protein sequence alignment is to try several scoring matrices until ``something interesting'' is found. This leads to a multiple testing problem making p- and E-values hard to interpret. We focus on local alignment and propose to use logistic copula functions to model explicitly the dependence structure of scores obtained using different scoring matrices. By doing this, we obtain p-value correction factors when using more than one scoring matrix on the same sequences. Furthermore the parameter of the logistic copula can be interpreted as measure of dependence, providing insight concerning the relatedness of the scores from different matrices.
Year of publication: |
2004
|
---|---|
Authors: | Florian, Frommlet ; Andreas, Futschik |
Published in: |
Statistical Applications in Genetics and Molecular Biology. - De Gruyter, ISSN 1544-6115. - Vol. 3.2004, 1, p. 1-14
|
Publisher: |
De Gruyter |
Saved in:
Saved in favorites
Similar items by person
-
QTL Mapping Using a Memetic Algorithm with Modifications of BIC as Fitness Function
Florian, Frommlet, (2012)
-
DNA Pooling and Statistical Tests for the Detection of Single Nucleotide Polymorphisms
Ramsey David M., (2012)
- More ...