Entropy based constrained inference for some HDLSS genomic models: UI tests in a Chen-Stein perspective
For qualitative data models, Gini-Simpson index and Shannon entropy are commonly used for statistical analysis. In the context of high-dimensional low-sample size (HDLSS) categorical models, abundant in genomics and bioinformatics, the Gini-Simpson index, as extended to Hamming distance in a pseudo-marginal setup, facilitates drawing suitable statistical conclusions. Under Lorenz ordering it is shown that Shannon entropy and its multivariate analogues proposed here appear to be more informative than the Gini-Simpson index. The nested subset monotonicity prospect along with subgroup decomposability of some proposed measures are exploited. The usual jackknifing (or bootstrapping) methods may not work out well for HDLSS constrained models. Hence, we consider a permutation method incorporating the union-intersection (UI) principle and Chen-Stein Theorem to formulate suitable statistical hypothesis testing procedures for gene classification. Some applications are included as illustration.
Year of publication: |
2010
|
---|---|
Authors: | Tsai, Ming-Tien ; Sen, Pranab Kumar |
Published in: |
Journal of Multivariate Analysis. - Elsevier, ISSN 0047-259X. - Vol. 101.2010, 7, p. 1559-1573
|
Publisher: |
Elsevier |
Keywords: | Chen-Stein Theorem Hamming-Shannon pooled measure Lorenz ordering Ordered alternatives Permutation jackknife Subgroup decomposability Union-intersection principle |
Saved in:
Saved in favorites
Similar items by person
-
Sen, Pranab Kumar, (2007)
-
On inadmissibility of Hotelling T2-tests for restricted alternatives
Tsai, Ming-Tien, (2004)
-
Locally best rotation-invariant rank tests for modal location
Tsai, Ming-Tien, (2007)
- More ...