Bayesian model averaging for estimating the number of classes: applications to the total number of species in metagenomics
The species abundance distribution and the total number of species are fundamental descriptors of the biodiversity of an ecological community. This paper focuses on situations where large numbers of rare species are not observed in the data set due to insufficient sampling of the community, as is the case in metagenomics for the study of microbial diversity. We use a truncated mixture model for the observations to explicitly tackle the missing data and propose methods to estimate the total number of species and, in particular, a Bayesian credibility interval for this number. We focus on computationally efficient procedures with variational methods and importance sampling as opposed to Markov Chain Monte Carlo sampling, and we use Bayesian model averaging as the number of components of the mixture model is unknown.
Year of publication: |
2012
|
---|---|
Authors: | Li-Thiao-Té, Sébastien ; Jean-Jacques, Daudin ; Stéphane, Robin |
Published in: |
Journal of Applied Statistics. - Taylor & Francis Journals, ISSN 0266-4763. - Vol. 39.2012, 7, p. 1489-1504
|
Publisher: |
Taylor & Francis Journals |
Saved in:
Saved in favorites
Similar items by person
-
Vincent, Vandenberghe, (2003)
-
Bart, COCKX, (2006)
-
Unsupervised Classification for Tiling Arrays: ChIP-chip and Transcriptome
Caroline, Bérard, (2011)
- More ...