On the measure and the estimation of evenness and diversity
Modelling word or species frequency count data with zero-truncated Poisson mixture models allows one to interpret the mixing distribution of the model as the distribution of the word or species frequencies in the vocabulary or population. As a consequence, estimates of the mixing density can be used as a fingerprint of the style of an author in his or her texts, or of an ecosystem in its samples. Measures of the evenness and of the diversity within a vocabulary or population are defined, and the novelty of these definitions is explained. It is then proposed that the evenness and the diversity of a vocabulary or population be approximated by the expectation of these measures under the word or species frequency distribution. This leads to assessing the lack of diversity through measures of the variability of the estimated mixing frequency distribution.
| Year of publication: | 2010 |
|---|---|
| Authors: | Ginebra, Josep ; Puig, Xavier |
| Published in: | Computational Statistics & Data Analysis. - Elsevier, ISSN 0167-9473. - Vol. 54.2010, 9, p. 2187-2201 |
| Publisher: | Elsevier |
| Keywords: | Overdispersion ; Population size ; Poisson mixture ; Schur concavity ; Sichel model ; Species distribution ; Stylometry ; Uncertainty ; Vocabulary distribution |
Online Resource
Similar items by person
- A cluster analysis of vote transitions. Puig, Xavier, (2014)
- A Bayesian cluster analysis of election results. Puig, Xavier, (2014)
- The Sichel model and the mixing and truncation order. Puig, Xavier, (2010)