Non-parametric estimation of data dimensionality prior to data compression: the case of the human development index
In many applications in applied statistics, researchers reduce the complexity of a data set by combining a group of variables into a single measure using a factor analysis or an index number. We argue that such compression loses information if the data actually have high dimensionality. We advocate the use of a non-parametric estimator, commonly used in physics (the <italic>Takens estimator</italic>), to estimate the correlation dimension of the data prior to compression. The advantage of this approach over traditional linear data compression approaches is that the data do not have to be linearised. Applying our ideas to the United Nations Human Development Index, we find that the four variables that are used in its construction have dimension 3 and the index loses information.
Year of publication: |
2013
|
---|---|
Authors: | Canning, David ; French, Declan ; Moore, Michael |
Published in: |
Journal of Applied Statistics. - Taylor & Francis Journals, ISSN 0266-4763. - Vol. 40.2013, 9, p. 1853-1863
|
Publisher: |
Taylor & Francis Journals |
Saved in:
Online Resource
Saved in favorites
Similar items by person
-
IS HUMAN DEVELOPMENT MULTIDIMENSIONAL?
French, Declan, (2013)
-
The Economics of Fertility Timing : An Euler Equation Approach
Canning, David, (2014)
-
Is human development multidimensional?
French, Declan, (2013)
- More ...