Imputation of missing values for compositional data using classical and robust methods
New imputation algorithms for estimating missing values in compositional data are introduced. A first proposal uses the k-nearest neighbor procedure based on the Aitchison distance, a distance measure especially designed for compositional data. It is important to adjust the estimated missing values to the overall size of the compositional parts of the neighbors. As a second proposal an iterative model-based imputation technique is introduced which initially starts from the result of the proposed k-nearest neighbor procedure. The method is based on iterative regressions, thereby accounting for the whole multivariate data information. The regressions have to be performed in a transformed space, and depending on the data quality classical or robust regression techniques can be employed. The proposed methods are tested on a real and on simulated data sets. The results show that the proposed methods outperform standard imputation methods. In the presence of outliers, the model-based method with robust regressions is preferable.
Year of publication: |
2010
|
---|---|
Authors: | Hron, K. ; Templ, M. ; Filzmoser, P. |
Published in: |
Computational Statistics & Data Analysis. - Elsevier, ISSN 0167-9473. - Vol. 54.2010, 12, p. 3095-3107
|
Publisher: |
Elsevier |
Saved in:
Online Resource
Saved in favorites
Similar items by person
-
Model-based replacement of rounded zeros in compositional data: Classical and robust approaches
Martín-Fernández, J.A., (2012)
-
Logratio approach to statistical analysis of 2×2 compositional tables
Fačevicová, K., (2014)
-
Linear regression with compositional explanatory variables
Hron, K., (2012)
- More ...