Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables
Multiple imputation is a popular way to handle missing data. Automated procedures are widely available in standard software. However, such automated procedures may hide many assumptions and possible difficulties from the view of the data analyst. Imputation procedures such as monotone imputation and imputation by chained equations often involve the fitting of a regression model for a categorical outcome. If perfect prediction occurs in such a model, then automated procedures may give severely biased results. This is a problem in some standard software, but it may be avoided by bootstrap methods, penalised regression methods, or a new augmentation procedure.
Year of publication: |
2010
|
---|---|
Authors: | White, Ian R. ; Daniel, Rhian ; Royston, Patrick |
Published in: |
Computational Statistics & Data Analysis. - Elsevier, ISSN 0167-9473. - Vol. 54.2010, 10, p. 2267-2275
|
Publisher: |
Elsevier |
Keywords: | Missing data Multiple imputation Perfect prediction Separation |
Saved in:
Online Resource
Saved in favorites
Similar items by person
-
Multiple imputation of missing values: New features for mim
Royston, Patrick, (2009)
-
Multiple Imputation by Chained Equations (MICE): Implementation in Stata
Royston, Patrick, (2011)
-
gformula: Estimating causal effects in the presence of time-varying confounding or mediation
Daniel, Rhian, (2012)
- More ...