Theory and inference for regression models with missing responses and covariates
In this paper, we carry out an in-depth theoretical investigation for inference with missing response and covariate data for general regression models. We assume that the missing data are missing at random (MAR) or missing completely at random (MCAR) throughout. Previous theoretical investigations in the literature have focused only on missing covariates or missing responses, but not both. Here, we consider theoretical properties of the estimates under three different estimation settings: complete case (CC) analysis, a complete response (CR) analysis that involves an analysis of those subjects with only completely observed responses, and the all case (AC) analysis, which is an analysis based on all of the cases. Under each scenario, we derive general expressions for the likelihood and devise estimation schemes based on the EM algorithm. We carry out a theoretical investigation of the three estimation methods in the normal linear model and analytically characterize the loss of information for each method, as well as derive and compare the asymptotic variances for each method assuming the missing data are MAR or MCAR. In addition, a theoretical investigation of bias for the CC method is also carried out. A simulation study and real dataset are given to illustrate the methodology.
Year of publication: |
2008
|
---|---|
Authors: | Chen, Qingxia ; Ibrahim, Joseph G. ; Chen, Ming-Hui ; Senchaudhuri, Pralay |
Published in: |
Journal of Multivariate Analysis. - Elsevier, ISSN 0047-259X. - Vol. 99.2008, 6, p. 1302-1331
|
Publisher: |
Elsevier |
Saved in:
Saved in favorites
Similar items by person
-
Zeng, Donglin, (2012)
-
Semiparametric Models for Missing Covariate and Response Data in Regression Models
Chen, Qingxia, (2006)
-
Gamma frailty transformation models for multivariate survival times
Zeng, Donglin, (2009)
- More ...