Data mining for longitudinal data under multicollinearity and time dependence using penalized generalized estimating equations
Penalized generalized estimating equations with Elastic Net or L2-Smoothly Clipped Absolute Deviation penalization are proposed to simultaneously select the most important variables and estimate their effects for longitudinal Gaussian data when multicollinearity is present. The method is able to consistently select and estimate the main effects even when strong correlations are present. In addition, the potential pitfall of time-dependent covariates is clarified. Both asymptotic theory and simulation results reveal the effectiveness of penalization as a data mining tool for longitudinal data, especially when a large number of variables is present. The method is illustrated by mining for the main determinants of life expectancy in Europe.
Year of publication: | 2014 |
---|---|
Authors: | Blommaert, A. ; Hens, N. ; Beutels, Ph. |
Published in: | Computational Statistics & Data Analysis. - Elsevier, ISSN 0167-9473. - Vol. 71.2014, C, p. 667-680 |
Publisher: | Elsevier |
Subject: | Covariate selection | Generalized estimating equations | Longitudinal data | Multicollinearity | Penalization | Time-dependent covariates |
Similar items by subject
- Efficient parameter estimation in longitudinal data analysis using a hybrid GEE method
  Leung, DHY, (2009)
- Pannenberg, Markus, (2007)
- Effects of variance-function misspecification in analysis of longitudinal data
  Wang, YG, (2005)
Similar items by person