Austin, Peter - In: Journal of Applied Statistics 35 (2008) 12, pp. 1355-1370
Prior studies have shown that automated variable selection results in models with substantially inflated estimates of the model R2, and that a large proportion of selected variables are truly noise variables. These earlier studies used simulated data sets whose sample sizes were at most 100. We...