Bootstrapping Goodness-of-Fit Measures in Categorical Data Analysis
When sparse data have to be fitted to a log-linear or latent class model, one cannot use the theoretical chi-square distribution to evaluate model fit, because with sparse data the observed cross-table has too many cells in relation to the number of observations to use a distribution that only holds asymptotically. The choice of a theoretical distribution is also difficult when model-expected frequencies are 0 or when model probabilities are estimated 0 or 1. The authors propose to solve these problems by estimating the distribution of a fit measure, using bootstrap methods. An algorithm is presented for estimating this distribution by drawing bootstrap samples from the model-expected proportions, the so-called nonnaive bootstrap method. For the first time the method is applied to empirical data of varying sparseness, from five different data sets. Results show that the asymptotic chi-square distribution is not at all valid for sparse data.
Year of publication: |
1996
|
---|---|
Authors: | LANGEHEINE, ROLF ; PANNEKOEK, JEROEN ; POL, FRANK VAN DE |
Published in: |
Sociological Methods & Research. - Vol. 24.1996, 4, p. 492-516
|
Saved in:
Saved in favorites
Similar items by person
-
A Unifying Framework for Markov Modeling in Discrete Space and Discrete Time
LANGEHEINE, ROLF, (1990)
-
State mastery learning: Dynamic models for longitudinal data
Langeheine, Rolf, (1994)
-
Log-lineare Modelle zur multivariaten Analyse qualitativer Daten : eine Einführung
Langeheine, Rolf, (1980)
- More ...