General Zero-Inflated Models and Their Applications
Count data with excess zeros are commonly seen in experiments for improving electronics manufacturing quality, in medical research on HIV patients with high-risk behaviors, and in agricultural studies of the number of insects per leaf. Yip (1988) and Lambert (1992) proposed the zero-inflated Poisson distribution, and Heilbron (1989) used zero-altered Poisson and negative binomial distributions to model this type of data. Li, Lu, Park, Kim, Brinkley and Peterson (1999) derived a multivariate version of the zero-inflated Poisson distribution and applied it to detect equipment problems in electronics manufacturing processes. Zero-inflated distributions assume that with probability 1 - p the only possible observation is 0, and with probability p a random variable describing defect counts in the imperfect state is observed. For example, when manufacturing equipment is properly aligned (perfect state), there may be no defects. Otherwise, defects may occur according to the distribution of the imperfect state. The defect counts in the imperfect state could follow a Poisson, negative binomial, or other distribution, but most current research uses the Poisson distribution. Although the maximum likelihood (ML) method is widely used to estimate the parameters of zero-inflated distributions, there has been no theoretical study of the properties of the ML estimates. In Chapter 1, we propose a general framework for generalized zero-inflated models (ZIM), which assume only that the distribution of the imperfect state is supported on the nonnegative integers and satisfies appropriate regularity conditions. We study the properties of the ML estimates of the ZIM parameters, including their existence, uniqueness, strong consistency, and asymptotic normality under regularity conditions. Focusing on the univariate ZIM, we give detailed, rigorous proofs of the lemmas and theorems stated in the thesis. We then study covariate effects in univariate and multivariate zero-inflated regression models.
Because the zero-inflated model involves both the Bernoulli parameter p and the imperfect-state parameter lambda, building the model for each parameter separately does not use the information efficiently, and the resulting model is more complicated than needed. This problem gets worse in the multivariate ZIM, where the number of model terms increases drastically. Our procedure selects a limited number of important model terms to maximize the ZIM likelihood function. In Chapter 2, we review current research on zero-inflated Poisson models. Some new results on multivariate Poisson and multivariate zero-inflated Poisson distributions are given. By generalizing the results in Lambert (1992) and Li et al. (1999), we propose a multivariate zero-inflated Poisson regression model. An example from Nortel process development research is used to illustrate the model selection procedure for zero-inflated regression models and the computational details.
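The zero-inflated setup described in the abstract can be illustrated with a minimal Python sketch for the univariate Poisson case. It is not the thesis's own procedure: the function names (`zip_pmf`, `zip_loglik`, `fit_zip_em`) are hypothetical, the counts are made up for illustration, and a standard EM iteration is swapped in as one simple way to maximize the likelihood using only the standard library. The only thing taken from the abstract is the convention that the observation is 0 with probability 1 - p and follows the imperfect-state distribution with probability p.

```python
import math


def zip_pmf(y, p, lam):
    """P(Y = y) for a zero-inflated Poisson.

    Convention from the abstract: with probability 1 - p the process is in
    the perfect state (Y = 0); with probability p, Y ~ Poisson(lam).
    """
    poisson = math.exp(-lam) * lam ** y / math.factorial(y)
    return (1.0 - p) * (y == 0) + p * poisson


def zip_loglik(counts, p, lam):
    """Log-likelihood of i.i.d. counts under the zero-inflated Poisson."""
    return sum(math.log(zip_pmf(y, p, lam)) for y in counts)


def fit_zip_em(counts, n_iter=500, tol=1e-12):
    """Approximate the ML estimates (p, lam) with an EM iteration.

    Assumption: EM is used here only as an illustrative optimizer.
    """
    n = len(counts)
    n_zero = sum(1 for y in counts if y == 0)
    total = sum(counts)
    if n_zero == n:
        raise ValueError("all counts are zero; lam is not identifiable")

    # Starting values: treat every zero as coming from the perfect state.
    p = (n - n_zero) / n
    lam = total / (n - n_zero)

    for _ in range(n_iter):
        # E-step: probability that an observed zero came from the
        # imperfect (Poisson) state rather than the perfect state.
        w = p * math.exp(-lam) / ((1.0 - p) + p * math.exp(-lam))
        # M-step: update the mixing probability and the Poisson mean.
        p_new = (n - n_zero + n_zero * w) / n
        lam_new = total / (n * p_new)
        converged = abs(p_new - p) < tol and abs(lam_new - lam) < tol
        p, lam = p_new, lam_new
        if converged:
            break
    return p, lam


# Made-up defect counts with many zeros, for illustration only.
counts = [0] * 60 + [1] * 10 + [2] * 15 + [3] * 10 + [4] * 5
p_hat, lam_hat = fit_zip_em(counts)
print(p_hat, lam_hat, zip_loglik(counts, p_hat, lam_hat))
```

In this sketch a zero can come from either state, which is why the E-step splits the observed zeros between the perfect state and the Poisson component before the parameters are updated.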
Year of publication: 
2000-03-31


Authors:  Gan, Nianci 
Other Persons:  Jye-Chyi Lu (contributor) ; Anastasios Tsiatis (contributor) ; Sujit Ghosh (contributor) ; Matthias Stallmann (contributor) 
Similar items by person

Guo, Xiang, (2005)

A Generalized Estimator of the Attributable Benefit of an Optimal Treatment Regime
Brinkley, Jason, (2010)

Smooth Inference for Survival Functions with Arbitrarily Censored Data
Doehler, Kirsten Ann, (2006)