Multivariate Weibull mixtures with proportional hazard restrictions for dwell-time-based session clustering with incomplete data
Emanating from classical Weibull mixture models, we propose a framework for clustering survival data with various more parsimonious models by imposing restrictions on the distributional parameters. We show that these restrictions on the Weibull mixtures correspond to different proportional hazard restrictions across mixture components and Web page areas. A parametric cluster approach based on the EM algorithm is carried out on a multivariate data set. Our model set-up encompasses incomplete-data structures as well as censored observations. We apply the methodology to retail data stemming from a global e-commerce company. Sessions are clustered with respect to the dwell times that a user spends on certain page areas. The cluster solution that is found allows for a detailed examination of the navigation behaviour in terms of the hazard and survivor functions within each component. Copyright (c) 2009 Royal Statistical Society.
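The link between parameter restrictions and proportional hazards mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: it assumes the common scale/shape parametrization of the Weibull hazard, h(t) = (shape/scale) * (t/scale)^(shape-1), and the function names and parameter values are hypothetical. Under that assumption, two components that share a shape parameter have a hazard ratio that does not depend on t, while components with different shapes do not.

```python
def weibull_hazard(t, scale, shape):
    """Weibull hazard under the assumed scale/shape parametrization."""
    return (shape / scale) * (t / scale) ** (shape - 1)


def hazard_ratio(t, comp_a, comp_b):
    """Ratio of the hazards of two components, each given as (scale, shape)."""
    return weibull_hazard(t, *comp_a) / weibull_hazard(t, *comp_b)


# Hypothetical dwell times (e.g. seconds) at which the ratio is evaluated.
times = [1.0, 10.0, 100.0]

# Restricted case: both components share the shape 1.5, so the ratio is
# the same at every t (proportional hazards).
shared_shape = [hazard_ratio(t, (20.0, 1.5), (40.0, 1.5)) for t in times]

# Unrestricted case: the shapes differ, so the ratio changes with t.
free_shape = [hazard_ratio(t, (20.0, 1.5), (40.0, 0.8)) for t in times]

print(shared_shape)
print(free_shape)
```

With a shared shape, the ratio reduces to (scale_b / scale_a) ** shape, which is why fixing the shape across mixture components amounts to a proportional hazard restriction in this parametrization.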
Year of publication: | 2009 |
---|---|
Authors: | Mair, Patrick ; Hudec, Marcus |
Published in: | Journal of the Royal Statistical Society Series C. - Royal Statistical Society - RSS, ISSN 0035-9254. - Vol. 58.2009, 5, p. 619-639 |
Publisher: | Royal Statistical Society - RSS |
Similar items by person
- Mair, Patrick, (2009)
- Computergestützte Inhaltsanalyse : ein Modell für die Printmedien
  Lederer, Brigitte, (1992)
- Web usage mining in e-commerce
  Grossmann, Wilfried, (2004)