Complexity-penalized estimation of minimum volume sets for dependent data
A minimum volume (MV) set, at level [alpha], is a set having minimum volume among all those sets containing at least [alpha] probability mass. MV sets provide a natural notion of the 'central mass' of a distribution and, as such, have recently become popular as a tool for the detection of anomalies in multivariate data. Motivated by the fact that anomaly detection problems frequently arise in settings with temporally indexed measurements, we propose here a new method for the estimation of MV sets from dependent data. Our method is based on the concept of complexity-penalized estimation, extending recent work of Scott and Nowak for the case of independent and identically distributed measurements, and has both desirable theoretical properties and a practical implementation. Of particular note is the fact that, for a large class of stochastic processes, choice of an appropriate complexity penalty reduces to the selection of a single tuning parameter, which represents the data dependency of the underlying stochastic process. While in reality the dependence structure is unknown, we offer a data-dependent method for selecting this parameter, based on subsampling principles. Our work is motivated by and illustrated through an application to the detection of anomalous traffic levels in Internet traffic time series.
Year of publication: |
2010
|
---|---|
Authors: | Di, J. ; Kolaczyk, E. |
Published in: |
Journal of Multivariate Analysis. - Elsevier, ISSN 0047-259X. - Vol. 101.2010, 9, p. 1910-1926
|
Publisher: |
Elsevier |
Keywords: | Anomaly detection Strong-mixing process Multivariate data Nonparametric |
Saved in:
Saved in favorites
Similar items by subject
-
Find similar items by using search terms and synonyms from our Thesaurus for Economics (STW).