Oracle inequalities for multi-fold cross validation
We consider choosing an estimator or model from a given class by cross validation consisting of holding a nonneglible fraction of the observations out as a test set. We derive bounds that show that the risk of the resulting procedure is (up to a constant) smaller than the risk of an oracle plus an error which typically grows logarithmically with the number of estimators in the class. We extend the results to penalized cross validation in order to control unbounded loss functions. Applications include regression with squared and absolute deviation loss and classification under Tsybakov’s condition.
Year of publication: |
2006
|
---|---|
Authors: | der, Vaart Aad W. van ; Sandrine, Dudoit ; der, Laan Mark J. van |
Published in: |
Statistics & Risk Modeling. - De Gruyter. - Vol. 24.2006, 3, p. 21-21
|
Publisher: |
De Gruyter |
Saved in:
Saved in favorites
Similar items by person
-
The cross-validated adaptive epsilon-net estimator
der, Laan Mark J. van, (2006)
-
Special Issue on Computational Statistical Methods for Genomics and Systems Biology
Aurélie, Labbe, (2012)
-
van der Laan Mark J., (2004)
- More ...