Robustness of reweighted Least Squares Kernel Based Regression
Kernel Based Regression (KBR) minimizes a convex risk over a possibly infinite dimensional reproducing kernel Hilbert space. Recently, it was shown that KBR with a least squares loss function may have some undesirable properties from a robustness point of view: even very small amounts of outliers can dramatically affect the estimates. KBR with other loss functions is more robust, but often gives rise to more complicated computations (e.g. for Huber or logistic losses). In classical statistics robustness is often improved by reweighting the original estimate. In this paper we provide a theoretical framework for reweighted Least Squares KBR (LS-KBR) and analyze its robustness. Some important differences are found with respect to linear regression, indicating that LS-KBR with a bounded kernel is much more suited for reweighting. In two special cases our results can be translated into practical guidelines for a good choice of weights, providing robustness as well as fast convergence. In particular a logistic weight function seems an appropriate choice, not only to downweight outliers, but also to improve performance at heavy tailed distributions. For the latter some heuristic arguments are given comparing concepts from robustness and stability.
Year of publication: |
2010
|
---|---|
Authors: | Debruyne, Michiel ; Christmann, Andreas ; Hubert, Mia ; Suykens, Johan A.K. |
Published in: |
Journal of Multivariate Analysis. - Elsevier, ISSN 0047-259X. - Vol. 101.2010, 2, p. 447-463
|
Publisher: |
Elsevier |
Keywords: | Robust statistics Kernel methods Kernel based regression Reweighting Influence function |
Saved in:
Saved in favorites
Similar items by person
-
Detecting influential observations in Kernel PCA
Debruyne, Michiel, (2010)
-
Robust Learning from Bites for Data Mining
Christmann, Andreas, (2006)
-
Robust learning from bites for data mining
Christmann, Andreas, (2007)
- More ...