Sparsity and smoothness via the fused lasso
The lasso penalizes a least squares regression by the sum of the absolute values ("L"<sub>1</sub>-norm) of the coefficients. The form of this penalty encourages sparse solutions (with many coefficients equal to 0). We propose the 'fused lasso', a generalization that is designed for problems with features that can be ordered in some meaningful way. The fused lasso penalizes the "L"<sub>1</sub>-norm of both the coefficients and their successive differences. Thus it encourages sparsity of the coefficients and also sparsity of their differences-i.e. local constancy of the coefficient profile. The fused lasso is especially useful when the number of features "p" is much greater than "N", the sample size. The technique is also extended to the 'hinge' loss function that underlies the support vector classifier. We illustrate the methods on examples from protein mass spectroscopy and gene expression data. Copyright 2005 Royal Statistical Society.
Year of publication: |
2005
|
---|---|
Authors: | Tibshirani, Robert ; Saunders, Michael ; Rosset, Saharon ; Zhu, Ji ; Knight, Keith |
Published in: |
Journal of the Royal Statistical Society Series B. - Royal Statistical Society - RSS, ISSN 1369-7412. - Vol. 67.2005, 1, p. 91-108
|
Publisher: |
Royal Statistical Society - RSS |
Saved in:
freely available
Saved in favorites
Similar items by person
-
Theory and Methods - Comment - Bayesian CART Model Search
Knight, Keith, (1998)
-
The Covariance Inflation Criterion for Adaptive Model Selection
Tibshirani, Robert, (1999)
-
Optimal control of false discovery criteria in the two‐group model
Heller, Ruth, (2020)
- More ...