Simultaneous supervised clustering and feature selection over a graph
In this article, we propose a regression method for simultaneous supervised clustering and feature selection over a given undirected graph, where homogeneous groups or clusters are estimated as well as informative predictors, with each predictor corresponding to one node in the graph and a connecting path indicating a priori possible grouping among the corresponding predictors. The method seeks a parsimonious model with high predictive power through identifying and collapsing homogeneous groups of regression coefficients. To address computational challenges, we present an efficient algorithm integrating the augmented Lagrange multipliers, coordinate descent and difference convex methods. We prove that the proposed method not only identifies the true homogeneous groups and informative features consistently but also leads to accurate parameter estimation. A gene network dataset is analysed to demonstrate that the method can make a difference by exploring dependency structures among the genes. Copyright 2012, Oxford University Press.
Year of publication: |
2012
|
---|---|
Authors: | Shen, Xiaotong ; Huang, Hsin-Cheng ; Pan, Wei |
Published in: |
Biometrika. - Biometrika Trust, ISSN 0006-3444. - Vol. 99.2012, 4, p. 899-914
|
Publisher: |
Biometrika Trust |
Saved in:
Saved in favorites
Similar items by person
-
On large margin hierarchical classification with multiple paths
Wang, Junhui, (2009)
-
Inference after model selection
Shen, Xiaotong, (2004)
-
Grouping pursuit through a regularization solution surface
Shen, Xiaotong, (2010)
- More ...