SMOOTHING PROXIMAL GRADIENT METHOD FOR GENERAL STRUCTURED SPARSE REGRESSION

被引：144

作者：

Chen, Xi ^{[1
]}

Lin, Qihang ^{[2
]}

Kim, Seyoung ^{[1
]}

Carbonell, Jaime G. ^{[1
]}

Xing, Eric P. ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

[2] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA 15213 USA

来源：

ANNALS OF APPLIED STATISTICS | 2012年 / 6卷 / 02期

关键词：

Sparse regression; structured sparsity; smoothing; proximal gradient; optimization; SHRINKAGE; SELECTION;

D O I：

10.1214/11-AOAS514

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

We study the problem of estimating high-dimensional regression models regularized by a structured sparsity-inducing penalty that encodes prior structural information on either the input or output variables. We consider two widely adopted types of penalties of this kind as motivating examples: (1) the general overlapping-group-lasso penalty, generalized from the group-lasso penalty; and (2) the graph-guided-fused-lasso penalty, generalized from the fused-lasso penalty. For both types of penalties, due to their nonseparability and nonsmoothness, developing an efficient optimization method remains a challenging problem. In this paper we propose a general optimization approach, the smoothing proximal gradient (SPG) method, which can solve structured sparse regression problems with any smooth convex loss under a wide spectrum of structured sparsity-inducing penalties. Our approach combines a smoothing technique with an effective proximal gradient method. It achieves a convergence rate significantly faster than the standard first-order methods, subgradient methods, and is much more scalable than the most widely used interior-point methods. The efficiency and scalability of our method are demonstrated on both simulation experiments and real genetic data sets.

引用

页码：719 / 752

页数：34

共 50 条

[31] Smoothing Accelerated Proximal Gradient Method with Fast Convergence Rate for Nonsmooth Convex Optimization Beyond Differentiability
Fan Wu
Wei Bian
[J]. Journal of Optimization Theory and Applications, 2023, 197 : 539 - 572
[32] Smoothing Accelerated Proximal Gradient Method with Fast Convergence Rate for Nonsmooth Convex Optimization Beyond Differentiability
Wu, Fan
Bian, Wei
[J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2023, 197 (02) : 539 - 572
[33] Structured sparse K-means clustering via Laplacian smoothing
Gong, Weikang
Zhao, Renbo
Grunewald, Stefan
[J]. PATTERN RECOGNITION LETTERS, 2018, 112 : 63 - 69
[34] Sparse conditional copula models for structured output regression
Kim, Minyoung
[J]. PATTERN RECOGNITION, 2016, 60 : 761 - 769
[35] Sparse regression with structured priors:: Application to audio denoising
Fevotte, Cedric
Daudet, Laurent
Godsill, Simon J.
Torresani, Bruno
[J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 2508 - 2511
[36] STRUCTURED, SPARSE REGRESSION WITH APPLICATION TO HIV DRUG RESISTANCE
Percival, Daniel
Roeder, Kathryn
Rosenfeld, Roni
Wasserman, Larry
[J]. ANNALS OF APPLIED STATISTICS, 2011, 5 (2A): : 628 - 644
[37] Dependent relevance determination for smooth and structured sparse regression
Wu, Anqi
Koyejo, Oluwasanmi
Pillow, Jonathan
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
[38] Dependent relevance determination for smooth and structured sparse regression
Wu, Anqi
Koyejo, Oluwasanmi
Pillow, Jonathan
[J]. Journal of Machine Learning Research, 2019, 20
[39] Adaptive Proximal Gradient Methods for Structured Neural Networks
Yun, Jihun
Lozano, Aurelie C.
Yang, Eunho
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[40] Image Smoothing Based on Image Decomposition and Sparse High Frequency Gradient
Guang-Hao Ma
Ming-Li Zhang
Xue-Mei Li
Cai-Ming Zhang
[J]. Journal of Computer Science and Technology, 2018, 33 : 502 - 510

← 1 2 3 4 5 →