SMOOTHING PROXIMAL GRADIENT METHOD FOR GENERAL STRUCTURED SPARSE REGRESSION

被引:144
|
作者
Chen, Xi [1 ]
Lin, Qihang [2 ]
Kim, Seyoung [1 ]
Carbonell, Jaime G. [1 ]
Xing, Eric P. [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA 15213 USA
来源
ANNALS OF APPLIED STATISTICS | 2012年 / 6卷 / 02期
关键词
Sparse regression; structured sparsity; smoothing; proximal gradient; optimization; SHRINKAGE; SELECTION;
D O I
10.1214/11-AOAS514
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study the problem of estimating high-dimensional regression models regularized by a structured sparsity-inducing penalty that encodes prior structural information on either the input or output variables. We consider two widely adopted types of penalties of this kind as motivating examples: (1) the general overlapping-group-lasso penalty, generalized from the group-lasso penalty; and (2) the graph-guided-fused-lasso penalty, generalized from the fused-lasso penalty. For both types of penalties, due to their nonseparability and nonsmoothness, developing an efficient optimization method remains a challenging problem. In this paper we propose a general optimization approach, the smoothing proximal gradient (SPG) method, which can solve structured sparse regression problems with any smooth convex loss under a wide spectrum of structured sparsity-inducing penalties. Our approach combines a smoothing technique with an effective proximal gradient method. It achieves a convergence rate significantly faster than the standard first-order methods, subgradient methods, and is much more scalable than the most widely used interior-point methods. The efficiency and scalability of our method are demonstrated on both simulation experiments and real genetic data sets.
引用
收藏
页码:719 / 752
页数:34
相关论文
共 50 条
  • [31] Smoothing Accelerated Proximal Gradient Method with Fast Convergence Rate for Nonsmooth Convex Optimization Beyond Differentiability
    Fan Wu
    Wei Bian
    [J]. Journal of Optimization Theory and Applications, 2023, 197 : 539 - 572
  • [32] Smoothing Accelerated Proximal Gradient Method with Fast Convergence Rate for Nonsmooth Convex Optimization Beyond Differentiability
    Wu, Fan
    Bian, Wei
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2023, 197 (02) : 539 - 572
  • [33] Structured sparse K-means clustering via Laplacian smoothing
    Gong, Weikang
    Zhao, Renbo
    Grunewald, Stefan
    [J]. PATTERN RECOGNITION LETTERS, 2018, 112 : 63 - 69
  • [34] Sparse conditional copula models for structured output regression
    Kim, Minyoung
    [J]. PATTERN RECOGNITION, 2016, 60 : 761 - 769
  • [35] Sparse regression with structured priors:: Application to audio denoising
    Fevotte, Cedric
    Daudet, Laurent
    Godsill, Simon J.
    Torresani, Bruno
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 2508 - 2511
  • [36] STRUCTURED, SPARSE REGRESSION WITH APPLICATION TO HIV DRUG RESISTANCE
    Percival, Daniel
    Roeder, Kathryn
    Rosenfeld, Roni
    Wasserman, Larry
    [J]. ANNALS OF APPLIED STATISTICS, 2011, 5 (2A): : 628 - 644
  • [37] Dependent relevance determination for smooth and structured sparse regression
    Wu, Anqi
    Koyejo, Oluwasanmi
    Pillow, Jonathan
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [38] Dependent relevance determination for smooth and structured sparse regression
    Wu, Anqi
    Koyejo, Oluwasanmi
    Pillow, Jonathan
    [J]. Journal of Machine Learning Research, 2019, 20
  • [39] Adaptive Proximal Gradient Methods for Structured Neural Networks
    Yun, Jihun
    Lozano, Aurelie C.
    Yang, Eunho
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [40] Image Smoothing Based on Image Decomposition and Sparse High Frequency Gradient
    Guang-Hao Ma
    Ming-Li Zhang
    Xue-Mei Li
    Cai-Ming Zhang
    [J]. Journal of Computer Science and Technology, 2018, 33 : 502 - 510