Seagull: lasso, group lasso and sparse-group lasso regularization for linear regression models via proximal gradient descent

Cited by: 16
Authors
Klosa, Jan [1]
Simon, Noah [2]
Westermark, Pal Olof [1]
Liebscher, Volkmar [3]
Wittenburg, Doerte [1]
Affiliations
[1] Leibniz Inst Farm Anim Biol, Inst Genet & Biometry, D-18196 Dummerstorf, Germany
[2] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[3] Univ Greifswald, Inst Math & Comp Sci, D-17489 Greifswald, Germany
Keywords
Optimization; Machine learning; High-dimensional data; R package; Selection
DOI
10.1186/s12859-020-03725-w
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Statistical analyses of biological problems in life sciences often lead to high-dimensional linear models. To solve the corresponding system of equations, penalization approaches are often the methods of choice. They are especially useful in case of multicollinearity, which appears if the number of explanatory variables exceeds the number of observations or for some biological reason. Then, the model goodness of fit is penalized by some suitable function of interest. Prominent examples are the lasso, group lasso and sparse-group lasso. Here, we offer a fast and numerically cheap implementation of these operators via proximal gradient descent. The grid search for the penalty parameter is realized by warm starts. The step size between consecutive iterations is determined with backtracking line search. Finally, seagull, the R package presented here, produces complete regularization paths.
Results: Publicly available high-dimensional methylation data are used to compare seagull to the established R package SGL. The results of both packages enabled a precise prediction of biological age from DNA methylation status. But even though the results of seagull and SGL were very similar (R² > 0.99), seagull computed the solution in a fraction of the time needed by SGL. Additionally, seagull enables the incorporation of weights for each penalized feature.
Conclusions: The following operators for linear regression models are available in seagull: lasso, group lasso, sparse-group lasso and Integrative LASSO with Penalty Factors (IPF-lasso). Thus, seagull is a convenient envelope of lasso variants.
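The three ingredients named in the abstract (proximal gradient descent, backtracking line search, and warm starts along the penalty grid) can be illustrated with a minimal sketch for the plain lasso case. This is not the seagull implementation, which is an R package; it is a pure-Python illustration written under stated assumptions. All function names (`soft_threshold`, `lasso_prox_grad`, `lasso_path`) and default parameters are hypothetical, and the objective is assumed to be (1/(2n))·||y − Xβ||² + λ·||β||₁.

```python
import math

def soft_threshold(z, t):
    """Proximal operator of t * ||.||_1, applied elementwise."""
    return [math.copysign(max(abs(v) - t, 0.0), v) for v in z]

def smooth_loss(X, y, beta):
    """Return 0.5/n * ||X beta - y||^2 and the residual vector X beta - y."""
    res = [sum(x * b for x, b in zip(row, beta)) - yi for row, yi in zip(X, y)]
    return 0.5 * sum(r * r for r in res) / len(y), res

def gradient(X, res):
    """Gradient of the smooth loss: X^T (X beta - y) / n."""
    n, p = len(res), len(X[0])
    return [sum(X[i][j] * res[i] for i in range(n)) / n for j in range(p)]

def lasso_prox_grad(X, y, lam, beta0, step=1.0, shrink=0.5,
                    tol=1e-10, max_iter=10000):
    """Proximal gradient descent for the lasso, started from beta0."""
    beta = list(beta0)
    for _ in range(max_iter):
        f, res = smooth_loss(X, y, beta)
        g = gradient(X, res)
        t = step
        for _ in range(60):  # backtracking line search, bounded for safety
            cand = soft_threshold([b - t * gj for b, gj in zip(beta, g)],
                                  t * lam)
            diff = [c - b for c, b in zip(cand, beta)]
            f_new, _ = smooth_loss(X, y, cand)
            # Quadratic upper bound of the smooth loss around beta.
            bound = (f + sum(gj * d for gj, d in zip(g, diff))
                     + sum(d * d for d in diff) / (2.0 * t))
            if f_new <= bound + 1e-12 * max(1.0, abs(f)):
                break
            t *= shrink  # step too long: shrink and retry
        beta = cand
        if max(abs(d) for d in diff) < tol:
            break
    return beta

def lasso_path(X, y, lambdas):
    """Regularization path over a penalty grid, using warm starts."""
    beta = [0.0] * len(X[0])
    path = []
    for lam in sorted(lambdas, reverse=True):
        # Warm start: reuse the solution from the previous (larger) penalty.
        beta = lasso_prox_grad(X, y, lam, beta)
        path.append((lam, beta))
    return path
```

Processing the penalties from largest to smallest means each solve starts close to its solution, which is the point of warm starts. For group lasso or sparse-group lasso, only the proximal step would change (a groupwise shrinkage in place of, or in addition to, the elementwise soft threshold); that extension is not shown here.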
Pages: 8
Related papers
50 in total
  • [1] Seagull: lasso, group lasso and sparse-group lasso regularization for linear regression models via proximal gradient descent
    Jan Klosa
    Noah Simon
    Pål Olof Westermark
    Volkmar Liebscher
    Dörte Wittenburg
    [J]. BMC Bioinformatics, 21
  • [2] A Sparse-Group Lasso
    Simon, Noah
    Friedman, Jerome
    Hastie, Trevor
    Tibshirani, Robert
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2013, 22 (02) : 231 - 245
  • [3] An Iterative Sparse-Group Lasso
    Laria, Juan C.
    Carmen Aguilera-Morillo, M.
    Lillo, Rosa E.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (03) : 722 - 731
  • [4] A Modified Adaptive Sparse-Group LASSO Regularization for Optimal Portfolio Selection
    Sadik, Somaya
    Et-Tolba, Mohamed
    Nsiri, Benayad
    [J]. IEEE ACCESS, 2024, 12 : 107337 - 107352
  • [5] An application of sparse-group lasso regularization to equity portfolio optimization and sector selection
    Jingnan Chen
    Gengling Dai
    Ning Zhang
    [J]. Annals of Operations Research, 2020, 284 : 243 - 262
  • [6] Index Tracking Based on Sparse-Group Lasso
    王国长
    高桃璇
    徐世荣
    [J]. Journal of Systems Science and Mathematical Sciences, 2019, 39 (12) : 2025 - 2040
  • [7] GAP Safe Screening Rules for Sparse-Group Lasso
    Ndiaye, Eugene
    Fercoq, Olivier
    Gramfort, Alexandre
    Salmon, Joseph
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [8] An application of sparse-group lasso regularization to equity portfolio optimization and sector selection
    Chen, Jingnan
    Dai, Gengling
    Zhang, Ning
    [J]. ANNALS OF OPERATIONS RESEARCH, 2020, 284 (01) : 243 - 262
  • [9] Multiple Change-Points Estimation in Linear Regression Models via Sparse Group Lasso
    Zhang, Bingwen
    Geng, Jun
    Lai, Lifeng
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (09) : 2209 - 2224
  • [10] Sparse group lasso for multiclass functional logistic regression models
    Matsui, Hidetoshi
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2019, 48 (06) : 1784 - 1797