Large-Scale Sparse Logistic Regression

Authors
Liu, Jun [1]
Chen, Jianhui [1]
Ye, Jieping [1]
Affiliation
[1] Arizona State Univ, Tempe, AZ 85287 USA
Keywords
Logistic regression; sparse learning; l(1)-ball constraint; Nesterov's method; adaptive line search; CLASSIFICATION; ALGORITHM; SELECTION; CANCER;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Logistic regression is a well-known classification method that has been used widely in many applications of data mining, machine learning, computer vision, and bioinformatics. Sparse logistic regression embeds feature selection in the classification framework using l(1)-norm regularization, and is attractive in many applications involving high-dimensional data. In this paper, we propose Lassplore for solving large-scale sparse logistic regression. Specifically, we formulate the problem as an l(1)-ball constrained smooth convex optimization, and propose to solve it using Nesterov's method, an optimal first-order black-box method for smooth convex optimization. One of the critical issues in the use of Nesterov's method is the estimation of the step size at each optimization iteration. Previous approaches either apply a constant step size, which assumes that the Lipschitz constant of the gradient is known in advance, or require a sequence of decreasing step sizes, which leads to slow convergence in practice. In this paper, we propose an adaptive line search scheme that tunes the step size adaptively while guaranteeing the optimal convergence rate. Empirical comparisons with several state-of-the-art algorithms demonstrate the efficiency of the proposed Lassplore algorithm for large-scale problems.
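The general idea in the abstract, minimizing the smooth logistic loss over an l(1)-ball with an accelerated first-order method whose step size is found by line search, can be illustrated with a minimal sketch. This is not the Lassplore algorithm: the abstract does not give the details of its adaptive line search, so the code below substitutes a standard accelerated projected gradient loop with simple backtracking (the Lipschitz estimate `L` only ever doubles). The function names, the omission of an intercept term, the sort-based projection, and the default parameters are all assumptions made for illustration.

```python
import math

def project_l1_ball(v, z):
    """Euclidean projection of v onto {x : ||x||_1 <= z}, z > 0 (sort-based)."""
    if sum(abs(a) for a in v) <= z:
        return list(v)
    u = sorted((abs(a) for a in v), reverse=True)
    cumsum, theta = 0.0, 0.0
    for j, uj in enumerate(u, start=1):
        cumsum += uj
        t = (cumsum - z) / j
        if uj - t > 0:          # holds for a prefix of j; keep the last threshold
            theta = t
    # Soft-threshold the magnitudes and restore the signs.
    return [math.copysign(max(abs(a) - theta, 0.0), a) for a in v]

def loss_and_grad(X, y, w):
    """Average logistic loss (labels in {-1, +1}, no intercept) and its gradient."""
    m, n = len(X), len(w)
    loss, grad = 0.0, [0.0] * n
    for xi, yi in zip(X, y):
        t = yi * sum(a * b for a, b in zip(xi, w))
        if t >= 0:              # numerically stable branches
            e = math.exp(-t)
            loss += math.log1p(e)
            p = e / (1.0 + e)   # sigmoid(-t)
        else:
            e = math.exp(t)
            loss += -t + math.log1p(e)
            p = 1.0 / (1.0 + e)
        for k in range(n):
            grad[k] -= yi * p * xi[k]
    return loss / m, [g / m for g in grad]

def sparse_logreg(X, y, z, iters=200, L=1.0):
    """Accelerated projected gradient with backtracking (illustrative stand-in)."""
    w = [0.0] * len(X[0])
    s, t = list(w), 1.0         # s is the extrapolated search point
    for _ in range(iters):
        fs, gs = loss_and_grad(X, y, s)
        while True:             # backtracking: double L until the bound holds
            w_new = project_l1_ball([a - g / L for a, g in zip(s, gs)], z)
            d = [a - b for a, b in zip(w_new, s)]
            f_new, _ = loss_and_grad(X, y, w_new)
            bound = (fs + sum(g * di for g, di in zip(gs, d))
                     + 0.5 * L * sum(di * di for di in d))
            if f_new <= bound + 1e-12:
                break
            L *= 2.0
        t_new = 0.5 * (1.0 + math.sqrt(1.0 + 4.0 * t * t))
        beta = (t - 1.0) / t_new
        s = [a + beta * (a - b) for a, b in zip(w_new, w)]
        w, t = w_new, t_new
    return w
```

Calling `sparse_logreg(X, y, z)` with `X` as a list of feature rows and `y` as a list of labels in {-1, +1} returns a weight vector whose l(1)-norm is at most `z`; a smaller `z` drives more coordinates to exactly zero. A scheme that can also shrink `L` between iterations, as an adaptive line search would, is left out to keep the sketch short.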
Pages: 547-555
Page count: 9