Stochastic Second-Order Method for Large-Scale Nonconvex Sparse Learning Models

Cited by: 0
Authors
Gao, Hongchang [1]
Huang, Heng [1]
Affiliations
[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA
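Funding
U.S. National Science Foundation
Keywords
SIGNAL RECOVERY; SELECTION
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405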
Abstract
Sparse learning models have shown promising performance in high-dimensional machine learning applications. The main challenge is optimizing them efficiently. Most existing methods relax the problem into a convex one, which incurs a large estimation bias. The sparse learning model with a nonconvex constraint has therefore attracted much attention for its better performance, but its nonconvexity makes it difficult to optimize. In this paper, we propose a linearly convergent stochastic second-order method for optimizing this nonconvex problem on large-scale datasets. The proposed method incorporates second-order information to improve convergence speed. Theoretical analysis shows that the method enjoys a linear convergence rate and is guaranteed to converge to the underlying true model parameter. Experimental results verify the efficiency and correctness of the proposed method.
Pages: 2128-2134
Page count: 7