Stochastic Second-Order Method for Large-Scale Nonconvex Sparse Learning Models

Cited by: 0

Authors:

Gao, Hongchang [1]

Huang, Heng [1]

Affiliations:

[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA

Source:

PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018

Funding:

National Science Foundation (USA);

Keywords:

SIGNAL RECOVERY; SELECTION;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Sparse learning models have shown promising performance in high-dimensional machine learning applications. The main challenge of sparse learning models is how to optimize them efficiently. Most existing methods solve this problem by relaxing it to a convex problem, which incurs a large estimation bias. Thus, the sparse learning model with a nonconvex constraint has attracted much attention due to its better performance, but it is difficult to optimize because of the nonconvexity. In this paper, we propose a linearly convergent stochastic second-order method to optimize this nonconvex problem for large-scale datasets. The proposed method incorporates second-order information to improve the convergence speed. Theoretical analysis shows that the proposed method enjoys a linear convergence rate and is guaranteed to converge to the underlying true model parameter. Experimental results verify the efficiency and correctness of the proposed method.
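
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of combining stochastic second-order information with a nonconvex sparsity constraint. It assumes the constraint is a cardinality constraint (at most `k` nonzero parameters) and a least-squares loss, and it swaps in a plain minibatch Newton-type step followed by hard thresholding. The names `hard_threshold` and `stochastic_newton_iht`, and the parameters `batch_size` and `ridge`, are hypothetical and are not taken from the paper.

```python
import numpy as np


def hard_threshold(w, k):
    """Keep the k largest-magnitude entries of w and set the rest to zero."""
    out = np.zeros_like(w)
    if k > 0:
        # Indices of the k entries with the largest absolute value.
        idx = np.argpartition(np.abs(w), -k)[-k:]
        out[idx] = w[idx]
    return out


def stochastic_newton_iht(X, y, k, n_iters=30, batch_size=100, ridge=1e-3, seed=0):
    """Illustrative sketch (not the paper's method): minibatch Newton-type
    steps on a least-squares loss, each followed by projection onto the
    set of k-sparse vectors."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_iters):
        # Sample a minibatch without replacement.
        batch = rng.choice(n, size=min(batch_size, n), replace=False)
        Xb, yb = X[batch], y[batch]
        m = len(batch)
        # Minibatch gradient and (ridge-regularized) Hessian of 0.5 * mean squared error.
        grad = Xb.T @ (Xb @ w - yb) / m
        hess = Xb.T @ Xb / m + ridge * np.eye(d)
        # Newton-type step, then enforce the sparsity constraint.
        w = hard_threshold(w - np.linalg.solve(hess, grad), k)
    return w


if __name__ == "__main__":
    # Synthetic noiseless data with a 3-sparse ground-truth parameter (assumed setup).
    rng = np.random.default_rng(1)
    X = rng.standard_normal((500, 20))
    w_true = np.zeros(20)
    w_true[[2, 7, 11]] = [2.0, -1.5, 3.0]
    y = X @ w_true
    w_hat = stochastic_newton_iht(X, y, k=3)
    print("recovered support:", np.flatnonzero(w_hat))
```

In this sketch the curvature estimate comes from the same minibatch as the gradient, which keeps the per-iteration cost independent of the full dataset size; the `ridge` term is only there to keep the minibatch Hessian invertible.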

Pages: 2128 - 2134

Number of pages: 7
