Fast stochastic second-order method logarithmic in condition number

Cited by: 1
Authors
Ye, Haishan [1]
Xie, Guangzeng [2]
Luo, Luo [1]
Zhang, Zhihua [2]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, 800 Dong Chuan Rd, Shanghai 200240, Peoples R China
[2] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
Funding
National Natural Science Foundation of China
DOI
10.1016/j.patcog.2018.11.031
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Optimization is an important issue in machine learning because many machine learning models are reformulated as optimization problems. Many machine learning algorithms, such as deep learning, logistic regression, and support vector machines, mainly focus on minimizing an empirical loss. Because data is growing explosively, solving large-scale optimization problems is challenging. Recently, stochastic second-order methods have attracted much attention due to their efficiency in each iteration. These methods perform well when training machine learning models such as logistic regression and support vector machines. However, the computational complexity of existing stochastic second-order methods depends heavily on the condition number of the Hessian. In this paper, we propose a new Newton-like method called Preconditioned Newton Conjugate Gradient with Sketched Hessian (PNCG). The runtime complexity of PNCG is at most logarithmic in the condition number of the Hessian. PNCG exhibits advantages over existing subsampled Newton methods, especially when the Hessian matrix in question is ill-conditioned. We also show empirically that our method performs well when training machine learning models. The results show consistent improvements in computational efficiency. (C) 2018 Elsevier Ltd. All rights reserved.
Pages: 629-642
Number of pages: 14
Related papers
50 in total
  • [41] Perturbed Second-Order Stochastic Evolution Equations
    Lijuan Cheng
    Yong Ren
    Qualitative Theory of Dynamical Systems, 2021, 20
  • [42] Perturbed Second-Order Stochastic Evolution Equations
    Cheng, Lijuan
    Ren, Yong
    QUALITATIVE THEORY OF DYNAMICAL SYSTEMS, 2021, 20 (02)
  • [43] Unified second-order stochastic averaging approach
    Hijawi, M.
    Moschuk, N.
    Ibrahim, R.A.
    Journal of Applied Mechanics, Transactions ASME, 1997, 64 (02) : 281 - 291
  • [44] A new characterization of second-order stochastic dominance
    Guan, Yuanying
    Huang, Muqiao
    Wang, Ruodu
    INSURANCE MATHEMATICS & ECONOMICS, 2024, 119 : 261 - 267
  • [45] Weak Galerkin Method for Second-Order Elliptic Equations with Newton Boundary Condition
    Qin, Mingze
    Wang, Ruishu
    Zhai, Qilong
    Zhang, Ran
    COMMUNICATIONS IN COMPUTATIONAL PHYSICS, 2023, 33 (02) : 568 - 595
  • [46] Asymmetric second-order stochastic resonance weak fault feature extraction method
    Tang, Jiachen
    Shi, Boqiang
    MEASUREMENT & CONTROL, 2020, 53 (5-6) : 788 - 795
  • [47] A second-order optimality condition with first- and second-order complementarity associated with global convergence of algorithms
    Haeser, Gabriel
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2018, 70 (02) : 615 - 639
  • [48] A second-order optimality condition with first- and second-order complementarity associated with global convergence of algorithms
    Gabriel Haeser
    Computational Optimization and Applications, 2018, 70 : 615 - 639
  • [49] A second-order distributed memory parallel fast sweeping method for the Eikonal equation
    Tro, Sara
    Evans, Tyco Mera
    Aslam, Tariq D.
    Lozano, Eduardo
    Culp, David B.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2023, 474
  • [50] A fast second-order absorbing boundary condition for the linearized Benjamin-Bona-Mahony equation
    Zheng, Zijun
    Pang, Gang
    Ehrhardt, Matthias
    Liu, Baiyili
    NUMERICAL ALGORITHMS, 2024, : 2037 - 2080