Fast stochastic second-order method logarithmic in condition number

被引:1
|
作者
Ye, Haishan [1 ]
Xie, Guangzeng [2 ]
Luo, Luo [1 ]
Zhang, Zhihua [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, 800 Dong Chuan Rd, Shanghai 200240, Peoples R China
[2] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
38;
D O I
10.1016/j.patcog.2018.11.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optimization is an important issue in machine learning because many machine learning models are reformulated as optimization problems. Different kinds of machine learning algorithms mainly focus on minimizing their empirical loss like deep learning, logistic regression, and support vector machine. Because data is explosively growing, it is challenging to deal with a large-scale optimization problem. Recently, stochastic second-order methods have emerged to attract much attention due to their efficiency in each iteration. These methods show good performance on training machine learning algorithms like logistic regression and support vector machine. However, the computational complexity of existing stochastic second-order methods heavily depends on the condition number of the Hessian. In this paper, we propose a new Newton-like method called Preconditioned Newton Conjugate Gradient with Sketched Hessian (PNCG). The runtime complexity of PNCG is at most logarithmic in the condition number of the Hessian. PNCG exhibits advantages over existing subsampled Newton methods especially when the Hessian matrix in question is ill-conditioned. We also show that our method has good performance on training machine learning algorithm empirically. The results show consistent improvements in computational efficiency. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:629 / 642
页数:14
相关论文
共 50 条
  • [21] On the Weak Second-order Optimality Condition for Nonlinear Semidefinite and Second-order Cone Programming
    Ellen H. Fukuda
    Gabriel Haeser
    Leonardo M. Mito
    Set-Valued and Variational Analysis, 2023, 31
  • [22] A second-order sequential optimality condition for nonlinear second-order cone programming problems
    Fukuda, Ellen H.
    Okabe, Kosuke
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2025, 90 (03) : 911 - 939
  • [23] HANF NUMBER OF SECOND-ORDER LOGIC
    BARWISE, KJ
    JOURNAL OF SYMBOLIC LOGIC, 1972, 37 (03) : 588 - 594
  • [24] The velocity decomposition method for second-order accuracy in stochastic parcel simulations
    Pischke, Philipp
    Cordes, Diana
    Kneer, Reinhold
    INTERNATIONAL JOURNAL OF MULTIPHASE FLOW, 2012, 47 : 160 - 170
  • [25] A New Second-Order Tristable Stochastic Resonance Method for Fault Diagnosis
    Lu, Lu
    Yuan, Yu
    Wang, Heng
    Zhao, Xing
    Zheng, Jianjie
    SYMMETRY-BASEL, 2019, 11 (08):
  • [27] A fast and accurate method for evaluating joint second-order PMD statistics
    Forestieri, E
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2003, 21 (11) : 2942 - 2952
  • [28] Fast beam propagation method for the analysis of second-order nonlinear phenomena
    Capobianco, AD
    Brillo, D
    De Angelis, C
    Nalesso, G
    IEEE PHOTONICS TECHNOLOGY LETTERS, 1998, 10 (04) : 543 - 545
  • [29] Second-order Nonlinear Function Navigation Method for Fast Mobile Robots
    Kim, Dong-Han
    Rew, Keun-Ho
    2008 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS, VOLS 1-4, 2008, : 2466 - +
  • [30] The second-order Ehrenfest method
    Morgane Vacher
    David Mendive-Tapia
    Michael J. Bearpark
    Michael A. Robb
    Theoretical Chemistry Accounts, 2014, 133 (7)