Large Scale Empirical Risk Minimization via Truncated Adaptive Newton Method

被引:0
|
作者
Eisen, Mark [1 ]
Mokhtari, Aryan [2 ]
Ribeiro, Alejandro [1 ]
机构
[1] Univ Penn, Philadelphia, PA 19104 USA
[2] MIT, Cambridge, MA 02139 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most second order methods are inapplicable to large scale empirical risk minimization (ERM) problems because both, the number of samples N and number of parameters p are large. Large N makes it costly to evaluate Hessians and large p makes it costly to invert Hessians. This paper propose a novel adaptive sample size second-order method, which reduces the cost of computing the Hessian by solving a sequence of ERM problems corresponding to a subset of samples and lowers the cost of computing the Hessian inverse using a truncated eigenvalue decomposition. Although the sample size is grown at a geometric rate, it is shown that it is sufficient to run a single iteration in each growth stage to track the optimal classifier to within its statistical accuracy. This results in convergence to the optimal classifier associated with the whole set in a number of iterations that scales with log(N). The use of a truncated eigenvalue decomposition result in the cost of each iteration being of order p(2). Theoretical performance gains manifest in practical implementations.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Adaptive Newton Method for Empirical Risk Minimization to Statistical Accuracy
    Mokhtari, Aryan
    Daneshmand, Hadi
    Lucchi, Aurelien
    Hofmann, Thomas
    Ribeiro, Alejandro
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [2] A truncated Newton method for the solution of large-scale inequality constrained minimization problems
    Facchinei, F
    Liuzzi, G
    Lucidi, S
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2003, 25 (1-3) : 85 - 122
  • [3] A Truncated Newton Method for the Solution of Large-Scale Inequality Constrained Minimization Problems
    Francisco Facchinei
    Giampaolo Liuzzi
    Stefano Lucidi
    Computational Optimization and Applications, 2003, 25 : 85 - 122
  • [4] Truncated Newton method for the analysis of large-scale water distribution networks
    Instituto Mexicano de Tecnologia del, Agua, Mexico, Mexico
    Int Conf Comput Methods Water Res CMWR, (145-152):
  • [5] A truncated Newton method for the analysis of large-scale water distribution networks
    Tzatchkov, VG
    MoralesPerez, JL
    COMPUTATIONAL METHODS IN WATER RESOURCES XI, VOL 2: COMPUTATIONAL METHODS IN SURFACE FLOW AND TRANSPORT PROBLEMS, 1996, : 145 - 152
  • [6] TNPACK - A TRUNCATED NEWTON MINIMIZATION PACKAGE FOR LARGE-SCALE PROBLEMS .1. ALGORITHM AND USAGE
    SCHLICK, T
    FOGELSON, A
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1992, 18 (01): : 46 - 70
  • [7] TNPACK - A TRUNCATED NEWTON MINIMIZATION PACKAGE FOR LARGE-SCALE PROBLEMS .2. IMPLEMENTATION EXAMPLES
    SCHLICK, T
    FOGELSON, A
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1992, 18 (01): : 71 - 111
  • [8] A POWERFUL TRUNCATED NEWTON METHOD FOR POTENTIAL-ENERGY MINIMIZATION
    SCHLICK, T
    OVERTON, M
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 1987, 8 (07) : 1025 - 1039
  • [9] An active set truncated Newton method for large-scale bound constrained optimization
    Cheng, Wanyou
    Chen, Zixin
    Li, Dong-hui
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2014, 67 (05) : 1016 - 1023
  • [10] Global Convergence of Newton Method for Empirical Risk Minimization in Reproducing Kernel Hilbert Space
    Chang, Ting-Jui
    Shahrampour, Shahin
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 1222 - 1226