A stochastic extra-step quasi-Newton method for nonsmooth nonconvex optimization

被引:12
|
作者
Yang, Minghan [1 ]
Milzarek, Andre [2 ,3 ,4 ]
Wen, Zaiwen [1 ,5 ]
Zhang, Tong [6 ]
机构
[1] Peking Univ, BICMR, Beijing Int Ctr Math Res, Beijing, Peoples R China
[2] Chinese Univ Hong Kong Shenzhen, Sch Data Sci SDS, Shenzhen, Guangdong, Peoples R China
[3] SRIBD, Shenzhen Res Inst Big Data, Shenzhen, Guangdong, Peoples R China
[4] AIRS, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Guangdong, Peoples R China
[5] Peking Univ, Ctr Data Sci, Beijing, Peoples R China
[6] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
Nonsmooth stochastic optimization; Stochastic approximation; Global convergence; Stochastic higher order method; Stochastic quasi-Newton scheme; EXTRAGRADIENT METHOD; VARIANCE REDUCTION; SUPERLINEAR CONVERGENCE; NEURAL-NETWORKS; ALGORITHM; CONVEX; INEQUALITIES; EQUATIONS;
D O I
10.1007/s10107-021-01629-y
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, a novel stochastic extra-step quasi-Newton method is developed to solve a class of nonsmooth nonconvex composite optimization problems. We assume that the gradient of the smooth part of the objective function can only be approximated by stochastic oracles. The proposed method combines general stochastic higher order steps derived from an underlying proximal type fixed-point equation with additional stochastic proximal gradient steps to guarantee convergence. Based on suitable bounds on the step sizes, we establish global convergence to stationary points in expectation and an extension of the approach using variance reduction techniques is discussed. Motivated by large-scale and big data applications, we investigate a stochastic coordinate-type quasi-Newton scheme that allows to generate cheap and tractable stochastic higher order directions. Finally, numerical results on large-scale logistic regression and deep learning problems show that our proposed algorithm compares favorably with other state-of-the-art methods.
引用
收藏
页码:257 / 303
页数:47
相关论文
共 50 条
  • [21] A PROXIMAL QUASI-NEWTON TRUST-REGION METHOD FOR NONSMOOTH REGULARIZED OPTIMIZATION
    Aravkin, Aleksandr Y.
    Baraldi, Robert
    Orban, Dominique
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2022, 32 (02) : 900 - 929
  • [22] Forward–backward quasi-Newton methods for nonsmooth optimization problems
    Lorenzo Stella
    Andreas Themelis
    Panagiotis Patrinos
    [J]. Computational Optimization and Applications, 2017, 67 : 443 - 487
  • [23] Convergence of Perry and Shanno's Memoryless Quasi-Newton Method for Nonconvex Optimization Problems
    JI-YE HAN GUANG-HUI LIU HONG-XIA YIN(Institution of Applied Mathematics
    [J]. 运筹学学报, 1997, (01) : 22 - 28
  • [24] A NONSMOOTH GLOBAL QUASI-NEWTON METHOD FOR NONLINEAR COMPLEMENTARITY PROBLEMS
    Andres Arias, Carlos
    Jairo Martinez, Hector
    Perez, Rosana
    [J]. PACIFIC JOURNAL OF OPTIMIZATION, 2017, 13 (01): : 1 - 15
  • [25] A SMOOTHING STOCHASTIC QUASI-NEWTON METHOD FOR NON-LIPSCHITZIAN STOCHASTIC OPTIMIZATION PROBLEMS
    Yousefian, Farzad
    Nedic, Angelia
    Shanbhag, Uday V.
    [J]. 2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 2291 - 2302
  • [26] A Quasi-Newton Approach to Nonsmooth Convex Optimization Problems in Machine Learning
    Yu, Jin
    Vishwanathan, S. V. N.
    Guenter, Simon
    Schraudolph, Nicol N.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 1145 - 1200
  • [27] A Survey of Quasi-Newton Equations and Quasi-Newton Methods for Optimization
    Chengxian Xu
    Jianzhong Zhang
    [J]. Annals of Operations Research, 2001, 103 : 213 - 234
  • [28] Forward-backward quasi-Newton methods for nonsmooth optimization problems
    Stella, Lorenzo
    Themelis, Andreas
    Patrinos, Panagiotis
    [J]. COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2017, 67 (03) : 443 - 487
  • [29] Stochastic generalized gradient method for nonconvex nonsmooth stochastic optimization
    Yu. M. Ermol'ev
    V. I. Norkin
    [J]. Cybernetics and Systems Analysis, 1998, 34 : 196 - 215
  • [30] Stochastic generalized gradient method for nonconvex nonsmooth stochastic optimization
    Ermol'ev, YM
    Norkin, VI
    [J]. CYBERNETICS AND SYSTEMS ANALYSIS, 1998, 34 (02) : 196 - 215