Forward-backward quasi-Newton methods for nonsmooth optimization problems

被引:71
|
作者
Stella, Lorenzo [1 ,2 ]
Themelis, Andreas [1 ,2 ]
Patrinos, Panagiotis [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT STADIUS, Kasteelpk Arenberg 10, B-3001 Leuven, Belgium
[2] IMT Sch Adv Studies Lucca, Piazza San Francesco 19, I-55100 Lucca, Italy
关键词
Nonsmooth optimization; Forward-backward splitting; Line-search methods; Quasi-Newton; Kurdyka-Lojasiewicz; BFGS METHOD; SUPERLINEAR CONVERGENCE; PROXIMAL ALGORITHM; GLOBAL CONVERGENCE; DESCENT METHODS; NONCONVEX; MINIMIZATION; INEQUALITY; SHRINKAGE; SUM;
D O I
10.1007/s10589-017-9912-y
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The forward-backward splitting method (FBS) for minimizing a nonsmooth composite function can be interpreted as a (variable-metric) gradient method over a continuously differentiable function which we call forward-backward envelope (FBE). This allows to extend algorithms for smooth unconstrained optimization and apply them to nonsmooth (possibly constrained) problems. Since the FBE can be computed by simply evaluating forward-backward steps, the resulting methods rely on a similar black-box oracle as FBS. We propose an algorithmic scheme that enjoys the same global convergence properties of FBS when the problem is convex, or when the objective function possesses the Kurdyka-Aojasiewicz property at its critical points. Moreover, when using quasi-Newton directions the proposed method achieves superlinear convergence provided that usual second-order sufficiency conditions on the FBE hold at the limit point of the generated sequence. Such conditions translate into milder requirements on the original function involving generalized second-order differentiability. We show that BFGS fits our framework and that the limited-memory variant L-BFGS is well suited for large-scale problems, greatly outperforming FBS or its accelerated version in practice, as well as ADMM and other problem-specific solvers. The analysis of superlinear convergence is based on an extension of the Dennis and Mor, theorem for the proposed algorithmic scheme.
引用
收藏
页码:443 / 487
页数:45
相关论文
共 50 条
  • [1] Forward–backward quasi-Newton methods for nonsmooth optimization problems
    Lorenzo Stella
    Andreas Themelis
    Panagiotis Patrinos
    [J]. Computational Optimization and Applications, 2017, 67 : 443 - 487
  • [2] On the convergence of quasi-Newton methods for nonsmooth problems
    Lopes, VLR
    Martinez, JM
    [J]. NUMERICAL FUNCTIONAL ANALYSIS AND OPTIMIZATION, 1995, 16 (9-10) : 1193 - 1209
  • [3] Nonsmooth optimization via quasi-Newton methods
    Adrian S. Lewis
    Michael L. Overton
    [J]. Mathematical Programming, 2013, 141 : 135 - 163
  • [4] Nonsmooth optimization via quasi-Newton methods
    Lewis, Adrian S.
    Overton, Michael L.
    [J]. MATHEMATICAL PROGRAMMING, 2013, 141 (1-2) : 135 - 163
  • [5] ON QUASI-NEWTON FORWARD-BACKWARD SPLITTING: PROXIMAL CALCULUS AND CONVERGENCE
    Becker, Stephen
    Fadili, Jalal
    Ochs, Peter
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2019, 29 (04) : 2445 - 2481
  • [6] Newton and quasi-Newton methods for a class of nonsmooth equations and related problems
    Sun, DF
    Han, JY
    [J]. SIAM JOURNAL ON OPTIMIZATION, 1997, 7 (02) : 463 - 480
  • [7] Quasi-Newton methods for multiobjective optimization problems
    Morovati, Vahid
    Basirzadeh, Hadi
    Pourkarimi, Latif
    [J]. 4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2018, 16 (03): : 261 - 294
  • [8] Quasi-Newton methods for multiobjective optimization problems
    Vahid Morovati
    Hadi Basirzadeh
    Latif Pourkarimi
    [J]. 4OR, 2018, 16 : 261 - 294
  • [9] A Quasi-Newton Approach to Nonsmooth Convex Optimization Problems in Machine Learning
    Yu, Jin
    Vishwanathan, S. V. N.
    Guenter, Simon
    Schraudolph, Nicol N.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 1145 - 1200
  • [10] Nonsmooth multiobjective programming with quasi-Newton methods
    Qu, Shaojian
    Liu, Chen
    Goh, Mark
    Li, Yijun
    Ji, Ying
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2014, 235 (03) : 503 - 510