Self-Concordant Analysis of Frank-Wolfe Algorithms

被引:0
|
作者
Dvurechensky, Pavel [1 ,2 ,3 ]
Ostroukhov, Petr [4 ]
Safin, Kamil [4 ]
Shtern, Shimrit [5 ]
Staudigl, Mathias [6 ]
机构
[1] Weierstrass Inst Appl Anal & Stochast, Berlin, Germany
[2] Natl Res Univ Higher Sch Econ, Moscow, Russia
[3] Inst Informat Transmiss Problems RAS, Moscow, Russia
[4] Moscow Inst Phys & Technol, Dolgoprudnyi, Russia
[5] Technion Israel Inst Technol, Haifa, Israel
[6] Maastricht Univ, Dept Data Sci & Knowledge Engn, Maastricht, Netherlands
关键词
OPTIMIZATION; CONVERGENCE; COMPLEXITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Projection-free optimization via different variants of the Frank-Wolfe (FW), a.k.a. Conditional Gradient method has become one of the cornerstones in optimization for machine learning since in many cases the linear minimization oracle is much cheaper to implement than projections and some sparsity needs to be preserved. In a number of applications, e.g. Poisson inverse problems or quantum state tomography, the loss is given by a self-concordant (SC) function having unbounded curvature, implying absence of theoretical guarantees for the existing FW methods. We use the theory of SC functions to provide a new adaptive step size for FW methods and prove global convergence rate O(1/k) after k iterations. If the problem admits a stronger local linear minimization oracle, we construct a novel FW method with linear convergence rate for SC functions.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] A GENERALIZATION OF THE FRANK-WOLFE THEOREM
    PEROLD, AF
    MATHEMATICAL PROGRAMMING, 1980, 18 (02) : 215 - 227
  • [22] CCCP is Frank-Wolfe in disguise
    Yurtsever, Alp
    Sra, Suvrit
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [23] On Extensions of the Frank-Wolfe Theorems
    Zhi-Quan Luo
    Shuzhong Zhang
    Computational Optimization and Applications, 1999, 13 : 87 - 110
  • [24] On Frank-Wolfe and Equilibrium Computation
    Abernethy, Jacob
    Wang, Jun-Kun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [25] On extensions of the Frank-Wolfe theorems
    Luo, ZQ
    Zhang, SZ
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 1999, 13 (1-3) : 87 - 110
  • [26] Frank-Wolfe with Subsampling Oracle
    Kerdreux, Thomas
    Pedregosa, Fabian
    d'Aspremont, Alexandre
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [27] Asynchronous Stochastic Frank-Wolfe Algorithms for Non-Convex Optimization
    Gu, Bin
    Xian, Wenhan
    Huang, Heng
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 737 - 743
  • [28] Zeroth and First Order Stochastic Frank-Wolfe Algorithms for Constrained Optimization
    Akhtar, Zeeshan
    Rajawat, Ketan
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 2119 - 2135
  • [29] Self-concordant analysis for logistic regression
    Bach, Francis
    ELECTRONIC JOURNAL OF STATISTICS, 2010, 4 : 384 - 414
  • [30] One Sample Stochastic Frank-Wolfe
    Zhang, Mingrui
    Shen, Zebang
    Mokhtari, Aryan
    Hassani, Hamed
    Karbasi, Amin
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4012 - 4022