Riemannian Optimization via Frank-Wolfe Methods

被引:5
|
作者
Weber, Melanie [1 ,2 ]
Sra, Suvrit [3 ]
机构
[1] Univ Oxford, Math Inst, Oxford, England
[2] Princeton Univ, Princeton, NJ 08544 USA
[3] MIT, Lab Informat & Decis Syst, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
46N10; 15A24; 65K10; 49Q99; MINIMIZATION ALGORITHM; MATRIX; MANIFOLDS; GEOMETRY;
D O I
10.1007/s10107-022-01840-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We study projection-free methods for constrained Riemannian optimization. In particular, we propose a Riemannian Frank-Wolfe (RFW) method that handles constraints directly, in contrast to prior methods that rely on (potentially costly) projections. We analyze non-asymptotic convergence rates of RFW to an optimum for geodesically convex problems, and to a critical point for nonconvex objectives. We also present a practical setting under which RFW can attain a linear convergence rate. As a concrete example, we specialize RFW to the manifold of positive definite matrices and apply it to two tasks: (i) computing the matrix geometric mean (Riemannian centroid); and (ii) computing the Bures-Wasserstein barycenter. Both tasks involve geodesically convex interval constraints, for which we show that the Riemannian "linear" oracle required by RFW admits a closed form solution; this result may be of independent interest. We complement our theoretical results with an empirical comparison of RFW against state-of-the-art Riemannian optimization methods, and observe that RFW performs competitively on the task of computing Riemannian centroids.
引用
收藏
页码:525 / 556
页数:32
相关论文
共 50 条
  • [1] Riemannian Optimization via Frank-Wolfe Methods
    Melanie Weber
    Suvrit Sra
    Mathematical Programming, 2023, 199 : 525 - 556
  • [2] Stochastic Frank-Wolfe Methods for Nonconvex Optimization
    Reddi, Sashank J.
    Sra, Suvrit
    Poczos, Barnabas
    Smola, Alex
    2016 54TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2016, : 1244 - 1251
  • [3] Apprenticeship Learning via Frank-Wolfe
    Zahavy, Tom
    Cohen, Alon
    Kaplan, Haim
    Mansour, Yishay
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6720 - 6728
  • [4] TRAINING SUPPORT VECTOR MACHINES USING FRANK-WOLFE OPTIMIZATION METHODS
    Frandi, Emanuele
    Nanculef, Ricardo
    Gasparo, Maria Grazia
    Lodi, Stefano
    Sartori, Claudio
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2013, 27 (03)
  • [5] A Neural Network Implementation of Frank-Wolfe Optimization
    Bauckhage, Christian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2017, PT I, 2017, 10613 : 219 - 226
  • [6] On a Frank-Wolfe type theorem in cubic optimization
    Klatte, Diethard
    OPTIMIZATION, 2019, 68 (2-3) : 539 - 547
  • [7] Fast Pure Exploration via Frank-Wolfe
    Wang, Po-An
    Tzeng, Ruo-Chun
    Proutiere, Alexandre
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Restarting Frank-Wolfe
    Kerdreux, Thomas
    d'Aspremont, Alexandre
    Pokutta, Sebastian
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [9] Fast and scalable Lasso via stochastic Frank-Wolfe methods with a convergence guarantee
    Frandi, Emanuele
    Nanculef, Ricardo
    Lodi, Stefano
    Sartori, Claudio
    Suykens, Johan A. K.
    MACHINE LEARNING, 2016, 104 (2-3) : 195 - 221
  • [10] On the Global Linear Convergence of Frank-Wolfe Optimization Variants
    Lacoste-Julien, Simon
    Jaggi, Martin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28