Riemannian Optimization via Frank-Wolfe Methods

Cited by: 5

Authors:

Weber, Melanie ^{[1,2]}

Sra, Suvrit ^{[3]}

Affiliations:

[1] Univ Oxford, Math Inst, Oxford, England

[2] Princeton Univ, Princeton, NJ 08544 USA

[3] MIT, Lab Informat & Decis Syst, 77 Massachusetts Ave, Cambridge, MA 02139 USA

Source:

MATHEMATICAL PROGRAMMING | 2023 / Vol. 199 / Issue 1-2

Keywords:

46N10; 15A24; 65K10; 49Q99; MINIMIZATION ALGORITHM; MATRIX; MANIFOLDS; GEOMETRY;

DOI:

10.1007/s10107-022-01840-5

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

We study projection-free methods for constrained Riemannian optimization. In particular, we propose a Riemannian Frank-Wolfe (RFW) method that handles constraints directly, in contrast to prior methods that rely on (potentially costly) projections. We analyze non-asymptotic convergence rates of RFW to an optimum for geodesically convex problems, and to a critical point for nonconvex objectives. We also present a practical setting under which RFW can attain a linear convergence rate. As a concrete example, we specialize RFW to the manifold of positive definite matrices and apply it to two tasks: (i) computing the matrix geometric mean (Riemannian centroid); and (ii) computing the Bures-Wasserstein barycenter. Both tasks involve geodesically convex interval constraints, for which we show that the Riemannian "linear" oracle required by RFW admits a closed form solution; this result may be of independent interest. We complement our theoretical results with an empirical comparison of RFW against state-of-the-art Riemannian optimization methods, and observe that RFW performs competitively on the task of computing Riemannian centroids.
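The projection-free idea in the abstract can be illustrated with a scalar toy case. The sketch below is not the authors' implementation: it assumes the 1x1 case of the positive definite manifold (positive reals with metric <u, v>_x = uv / x^2), where the geodesic from x to z is x^(1-s) * z^s and the squared distance is (log x - log a)^2. The function name `rfw_scalar_geometric_mean`, the interval endpoints (harmonic and arithmetic means), and the step size 2/(k+2) are choices made for this illustration. In this setting the "linear" oracle reduces to picking an endpoint of the interval based on the sign of one number, which gives a feel for why a closed-form oracle makes each iteration cheap.

```python
import math


def rfw_scalar_geometric_mean(a, num_iters=2000):
    """Toy Frank-Wolfe-style iteration on the positive reals.

    Minimizes phi(x) = (1/n) * sum_i (log x - log a_i)^2 over the
    interval [harmonic mean, arithmetic mean] (an assumption made for
    this sketch). The minimizer is the geometric mean of `a`.
    """
    n = len(a)
    lower = n / sum(1.0 / ai for ai in a)  # harmonic mean
    upper = sum(a) / n                     # arithmetic mean
    x = upper                              # feasible starting point

    for k in range(num_iters):
        # Up to a positive factor, the inner product of the Riemannian
        # gradient with the log map toward z equals c * log(z / x).
        c = sum(math.log(x) - math.log(ai) for ai in a) / n

        # "Linear" oracle in closed form: minimize c * log(z / x) over
        # z in [lower, upper]; the minimizer is always an endpoint.
        z = lower if c > 0 else upper

        # Move along the geodesic from x to z; no projection is needed
        # because the geodesic between feasible points stays feasible.
        s = 2.0 / (k + 2)
        x = x ** (1.0 - s) * z ** s

    return x, lower, upper


if __name__ == "__main__":
    estimate, lo, hi = rfw_scalar_geometric_mean([1.0, 4.0, 16.0])
    print(f"estimate={estimate:.4f}, interval=[{lo:.4f}, {hi:.4f}]")
```

For the inputs 1, 4, 16 the geometric mean is 4, so the printed estimate should land near that value. With the 2/(k+2) step size the iterates approach the optimum only sublinearly, which is the generic Frank-Wolfe behavior rather than the linear rate the abstract mentions for special settings.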


Pages: 525-556

Page count: 32

