Trivializations for Gradient-Based Optimization on Manifolds

被引:0
|
作者
Lezcano-Casado, Mario [1 ]
机构
[1] Univ Oxford, Dept Math, Oxford, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a framework to study the transformation of problems with manifold constraints into unconstrained problems through parametrizations in terms of a Euclidean space. We call these parametrizations trivializations. We prove conditions under which a trivialization is sound in the context of gradient-based optimization and we show how two large families of trivializations have overall favorable properties, but also suffer from a performance issue. We then introduce dynamic trivializations, which solve this problem, and we show how these form a family of optimization methods that lie between trivializations and Riemannian gradient descent, and combine the benefits of both of them. We then show how to implement these two families of trivializations in practice for different matrix manifolds. To this end, we prove a formula for the gradient of the exponential of matrices, which can be of practical interest on its own. Finally, we show how dynamic trivializations improve the performance of existing methods on standard tasks designed to test long-term memory within neural networks.(1)
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Gradient-based optimization of hyperparameters
    Bengio, Y
    [J]. NEURAL COMPUTATION, 2000, 12 (08) : 1889 - 1900
  • [2] Gradient-based simulation optimization
    Kim, Sujin
    [J]. PROCEEDINGS OF THE 2006 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2006, : 159 - 167
  • [3] Gradient-based learning and optimization
    Cao, XR
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2003, : 3 - 7
  • [4] A gradient-based direct aperture optimization
    Yang, Jie
    Zhang, Pengcheng
    Zhang, Liyuan
    Gui, Zhiguo
    [J]. Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2018, 35 (03): : 358 - 367
  • [5] A skeletonization algorithm for gradient-based optimization
    Menten, Martin J.
    Paetzold, Johannes C.
    Zimmer, Veronika A.
    Shit, Suprosanna
    Ezhov, Ivan
    Holland, Robbie
    Probst, Monika
    Schnabel, Julia A.
    Rueckert, Daniel
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21337 - 21346
  • [6] Catalyst for Gradient-based Nonconvex Optimization
    Paquette, Courtney
    Lin, Hongzhou
    Drusvyatskiy, Dmitriy
    Mairal, Julien
    Harchaoui, Zaid
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [7] ON THE ADAPTIVITY OF STOCHASTIC GRADIENT-BASED OPTIMIZATION
    Lei, Lihua
    Jordan, Michael I.
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2020, 30 (02) : 1473 - 1500
  • [8] A Gradient-Based Optimization Algorithm for LASSO
    Kim, Jinseog
    Kim, Yuwon
    Kim, Yongdai
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2008, 17 (04) : 994 - 1009
  • [9] Gradient-Based Multiobjective Optimization with Uncertainties
    Peitz, Sebastian
    Dellnitz, Michael
    [J]. NEO 2016: RESULTS OF THE NUMERICAL AND EVOLUTIONARY OPTIMIZATION WORKSHOP NEO 2016 AND THE NEO CITIES 2016 WORKSHOP, 2018, 731 : 159 - 182
  • [10] Gradient-based optimization for quantum architecture search
    He, Zhimin
    Wei, Jiachun
    Chen, Chuangtao
    Huang, Zhiming
    Situ, Haozhen
    Li, Lvzhou
    [J]. NEURAL NETWORKS, 2024, 179