Improved Variance Reduction Methods for Riemannian Non-Convex Optimization

被引:4
|
作者
Han, Andi [1 ]
Gao, Junbin [1 ]
机构
[1] Univ Sydney, Business Sch, Discipline Business Analyt, Sydney, NSW 2006, Australia
基金
澳大利亚研究理事会;
关键词
Complexity theory; Optimization; Manifolds; Convergence; Convex functions; Training; Principal component analysis; Riemannian optimization; non-convex optimization; online optimization; variance reduction; batch size adaptation;
D O I
10.1109/TPAMI.2021.3112139
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Variance reduction is popular in accelerating gradient descent and stochastic gradient descent for optimization problems defined on both euclidean space and Riemannian manifold. This paper further improves on existing variance reduction methods for non-convex Riemannian optimization, including R-SVRG and R-SRG/R-SPIDER by providing a unified framework for batch size adaptation. Such framework is more general than the existing works by considering retraction and vector transport and mini-batch stochastic gradients. We show that the adaptive-batch variance reduction methods require lower gradient complexities for both general non-convex and gradient dominated functions, under both finite-sum and online optimization settings. Moreover, under the new framework, we complete the analysis of R-SVRG and R-SRG, which is currently missing in the literature. We prove convergence of R-SVRG with much simpler analysis, which leads to curvature-free complexity bounds. We also show improved results for R-SRG under double-loop convergence, which match the optimal complexities as the R-SPIDER. In addition, we prove the first online complexity results for R-SVRG and R-SRG. Lastly, we discuss the potential of adapting batch size for non-smooth, constrained and second-order Riemannian optimizers. Extensive experiments on a variety of applications support the analysis and claims in the paper.
引用
下载
收藏
页码:7610 / 7623
页数:14
相关论文
共 50 条
  • [21] DUALITY IN NON-CONVEX OPTIMIZATION
    TOLAND, JF
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1978, 66 (02) : 399 - 415
  • [22] A Variance Controlled Stochastic Method with Biased Estimation for Faster Non-convex Optimization
    Bi, Jia
    Gunn, Steve R.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 135 - 150
  • [23] A Hybrid Variance-Reduced Method for Decentralized Stochastic Non-Convex Optimization
    Xin, Ran
    Khan, Usman A.
    Kar, Soummya
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [25] Improved Harris hawks optimization for non-convex function optimization and design optimization problems
    Kang, Helei
    Liu, Renyun
    Yao, Yifei
    Yu, Fanhua
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2023, 204 : 619 - 639
  • [26] Swarm based mean-variance mapping optimization for convex and non-convex economic dispatch problems
    T. H. Khoa
    P. M. Vasant
    M. S. Balbir Singh
    V. N. Dieu
    Memetic Computing, 2017, 9 : 91 - 108
  • [27] Swarm based mean-variance mapping optimization for convex and non-convex economic dispatch problems
    Khoa, T. H.
    Vasant, P. M.
    Singh, M. S. Balbir
    Dieu, V. N.
    MEMETIC COMPUTING, 2017, 9 (02) : 91 - 108
  • [28] An Improved Convergence Analysis for Decentralized Online Stochastic Non-Convex Optimization
    Xin, Ran
    Khan, Usman A.
    Kar, Soummya
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 1842 - 1858
  • [29] Inexact Proximal Gradient Methods for Non-Convex and Non-Smooth Optimization
    Gu, Bin
    Wang, De
    Huo, Zhouyuan
    Huang, Heng
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3093 - 3100
  • [30] Improved Particle Swarm Optimization for Non-convex optimal power flow
    Xia Shiwei
    Bai Xuefeng
    Guo Zhizhong
    Xu Ying
    2012 ASIA-PACIFIC POWER AND ENERGY ENGINEERING CONFERENCE (APPEEC), 2012,