Optimization with Momentum: Dynamical, Control-Theoretic, and Symplectic Perspectives

Cited by: 0
Authors
Muehlebach, Michael [1]
Jordan, Michael I. [1]
Affiliations
[1] Univ Calif Berkeley, Dept Stat, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Keywords
Gradient-based optimization; convergence rate analysis; Nesterov acceleration; symplectic integration; nonconvex optimization
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We analyze the convergence rate of various momentum-based optimization algorithms from a dynamical systems point of view. Our analysis exploits fundamental topological properties, such as the continuous dependence of iterates on their initial conditions, to provide a simple characterization of convergence rates. In many cases, closed-form expressions are obtained that relate algorithm parameters to the convergence rate. The analysis encompasses discrete time and continuous time, as well as time-invariant and time-variant formulations, and is not limited to a convex or Euclidean setting. In addition, the article rigorously establishes why symplectic discretization schemes are important for momentum-based optimization algorithms, and provides a characterization of algorithms that exhibit accelerated convergence.
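The abstract's remark on symplectic discretization can be illustrated with a minimal sketch, which is not taken from the article. It assumes the damped momentum dynamics q' = p, p' = -∇f(q) - 2dp on a hypothetical one-dimensional quadratic f(q) = 0.5 λ q² with λ = 100, and compares explicit Euler with semi-implicit Euler (a scheme that is symplectic for the undamped part of the dynamics). All function names, the objective, and the parameter values h = 0.1 and d = 1 are assumptions chosen for illustration.

```python
import math

def grad_f(q, lam=100.0):
    # Gradient of the illustrative objective f(q) = 0.5 * lam * q**2 (an assumption).
    return lam * q

def explicit_euler(q, p, h, d, steps):
    # Both updates use the old state (q_k, p_k); the tuple assignment
    # evaluates the right-hand side before rebinding q and p.
    for _ in range(steps):
        q, p = q + h * p, p + h * (-grad_f(q) - 2.0 * d * p)
    return q, p

def semi_implicit_euler(q, p, h, d, steps):
    # Momentum is updated first; the position update then uses the new momentum.
    for _ in range(steps):
        p = p + h * (-grad_f(q) - 2.0 * d * p)
        q = q + h * p
    return q, p

# Same step size, damping, and initial condition for both schemes.
q_e, p_e = explicit_euler(1.0, 0.0, h=0.1, d=1.0, steps=200)
q_s, p_s = semi_implicit_euler(1.0, 0.0, h=0.1, d=1.0, steps=200)

print("explicit Euler      |(q, p)| =", math.hypot(q_e, p_e))
print("semi-implicit Euler |(q, p)| =", math.hypot(q_s, p_s))
```

With these values the explicit iterates grow without bound while the semi-implicit iterates decay toward the minimizer. In this linear example the semi-implicit update matrix has determinant 1 - 2dh = 0.8 regardless of λ, whereas the explicit update matrix has determinant 1 - 2dh + h²λ = 1.8, which is why the latter cannot contract at this step size.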
Pages: 50
Related papers
50 in total
  • [1] Optimization with momentum: Dynamical, control-theoretic, and symplectic perspectives
    Muehlebach, Michael
    Jordan, Michael I.
    Journal of Machine Learning Research, 2021, 22
  • [2] Neuroscience out of control: control-theoretic perspectives on neural circuit dynamics
    Kao, Ta-Chu
    Hennequin, Guillaume
    Current Opinion in Neurobiology, 2019, 58 : 122 - 129
  • [3] A control-theoretic perspective on optimal high-order optimization
    Lin, Tianyi
    Jordan, Michael I.
    Mathematical Programming, 2022, 195 (1-2) : 929 - 975
  • [4] A control-theoretic perspective on optimal high-order optimization
    Tianyi Lin
    Michael I. Jordan
    Mathematical Programming, 2022, 195 : 929 - 975
  • [5] A Control-Theoretic View of Intelligent Control
    Kawaji, Shigeyasu
    Journal of Robotics and Mechatronics, 2000, 12 (06) : 605 - 613
  • [6] A Control-Theoretic Model of Atherosclerosis
    Formanowicz, Dorota
    Krawczyk, Jacek B.
    Perek, Bartlomiej
    Formanowicz, Piotr
    International Journal of Molecular Sciences, 2019, 20 (03)
  • [7] Control-Theoretic Data Smoothing
    Dey, Biswadip
    Krishnaprasad, P. S.
    2014 IEEE 53rd Annual Conference on Decision and Control (CDC), 2014 : 5064 - 5070
  • [8] A control-theoretic view on incentives
    Ho, Y. C.
    Luh, P. B.
    Olsder, G. J.
    Automatica, 1982, 18 (02) : 167 - 179
  • [9] Multiagent Coordination Optimization: A Control-Theoretic Perspective of Swarm Intelligence Algorithms
    Zhang, Haopeng
    Hui, Qing
    2013 IEEE Congress on Evolutionary Computation (CEC), 2013 : 3339 - 3346
  • [10] Control-theoretic models of environmental crime
    Cartee, Elliot
    Vladimirsky, Alexander
    SIAM Journal on Applied Mathematics, 2020, 80 (03) : 1441 - 1466