Deterministic Neural Networks Optimization from a Continuous and Energy Point of View

被引:0
|
作者
Bensaid, Bilel [1 ,2 ]
Poette, Gael [2 ]
Turpault, Rodolphe [1 ]
机构
[1] Univ Bordeaux, Inst Math Bordeaux IMB, CNRS, Bordeaux INP, F-33405 Talence, France
[2] CEA, CESTA, DAM, F-33114 Le Barp, France
关键词
Neural Networks; Non-convex optimization; ODEs; Lyapunov stability; Adaptive scheme; Machine Learning;
D O I
10.1007/s10915-023-02215-4
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Getting an efficient neural network can be a very difficult task for engineers and researchers because of the huge number of hyperparameters to tune and their interconnections. To make the tuning step easier and more understandable, this work focuses on probably one of the most important leverage to improve Neural Networks efficiency: the optimizer. These recent years, a great number of algorithms have been developed but they need an accurate tuning to be efficient. To get rid of this long and experimental step, we are looking for generic and desirable properties for non-convex optimization. For this purpose, the optimizers are reinterpreted or analyzed as a discretization of a continuous dynamical system. This continuous framework offers many mathematical tools in order to interpret the sensitivity of the optimizer with respect to the initial guess such as Lyapunov stability. By enforcing the discrete decrease of Lyapunov functionals, new robust and efficient optimizers are designed. They also considerably simplify the tuning of hyperparameters (learning rate, momentum etc.). These Lyapunov based algorithms outperform several state of the art optimizers on different benchmarks of the literature. Drawing its inspiration from the numerical analysis of PDEs, this paper emphasizes the essential role of some hidden energy/entropy quantities for machine learning tasks.
引用
收藏
页数:41
相关论文
共 50 条
  • [1] Deterministic Neural Networks Optimization from a Continuous and Energy Point of View
    Bilel Bensaid
    Gaël Poëtte
    Rodolphe Turpault
    Journal of Scientific Computing, 2023, 96
  • [2] METEOROLOGICAL NETWORKS OPTIMIZATION FROM A STATISTICAL POINT OF VIEW
    DERMEGREDITCHIAN, G
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1990, 9 (01) : 57 - 75
  • [3] Deterministic Global Optimization with Artificial Neural Networks Embedded
    Schweidtmann, Artur M.
    Mitsos, Alexander
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2019, 180 (03) : 925 - 948
  • [4] Deterministic Global Optimization with Artificial Neural Networks Embedded
    Artur M. Schweidtmann
    Alexander Mitsos
    Journal of Optimization Theory and Applications, 2019, 180 : 925 - 948
  • [5] Neural networks applications in economics: A statistical point of view
    Lauro, NC
    Davino, C
    Vistocco, D
    NEURAL NETS - WIRN VIETRI-99, 1999, : 357 - 375
  • [6] A DETERMINISTIC APPROACH IN THE OPTIMIZATION OF A NUCLEAR-POWER SYSTEM FROM FUEL RESOURCE POINT-OF-VIEW
    URSU, I
    PAVELESCU, M
    TUTURICI, M
    INTERNATIONAL JOURNAL OF ENERGY RESEARCH, 1986, 10 (01) : 11 - 27
  • [7] Optimization of Highway Engineering Decisions from the Energy Economy Point of View.
    Breiter, Bernhard
    Bitumen, 1987, 49 (02): : 50 - 57
  • [8] A Dynamical View on Optimization Algorithms of Overparameterized Neural Networks
    Bu, Zhiqi
    Xu, Shiyun
    Chen, Kan
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [9] Deterministic global optimization with a neighbourhood determination algorithm based on neural networks
    Sun, WT
    Shu, JW
    Zheng, WM
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 1, PROCEEDINGS, 2005, 3496 : 700 - 705
  • [10] ANALYZING THE DYNAMICS OF DETERMINISTIC SYSTEMS FROM A HYPERGRAPH THEORETICAL POINT OF VIEW
    Torres, Luis M.
    Wagler, Annegret K.
    RAIRO-OPERATIONS RESEARCH, 2013, 47 (03) : 321 - 330