Deterministic Neural Networks Optimization from a Continuous and Energy Point of View

被引:0
|
作者
Bensaid, Bilel [1 ,2 ]
Poette, Gael [2 ]
Turpault, Rodolphe [1 ]
机构
[1] Univ Bordeaux, Inst Math Bordeaux IMB, CNRS, Bordeaux INP, F-33405 Talence, France
[2] CEA, CESTA, DAM, F-33114 Le Barp, France
关键词
Neural Networks; Non-convex optimization; ODEs; Lyapunov stability; Adaptive scheme; Machine Learning;
D O I
10.1007/s10915-023-02215-4
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Getting an efficient neural network can be a very difficult task for engineers and researchers because of the huge number of hyperparameters to tune and their interconnections. To make the tuning step easier and more understandable, this work focuses on probably one of the most important leverage to improve Neural Networks efficiency: the optimizer. These recent years, a great number of algorithms have been developed but they need an accurate tuning to be efficient. To get rid of this long and experimental step, we are looking for generic and desirable properties for non-convex optimization. For this purpose, the optimizers are reinterpreted or analyzed as a discretization of a continuous dynamical system. This continuous framework offers many mathematical tools in order to interpret the sensitivity of the optimizer with respect to the initial guess such as Lyapunov stability. By enforcing the discrete decrease of Lyapunov functionals, new robust and efficient optimizers are designed. They also considerably simplify the tuning of hyperparameters (learning rate, momentum etc.). These Lyapunov based algorithms outperform several state of the art optimizers on different benchmarks of the literature. Drawing its inspiration from the numerical analysis of PDEs, this paper emphasizes the essential role of some hidden energy/entropy quantities for machine learning tasks.
引用
收藏
页数:41
相关论文
共 50 条
  • [31] Factors Influencing the Threats for Urban Energy Networks: The Inhabitants' Point of View
    Cabelkova, Inna
    Strielkowski, Wadim
    Wende, Frank-Detlef
    Krayneva, Raisa
    ENERGIES, 2020, 13 (21)
  • [32] The Expressive Power of Neural Networks: A View from the Width
    Lu, Zhou
    Pu, Hongming
    Wang, Feicheng
    Hu, Zhiqiang
    Wang, Liwei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [33] COUPLING OF ENERGY AND HEAT FROM THE MODERN POINT OF VIEW
    WALTHER, F
    MONATSSCHRIFT FUR BRAUEREI, 1978, 31 (12): : 462 - 466
  • [34] Energy and sustainability, from the point of view of environmental physics
    Tomkiewicz, Micha
    MRS Energy and Sustainability, 2015, 2 (01):
  • [36] The maintenance of energy from the point of view of engineers.
    Kammerer
    ZEITSCHRIFT DES VEREINES DEUTSCHER INGENIEURE, 1901, 45 : 1750 - 1754
  • [37] A COMPARISON OF DESALTING PROCESSES FROM AN ENERGY POINT OF VIEW
    MORRIS, M
    DESALINATION, 1982, 40 (03) : 237 - 244
  • [38] Energy and sustainability, from the point of view of environmental physics
    Tomkiewicz M.
    MRS Energy and Sustainability - A Review Journal, 2015, 2 (1):
  • [39] The conservation of energy from the engineers point of view.
    Kammerer, O
    PHYSIKALISCHE ZEITSCHRIFT, 1901, 3 : 70 - 76