Adaptive Levenberg-Marquardt Algorithm: A New Optimization Strategy for Levenberg-Marquardt Neural Networks

被引:32
|
作者
Yan, Zhiqi [1 ]
Zhong, Shisheng [1 ]
Lin, Lin [1 ]
Cui, Zhiquan [1 ]
机构
[1] Harbin Inst Technol, Dept Mech Engn, Harbin 150000, Peoples R China
基金
中国国家自然科学基金;
关键词
Levenberg-Marquardt algorithm; convergence; neural networks; local minima; optimization; CONVERGENCE; SYSTEMS; NEURONS;
D O I
10.3390/math9172176
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Engineering data are often highly nonlinear and contain high-frequency noise, so the Levenberg-Marquardt (LM) algorithm may not converge when a neural network optimized by the algorithm is trained with engineering data. In this work, we analyzed the reasons for the LM neural network's poor convergence commonly associated with the LM algorithm. Specifically, the effects of different activation functions such as Sigmoid, Tanh, Rectified Linear Unit (RELU) and Parametric Rectified Linear Unit (PRLU) were evaluated on the general performance of LM neural networks, and special values of LM neural network parameters were found that could make the LM algorithm converge poorly. We proposed an adaptive LM (AdaLM) algorithm to solve the problem of the LM algorithm. The algorithm coordinates the descent direction and the descent step by the iteration number, which can prevent falling into the local minimum value and avoid the influence of the parameter state of LM neural networks. We compared the AdaLM algorithm with the traditional LM algorithm and its variants in terms of accuracy and speed in the context of testing common datasets and aero-engine data, and the results verified the effectiveness of the AdaLM algorithm.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Adaptive predistortions based on neural networks associated with Levenberg-Marquardt algorithm for satellite down links
    Zayani, Rafik
    Bouallegue, Ridha
    Roviras, Daniel
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2008, 2008 (1)
  • [42] ON THE COMPLEXITY OF A STOCHASTIC LEVENBERG-MARQUARDT METHOD
    Shao, Weiyi
    Fan, Jinyan
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2024, 20 (03) : 1011 - 1027
  • [43] Adaptive Momentum Levenberg-Marquardt RBF for Face Recognition
    Ch'ng, Sue Inn
    Seng, Kah Phooi
    Ang, Li-Minn
    2012 IEEE INTERNATIONAL CONFERENCE ON CIRCUITS AND SYSTEMS (ICCAS), 2012, : 126 - 131
  • [44] A Levenberg-Marquardt method with approximate projections
    Behling, R.
    Fischer, A.
    Herrich, M.
    Iusem, A.
    Ye, Y.
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2014, 59 (1-2) : 5 - 26
  • [45] Improvement of magnetometer calibration using Levenberg-Marquardt algorithm
    Pang, Hongfeng
    Chen, Dixiang
    Pan, Mengchun
    Luo, Shitu
    Zhang, Qi
    Li, Ji
    Luo, Feilu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2014, 9 (03) : 324 - 328
  • [46] Application of Levenberg-Marquardt algorithm in the Brillouin spectrum fitting
    Zhang, Chuankai
    Yang, Yuanhong
    Li, Anqi
    SEVENTH INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION AND CONTROL TECHNOLOGY: OPTOELECTRONIC TECHNOLOGY AND INSTUMENTS, CONTROL THEORY AND AUTOMATION, AND SPACE EXPLORATION, 2008, 7129
  • [47] A smoothing Levenberg-Marquardt method for NCP
    Zhang, Ju-liang
    Zhang, Xiangsun
    APPLIED MATHEMATICS AND COMPUTATION, 2006, 178 (02) : 212 - 228
  • [48] A New Cuckoo Search Based Levenberg-Marquardt (CSLM) Algorithm
    Nawi, Nazri Mohd
    Khan, Abdullah
    Rehman, Mohammad Zubair
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, PT I, 2013, 7971 : 438 - 451
  • [49] Improved Computation for Levenberg-Marquardt Training
    Wilamowski, Bogdan M.
    Yu, Hao
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2010, 21 (06): : 930 - 937
  • [50] On the convergence properties of the Levenberg-Marquardt method
    Zhang, JL
    OPTIMIZATION, 2003, 52 (06) : 739 - 756