A Deep Learning Optimizer Based on Grunwald-Letnikov Fractional Order Definition

被引:8
|
作者
Zhou, Xiaojun [1 ]
Zhao, Chunna [1 ]
Huang, Yaqun [1 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650500, Peoples R China
基金
中国国家自然科学基金;
关键词
deep learning optimizer; stochastic gradient descent; fractional order; Adam; time series prediction; STOCHASTIC GRADIENT DESCENT; MOMENTUM;
D O I
10.3390/math11020316
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In this paper, a deep learning optimization algorithm is proposed, which is based on the Grunwald-Letnikov (G-L) fractional order definition. An optimizer fractional calculus gradient descent based on the G-L fractional order definition (FCGD_G-L) is designed. Using the short-memory effect of the G-L fractional order definition, the derivation only needs 10 time steps. At the same time, via the transforming formula of the G-L fractional order definition, the Gamma function is eliminated. Thereby, it can achieve the unification of the fractional order and integer order in FCGD_G-L. To prevent the parameters falling into local optimum, a small disturbance is added in the unfolding process. According to the stochastic gradient descent (SGD) and Adam, two optimizers' fractional calculus stochastic gradient descent based on the G-L definition (FCSGD_G-L), and the fractional calculus Adam based on the G-L definition (FCAdam_G-L), are obtained. These optimizers are validated on two time series prediction tasks. With the analysis of train loss, related experiments show that FCGD_G-L has the faster convergence speed and better convergence accuracy than the conventional integer order optimizer. Because of the fractional order property, the optimizer exhibits stronger robustness and generalization ability. Through the test sets, using the saved optimal model to evaluate, FCGD_G-L also shows a better evaluation effect than the conventional integer order optimizer.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Higher Order Time Stepping Methods for Subdiffusion Problems Based on Weighted and Shifted Grunwald-Letnikov Formulae with Nonsmooth Data
    Wang, Yanyong
    Yan, Yuyuan
    Yan, Yubin
    Pani, Amiya K.
    JOURNAL OF SCIENTIFIC COMPUTING, 2020, 83 (03)
  • [42] Sharp error estimate of Grunwald-Letnikov scheme for a multi-term time fractional diffusion equation
    Cao, Dewei
    Chen, Hu
    ADVANCES IN COMPUTATIONAL MATHEMATICS, 2022, 48 (06)
  • [43] Stability Analysis and Nonstandard Grunwald-Letnikov Scheme for a Fractional Order Predator-Prey Model with Ratio-Dependent Functional Response
    Suryanto, Agus
    Darti, Isnani
    INTERNATIONAL CONFERENCE AND WORKSHOP ON MATHEMATICAL ANALYSIS AND ITS APPLICATIONS (ICWOMAA 2017), 2017, 1913
  • [44] A Comparative Analysis of Laguerre-Based Approximators to the Grunwald Letnikov Fractional-Order Difference
    Stanislawski, Rafal
    Latawiec, Krzysztof J.
    Lukaniszyn, Marian
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [45] Non integer order, discrete, state space model of heat transfer process using Grunwald-Letnikov operator
    Oprzedkiewicz, K.
    Dziedzic, K.
    Wieckowski, L.
    BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2019, 67 (05) : 905 - 914
  • [46] Physics-preserving averaging scheme based on Grunwald-Letnikov formula for gas flow in fractured media
    Amir, Sahar Z.
    Sun, Shuyu
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2018, 163 : 616 - 639
  • [47] Discrete fracmemristor-based chaotic map by Grunwald-Letnikov difference and its circuit implementation
    Peng, Yuexi
    Liu, Jun
    He, Shaobo
    Sun, Kehui
    CHAOS SOLITONS & FRACTALS, 2023, 171
  • [48] A Grunwald–Letnikov based Manta ray foraging optimizer for global optimization and image segmentation
    Abd Elaziz, Mohamed
    Yousri, Dalia
    Al-qaness, Mohammed A.A.
    AbdelAty, Amr M.
    Radwan, Ahmed G.
    Ewees, Ahmed A.
    Engineering Applications of Artificial Intelligence, 2021, 98
  • [49] Modeling and identification of fractional first-order systems with Laguerre-Grunwald-Letnikov fractional-order differences
    Stanislawski, Rafal
    Latawiec, Krzysztof J.
    Lukaniszyn, Marian
    Czuczwara, Wojciech
    Kopka, Ryszard
    2016 21ST INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2016, : 174 - 177
  • [50] Three-dimensional lattice models with long-range interactions of Grunwald-Letnikov type for fractional generalization of gradient elasticity
    Tarasov, Vasily E.
    MECCANICA, 2016, 51 (01) : 125 - 138