A fractional-order momentum optimization approach of deep neural networks

Authors
ZhongLiang Yu
Guanghui Sun
Jianfeng Lv
Affiliation
[1] Harbin Institute of Technology, Department of Control Science and Engineering
Source
Neural Computing & Applications, 2022, 34 (09)
Keywords
Deep neural networks; Optimization; Gradient descent; Fractional-order; Residual network; Image classification;
DOI
Not available
Abstract
Developing general-purpose, efficient optimization algorithms is an important research direction for neural networks. Stochastic Gradient Descent with Momentum (SGDM) is one of the most successful optimization algorithms, but it easily falls into local minima. Inspired by the success of fractional-order calculus in automatic control, we propose a fractional-order method named Fractional-Order Momentum (FracM). As a natural extension of integer-order calculus, fractional-order calculus inherits almost all the characteristics of integer-order calculus and additionally exhibits memory and nonlocality. FracM applies a fractional-order difference to the momentum and gradient in the SGDM algorithm. FracM can partially mitigate the problem of getting trapped in local minima and accelerates the training process. In terms of classification accuracy, the proposed FracM method is comparable to SGDM, Adam, and other advanced optimization algorithms. Experiments show that FracM outperforms other optimizers on CIFAR10/100 and on the text dataset IMDB with transformer-based models.
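The abstract does not state the FracM update rule, so the following is only a minimal sketch of the general idea of a fractional-order difference applied to momentum. It assumes the standard Grünwald-Letnikov (GL) weights and a truncated velocity history. The function names (`gl_coefficients`, `fractional_momentum_minimize`), the parameters (`alpha`, `mu`, `memory`), and the specific update form are illustrative assumptions, not the authors' algorithm.

```python
from collections import deque


def gl_coefficients(alpha, n_terms):
    """Grünwald-Letnikov weights w_k = (-1)^k * C(alpha, k) for k = 0..n_terms-1.

    Uses the standard recurrence w_0 = 1, w_k = w_{k-1} * (1 - (alpha + 1) / k).
    """
    weights = [1.0]
    for k in range(1, n_terms):
        weights.append(weights[-1] * (1.0 - (alpha + 1.0) / k))
    return weights


def fractional_momentum_minimize(grad_fn, x0, alpha=0.8, lr=0.1, mu=0.5,
                                 memory=10, steps=200):
    """Toy scalar optimizer with a fractional-order memory of past velocities.

    ASSUMPTION: this update form is hypothetical and chosen for illustration.
    With alpha = 1 the weights collapse to a single previous velocity, so the
    loop reduces to ordinary heavy-ball momentum (SGDM without stochasticity).
    For 0 < alpha < 1 the velocity depends on up to `memory` earlier velocities
    with slowly decaying weights, which is the "memory" property of
    fractional-order calculus.
    """
    weights = gl_coefficients(alpha, memory + 1)
    history = deque(maxlen=memory)  # most recent velocity first
    x = x0
    for _ in range(steps):
        # -w_k is non-negative for k >= 1 when 0 < alpha <= 1
        memory_term = -sum(w * v for w, v in zip(weights[1:], history))
        velocity = mu * memory_term + grad_fn(x)
        history.appendleft(velocity)
        x = x - lr * velocity
    return x


if __name__ == "__main__":
    # Minimize f(x) = (x - 3)^2, whose gradient is 2 * (x - 3).
    print(fractional_momentum_minimize(lambda x: 2.0 * (x - 3.0), x0=0.0))
```

Setting `alpha=1.0` gives a quick sanity check against plain momentum, while smaller values of `alpha` spread the weight over a longer history. Any comparison with the results reported in the abstract would require the update rule from the paper itself.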
Pages: 7091-7111
Page count: 20
Citation
Yu, ZhongLiang; Sun, Guanghui; Lv, Jianfeng. A fractional-order momentum optimization approach of deep neural networks [J]. Neural Computing & Applications, 2022, 34 (09): 7091-7111