Difference-enhanced adaptive momentum methods for non-convex stochastic optimization in image classification

被引:0
|
作者
Ouyang, Chen [1 ,2 ]
Jian, Ailun [3 ]
Zhao, Xiong [1 ,2 ]
Yuan, Gonglin [1 ,2 ]
机构
[1] Guangxi Univ, Sch Math & Informat Sci, Nanning 530004, Guangxi, Peoples R China
[2] Guangxi Univ, Ctr Appl Math Guangxi Guangxi Univ, Nanning 530004, Guangxi, Peoples R China
[3] Hangzhou Dianzi Univ, Sch Sci, Hangzhou 310027, Zhejiang, Peoples R China
关键词
Adaptive momentum methods; Non-convex; Deep learning; Image classification; CONJUGATE-GRADIENT METHOD; ALGORITHMS; DESCENT;
D O I
10.1016/j.dsp.2025.105118
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Stochastic gradient descent with momentum (SGDM) is a classic optimization method that determines the update direction using a moving average of the gradient over historical steps. However, SGDM suffers from slow convergence. In 2022, Yuan et al. [6] proposed stochastic gradient descent with momentum and difference (SGDMD), which incorporates the concept of differences to adjust the convergence direction and accelerate the optimization process. Despite its improvements, SGDMD requires careful parameter tuning and is prone to oscillations due to the difference mechanism. In this work, we introduce a new momentum method: stochastic gradient descent with adaptive momentum and difference (SGDAMD). Compared to SGDMD, SGDAMD demonstrates superior performance in experiments, achieving greater stability in terms of both loss values and accuracy in deep learning image classification tasks. Additionally, SGDAMD attains a sublinear convergence rate in non-convex settings while requiring less restrictive assumptions than standard smoothness conditions. These features underscore the algorithm's efficiency and effectiveness in addressing complex optimization challenges.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] A STOCHASTIC APPROACH TO THE CONVEX OPTIMIZATION OF NON-CONVEX DISCRETE ENERGY SYSTEMS
    Burger, Eric M.
    Moura, Scott J.
    PROCEEDINGS OF THE ASME 10TH ANNUAL DYNAMIC SYSTEMS AND CONTROL CONFERENCE, 2017, VOL 3, 2017,
  • [22] SOLVING A CLASS OF NON-CONVEX MIN-MAX GAMES USING ADAPTIVE MOMENTUM METHODS
    Barazandeh, Babak
    Tarzanagh, Davoud Ataee
    Michailidis, George
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3625 - 3629
  • [23] mPage: Probabilistic Gradient Estimator With Momentum for Non-Convex Optimization
    Liang, Yuqing
    Su, Hui
    Liu, Jinlan
    Xu, Dongpo
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 1375 - 1386
  • [24] Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds
    Zhou, Pan
    Yuan, Xiao-Tong
    Feng, Jiashi
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 138 - 147
  • [25] Faster First-Order Methods for Stochastic Non-Convex Optimization on Riemannian Manifolds
    Zhou, Pan
    Yuan, Xiao-Tong
    Yan, Shuicheng
    Feng, Jiashi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (02) : 459 - 472
  • [26] Convex and non-convex adaptive TV regularizations for color image restoration
    Wang, Xinv
    Ma, Mingxi
    Lu, Jingjing
    Zhang, Jun
    COMPUTATIONAL & APPLIED MATHEMATICS, 2024, 43 (01):
  • [27] Convex and non-convex adaptive TV regularizations for color image restoration
    Xinv Wang
    Mingxi Ma
    Jingjing Lu
    Jun Zhang
    Computational and Applied Mathematics, 2024, 43
  • [28] Efficient Convex Optimization for Non-convex Non-smooth Image Restoration
    Li, Xinyi
    Yuan, Jing
    Tai, Xue-Cheng
    Liu, Sanyang
    JOURNAL OF SCIENTIFIC COMPUTING, 2024, 99 (02)
  • [29] Equilibrated adaptive learning rates for non-convex optimization
    Dauphin, Yann N.
    de Vries, Harm
    Bengio, Yoshua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [30] A non-convex adaptive regularization approach to binary optimization
    Cerone, V
    Fosson, S. M.
    Regruto, D.
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 3844 - 3849