SALR: Sharpness-Aware Learning Rate Scheduler for Improved Generalization

Cited by: 0
Authors
Yue, Xubo [1]
Nouiehed, Maher [2]
Al Kontar, Raed [1]
Affiliations
[1] Univ Michigan, Dept Ind & Operat Engn, Ann Arbor, MI 48109 USA
[2] Amer Univ Beirut, Dept Ind Engn & Management, Beirut 1072020, Lebanon
Funding
U.S. National Science Foundation
Keywords
Schedules; Deep learning; Neural networks; Convergence; Bayes methods; Training; Stochastic processes; generalization; learning rate schedule; sharpness;
DOI
10.1109/TNNLS.2023.3263393
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In an effort to improve generalization in deep learning and automate the process of learning rate scheduling, we propose SALR: a sharpness-aware learning rate update technique designed to recover flat minimizers. Our method dynamically updates the learning rate of gradient-based optimizers based on the local sharpness of the loss function. This allows optimizers to automatically increase learning rates at sharp valleys to increase the chance of escaping them. We demonstrate the effectiveness of SALR when adopted by various algorithms over a broad range of networks. Our experiments indicate that SALR improves generalization, converges faster, and drives solutions to significantly flatter regions.
Pages: 12518-12527
Page count: 10
Related papers
50 in total
  • [41] Federated Model-Agnostic Meta-Learning With Sharpness-Aware Minimization for Internet of Things Optimization
    Wu, Qingtao
    Zhang, Yong
    Liu, Muhua
    Zhu, Junlong
    Zheng, Ruijuan
    Zhang, Mingchuan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (19): : 31317 - 31330
  • [42] Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima
    Si, Dongkuk
    Yun, Chulhee
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [43] ImbSAM: A Closer Look at Sharpness-Aware Minimization in Class-Imbalanced Recognition
    Zhou, Yixuan
    Qu, Yi
    Xu, Xing
    Shen, Hengtao
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11311 - 11321
  • [44] Self-adaptive asynchronous federated optimizer with adversarial sharpness-aware minimization
    Zhang, Xiongtao
    Wang, Ji
    Bao, Weidong
    Xiao, Wenhua
    Zhang, Yaohong
    Liu, Lihua
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 161 : 638 - 654
  • [45] Optimal Rho Value Selection Based on the Sharpness-Aware Minimization Procedure
    沈奥然
    软件 (Software), 2023, (01) : 126 - 129
  • [46] Enhancing Fine-Tuning based Backdoor Defense with Sharpness-Aware Minimization
    Zhu, Mingli
    Wei, Shaokui
    Shen, Li
    Fan, Yanbo
    Wu, Baoyuan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4443 - 4454
  • [47] The Dynamics of Sharpness-Aware Minimization: Bouncing Across Ravines and Drifting Towards Wide Minima
    Bartlett, Peter L.
    Long, Philip M.
    Bousquet, Olivier
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [48] Class-Conditional Sharpness-Aware Minimization for Deep Long-Tailed Recognition
    Zhou, Zhipeng
    Li, Lanqing
    Zhao, Peilin
    Heng, Pheng-Ann
    Gong, Wei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3499 - 3509
  • [49] Sharpness-aware Real-time Haze Removal for Advanced Driver Assistance Systems
    Ahn, Joonggeun
    Kim, Jihoon
    Lee, Youngjoo
    2016 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2016, : 47 - 48
  • [50] Double-Branch Multi-attention Mechanism Based Sharpness-Aware Classification Network
    Jiang W.
    Zhao L.
    Tu C.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (03): : 252 - 267