Reinforcement learning guided Spearman dynamic opposite Gradient-based optimizer for numerical optimization and anchor clustering

被引:3
|
作者
Sun, Kangjian [1 ]
Huo, Ju [1 ]
Jia, Heming [2 ]
Yue, Lin [3 ]
机构
[1] Harbin Inst Technol, Sch Elect Engn & Automat, Harbin 150001, Peoples R China
[2] Sanming Univ, Sch Informat Engn, Sanming 365004, Peoples R China
[3] China Acad Railway Sci, Signal & Commun Res Inst, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Gradient-based optimizer; Reinforcement learning; Spearman rank correlation coefficient; Dynamic opposite; Numerical optimization; Anchor clustering; GLOBAL OPTIMIZATION; FEATURE-SELECTION; ALGORITHM;
D O I
10.1093/jcde/qwad109
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
As science and technology advance, the need for novel optimization techniques has led to an increase. The recently proposed metaheuristic algorithm, Gradient-based optimizer (GBO), is rooted in the gradient-based Newton's method. GBO has a more concrete theoretical foundation. However, gradient search rule (GSR) and local escaping operator (LEO) operators in GBO still have some shortcomings. The insufficient updating method and the simple selection process limit the search performance of the algorithm. In this paper, an improved version is proposed to compensate for the above shortcomings, called RL-SDOGBO. First, during the GSR phase, the Spearman rank correlation coefficient is used to determine weak solutions on which to perform dynamic opposite learning. This operation assists the algorithm to escape from local optima and enhance exploration capability. Secondly, to optimize the exploitation capability, reinforcement learning is used to guide the selection of solution update modes in the LEO operator. RL-SDOGBO is tested on 12 classical benchmark functions and 12 CEC2022 benchmark functions with seven representative metaheuristics, respectively. The impact of the improvements, the scalability and running time of the algorithm, and the balance of exploration and exploitation are analyzed and discussed. Combining the experimental results and some statistical results, RL-SDOGBO exhibits excellent numerical optimization performance and provides high-quality solutions in most cases. In addition, RL-SDOGBO is also used to solve the anchor clustering problem for small target detection, making it a more potential and competitive option. Graphical Abstract
引用
收藏
页码:12 / 33
页数:22
相关论文
共 50 条
  • [21] Clustering-assisted gradient-based optimizer for scheduling parallel cloud workflows with budget constraints
    Li, Huifang
    Chen, Boyuan
    Huang, Jingwei
    Song, Zhuoyue
    Xia, Yuanqing
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (12): : 17097 - 17134
  • [22] Gradient-Based Minimization for Multi-Expert Inverse Reinforcement Learning
    Tateo, Davide
    Pirotta, Matteo
    Restelli, Marcello
    Bonarini, Andrea
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 815 - 822
  • [23] Optimizing thermodynamic trajectories using evolutionary and gradient-based reinforcement learning
    Beeler, Chris
    Yahorau, Uladzimir
    Coles, Rory
    Mills, Kyle
    Whitelam, Stephen
    Tamblyn, Isaac
    PHYSICAL REVIEW E, 2021, 104 (06)
  • [24] Reinforcement learning for enhanced online gradient-based parameter adaptation in metaheuristics
    Tatsis, Vasileios A.
    Parsopoulos, Konstantinos E.
    SWARM AND EVOLUTIONARY COMPUTATION, 2023, 83
  • [25] An algorithm for gradient-based dynamic optimization of UV flash processes
    Ritschel, Tobias K. S.
    Capolei, Andrea
    Gaspar, Jozsef
    Jorgensen, John Bagterp
    COMPUTERS & CHEMICAL ENGINEERING, 2018, 114 : 281 - 295
  • [26] Extreme Learning Machine Using Improved Gradient-Based Optimizer for Dam Seepage Prediction
    Li Lei
    Yongquan Zhou
    Huajuan Huang
    Qifang Luo
    Arabian Journal for Science and Engineering, 2023, 48 : 9693 - 9712
  • [27] Extreme Learning Machine Using Improved Gradient-Based Optimizer for Dam Seepage Prediction
    Lei, Li
    Zhou, Yongquan
    Huang, Huajuan
    Luo, Qifang
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 9693 - 9712
  • [28] Learning Supervised PageRank with Gradient-Based and Gradient-Free Optimization Methods
    Bogolubsky, Lev
    Gusev, Gleb
    Raigorodskii, Andrei
    Tikhonov, Aleksey
    Zhukovskii, Maksim
    Dvurechensky, Pavel
    Gasnikov, Alexander
    Nesterov, Yurii
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [29] Variational learning the SDC quantum protocol with gradient-based optimization
    Haozhen Situ
    Zhiming Huang
    Xiangfu Zou
    Shenggen Zheng
    Quantum Information Processing, 2019, 18
  • [30] Two-step gradient-based reinforcement learning for underwater robotics behavior learning
    El-Fakdi, Andres
    Carreras, Marc
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (03) : 271 - 282