Reinforcement learning guided Spearman dynamic opposite Gradient-based optimizer for numerical optimization and anchor clustering

Cited by: 3

Authors:

Sun, Kangjian [1]

Huo, Ju [1]

Jia, Heming [2]

Yue, Lin [3]

Affiliations:

[1] Harbin Inst Technol, Sch Elect Engn & Automat, Harbin 150001, Peoples R China

[2] Sanming Univ, Sch Informat Engn, Sanming 365004, Peoples R China

[3] China Acad Railway Sci, Signal & Commun Res Inst, Beijing 100081, Peoples R China

Source:

JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING | 2024 / Vol. 11 / No. 01

Funding:

National Natural Science Foundation of China;

Keywords:

Gradient-based optimizer; Reinforcement learning; Spearman rank correlation coefficient; Dynamic opposite; Numerical optimization; Anchor clustering; GLOBAL OPTIMIZATION; FEATURE-SELECTION; ALGORITHM;

DOI:

10.1093/jcde/qwad109

Chinese Library Classification number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

As science and technology advance, the demand for novel optimization techniques has grown. The recently proposed metaheuristic algorithm, Gradient-based optimizer (GBO), is rooted in the gradient-based Newton's method, which gives it a more concrete theoretical foundation. However, the gradient search rule (GSR) and local escaping operator (LEO) in GBO still have some shortcomings: the insufficient updating method and the simple selection process limit the search performance of the algorithm. In this paper, an improved version, called RL-SDOGBO, is proposed to compensate for these shortcomings. First, during the GSR phase, the Spearman rank correlation coefficient is used to determine weak solutions on which to perform dynamic opposite learning. This operation helps the algorithm escape from local optima and enhances its exploration capability. Second, to improve the exploitation capability, reinforcement learning is used to guide the selection of solution update modes in the LEO operator. RL-SDOGBO is tested on 12 classical benchmark functions and 12 CEC2022 benchmark functions against seven representative metaheuristics. The impact of the improvements, the scalability and running time of the algorithm, and the balance of exploration and exploitation are analyzed and discussed. Based on the experimental and statistical results, RL-SDOGBO exhibits excellent numerical optimization performance and provides high-quality solutions in most cases. In addition, RL-SDOGBO is used to solve the anchor clustering problem for small target detection, making it a promising and competitive option.
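
The abstract does not say how the Spearman coefficient marks a solution as weak, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes a solution is "weak" when the rank correlation between its coordinates and those of the current best solution falls at or below a hypothetical `threshold`, and it uses the commonly cited dynamic opposite learning update x + w * r1 * (r2 * (lower + upper - x) - x). The function names, `threshold`, `weight`, and the greedy acceptance step are all illustrative choices.

```python
import numpy as np

def spearman_rho(x, y):
    """Spearman rank correlation of two 1-D arrays (assumes no tied values)."""
    rx = np.argsort(np.argsort(x))  # rank of each element of x
    ry = np.argsort(np.argsort(y))  # rank of each element of y
    return float(np.corrcoef(rx, ry)[0, 1])  # Pearson correlation of the ranks

def dynamic_opposite(x, lower, upper, weight, rng):
    """Dynamic opposite candidate of x, clipped to the box [lower, upper]."""
    r1 = rng.random(x.shape)
    r2 = rng.random(x.shape)
    opposite = lower + upper - x  # classical opposite point
    candidate = x + weight * r1 * (r2 * opposite - x)
    return np.clip(candidate, lower, upper)

def refresh_weak_solutions(pop, fitness, objective, lower, upper,
                           threshold=0.0, weight=3.0, seed=0):
    """Apply dynamic opposite learning to solutions flagged as weak.

    Assumption: a solution is weak when its Spearman correlation with the
    best solution is <= threshold. A candidate replaces the original only
    if it lowers the objective (minimization, greedy acceptance).
    """
    rng = np.random.default_rng(seed)
    best = pop[np.argmin(fitness)]
    new_pop, new_fit = pop.copy(), fitness.copy()
    for i in range(len(pop)):
        if spearman_rho(pop[i], best) <= threshold:
            cand = dynamic_opposite(pop[i], lower, upper, weight, rng)
            f = objective(cand)
            if f < fitness[i]:
                new_pop[i], new_fit[i] = cand, f
    return new_pop, new_fit

def sphere(x):
    """Simple test objective: sum of squares."""
    return float(np.sum(x ** 2))

rng = np.random.default_rng(1)
pop = rng.uniform(-5.0, 5.0, size=(8, 6))
fitness = np.array([sphere(p) for p in pop])
new_pop, new_fit = refresh_weak_solutions(pop, fitness, sphere, -5.0, 5.0)
```

Because acceptance is greedy, no solution's fitness can get worse in this sketch. In the paper this step sits inside the GSR phase of GBO, which is not reproduced here.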

Pages: 12 - 33

Page count: 22

50 records in total

[21] Clustering-assisted gradient-based optimizer for scheduling parallel cloud workflows with budget constraints
Li, Huifang
Chen, Boyuan
Huang, Jingwei
Song, Zhuoyue
Xia, Yuanqing
JOURNAL OF SUPERCOMPUTING, 2024, 80 (12) : 17097 - 17134
[22] Gradient-Based Minimization for Multi-Expert Inverse Reinforcement Learning
Tateo, Davide
Pirotta, Matteo
Restelli, Marcello
Bonarini, Andrea
2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017 : 815 - 822
[23] Optimizing thermodynamic trajectories using evolutionary and gradient-based reinforcement learning
Beeler, Chris
Yahorau, Uladzimir
Coles, Rory
Mills, Kyle
Whitelam, Stephen
Tamblyn, Isaac
PHYSICAL REVIEW E, 2021, 104 (06)
[24] Reinforcement learning for enhanced online gradient-based parameter adaptation in metaheuristics
Tatsis, Vasileios A.
Parsopoulos, Konstantinos E.
SWARM AND EVOLUTIONARY COMPUTATION, 2023, 83
[25] An algorithm for gradient-based dynamic optimization of UV flash processes
Ritschel, Tobias K. S.
Capolei, Andrea
Gaspar, Jozsef
Jorgensen, John Bagterp
COMPUTERS & CHEMICAL ENGINEERING, 2018, 114 : 281 - 295
[26] Extreme Learning Machine Using Improved Gradient-Based Optimizer for Dam Seepage Prediction
Li Lei
Yongquan Zhou
Huajuan Huang
Qifang Luo
Arabian Journal for Science and Engineering, 2023, 48 : 9693 - 9712
[27] Extreme Learning Machine Using Improved Gradient-Based Optimizer for Dam Seepage Prediction
Lei, Li
Zhou, Yongquan
Huang, Huajuan
Luo, Qifang
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 9693 - 9712
[28] Learning Supervised PageRank with Gradient-Based and Gradient-Free Optimization Methods
Bogolubsky, Lev
Gusev, Gleb
Raigorodskii, Andrei
Tikhonov, Aleksey
Zhukovskii, Maksim
Dvurechensky, Pavel
Gasnikov, Alexander
Nesterov, Yurii
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[29] Variational learning the SDC quantum protocol with gradient-based optimization
Haozhen Situ
Zhiming Huang
Xiangfu Zou
Shenggen Zheng
Quantum Information Processing, 2019, 18
[30] Two-step gradient-based reinforcement learning for underwater robotics behavior learning
El-Fakdi, Andres
Carreras, Marc
ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (03) : 271 - 282
