Reinforcement learning guided Spearman dynamic opposite Gradient-based optimizer for numerical optimization and anchor clustering

被引：3

作者：

Sun, Kangjian ^{[1
]}

Huo, Ju ^{[1
]}

Jia, Heming ^{[2
]}

Yue, Lin ^{[3
]}

机构：

[1] Harbin Inst Technol, Sch Elect Engn & Automat, Harbin 150001, Peoples R China

[2] Sanming Univ, Sch Informat Engn, Sanming 365004, Peoples R China

[3] China Acad Railway Sci, Signal & Commun Res Inst, Beijing 100081, Peoples R China

来源：

JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING | 2024年 / 11卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Gradient-based optimizer; Reinforcement learning; Spearman rank correlation coefficient; Dynamic opposite; Numerical optimization; Anchor clustering; GLOBAL OPTIMIZATION; FEATURE-SELECTION; ALGORITHM;

D O I：

10.1093/jcde/qwad109

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

As science and technology advance, the need for novel optimization techniques has led to an increase. The recently proposed metaheuristic algorithm, Gradient-based optimizer (GBO), is rooted in the gradient-based Newton's method. GBO has a more concrete theoretical foundation. However, gradient search rule (GSR) and local escaping operator (LEO) operators in GBO still have some shortcomings. The insufficient updating method and the simple selection process limit the search performance of the algorithm. In this paper, an improved version is proposed to compensate for the above shortcomings, called RL-SDOGBO. First, during the GSR phase, the Spearman rank correlation coefficient is used to determine weak solutions on which to perform dynamic opposite learning. This operation assists the algorithm to escape from local optima and enhance exploration capability. Secondly, to optimize the exploitation capability, reinforcement learning is used to guide the selection of solution update modes in the LEO operator. RL-SDOGBO is tested on 12 classical benchmark functions and 12 CEC2022 benchmark functions with seven representative metaheuristics, respectively. The impact of the improvements, the scalability and running time of the algorithm, and the balance of exploration and exploitation are analyzed and discussed. Combining the experimental results and some statistical results, RL-SDOGBO exhibits excellent numerical optimization performance and provides high-quality solutions in most cases. In addition, RL-SDOGBO is also used to solve the anchor clustering problem for small target detection, making it a more potential and competitive option. Graphical Abstract

引用

页码：12 / 33

页数：22

共 50 条

[1] Direct gradient-based reinforcement learning
Baxter, J
Bartlett, PL
ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL III: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 271 - 274
[2] Gradient-based optimizer for economic optimization of engineering problems
Mehta, Pranav
Yildiz, Betul Sultan
Sait, Sadiq M.
Yildiz, Ali Riza
MATERIALS TESTING, 2022, 64 (05) : 690 - 696
[3] Gradient-based optimizer: A new metaheuristic optimization algorithm
Ahmadianfar, Iman
Bozorg-Haddad, Omid
Chu, Xuefeng
INFORMATION SCIENCES, 2020, 540 : 131 - 159
[4] Gradient-based learning and optimization
Cao, XR
PROCEEDINGS OF THE 17TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2003, : 3 - 7
[5] A Novel Deployment Strategy Based on Improved Gradient-Based Optimizer for BLE Anchor Nodes
Yan, Jinjin
Zhang, Manyu
Gu, Fuqiang
Li, You
TRANSACTIONS IN GIS, 2025, 29 (01)
[6] Solving Optimization Problems Using an Extended Gradient-Based Optimizer
Ewees, Ahmed A.
MATHEMATICS, 2023, 11 (02)
[7] A Dynamic Multi-objective Scheduling Approach for Gradient-Based Reinforcement Learning
Hengel, Katharina
Wagner, Achim
Ruskowski, Martin
IFAC PAPERSONLINE, 2024, 58 (19): : 49 - 54
[8] Direct gradient-based reinforcement learning for robot behavior learning
El-Fakdi, Andres
Carreras, Marc
Ridao, Pere
INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS II, 2007, : 175 - +
[9] A Gradient-based reinforcement learning model of market equilibration
He, Zhongzhi
JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2023, 152
[10] Estimation and approximation bounds for gradient-based reinforcement learning
Bartlett, PL
Baxter, J
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2002, 64 (01) : 133 - 150

← 1 2 3 4 5 →