Reinforcement learning guided Spearman dynamic opposite Gradient-based optimizer for numerical optimization and anchor clustering

被引:3
|
作者
Sun, Kangjian [1 ]
Huo, Ju [1 ]
Jia, Heming [2 ]
Yue, Lin [3 ]
机构
[1] Harbin Inst Technol, Sch Elect Engn & Automat, Harbin 150001, Peoples R China
[2] Sanming Univ, Sch Informat Engn, Sanming 365004, Peoples R China
[3] China Acad Railway Sci, Signal & Commun Res Inst, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Gradient-based optimizer; Reinforcement learning; Spearman rank correlation coefficient; Dynamic opposite; Numerical optimization; Anchor clustering; GLOBAL OPTIMIZATION; FEATURE-SELECTION; ALGORITHM;
D O I
10.1093/jcde/qwad109
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
As science and technology advance, the need for novel optimization techniques has led to an increase. The recently proposed metaheuristic algorithm, Gradient-based optimizer (GBO), is rooted in the gradient-based Newton's method. GBO has a more concrete theoretical foundation. However, gradient search rule (GSR) and local escaping operator (LEO) operators in GBO still have some shortcomings. The insufficient updating method and the simple selection process limit the search performance of the algorithm. In this paper, an improved version is proposed to compensate for the above shortcomings, called RL-SDOGBO. First, during the GSR phase, the Spearman rank correlation coefficient is used to determine weak solutions on which to perform dynamic opposite learning. This operation assists the algorithm to escape from local optima and enhance exploration capability. Secondly, to optimize the exploitation capability, reinforcement learning is used to guide the selection of solution update modes in the LEO operator. RL-SDOGBO is tested on 12 classical benchmark functions and 12 CEC2022 benchmark functions with seven representative metaheuristics, respectively. The impact of the improvements, the scalability and running time of the algorithm, and the balance of exploration and exploitation are analyzed and discussed. Combining the experimental results and some statistical results, RL-SDOGBO exhibits excellent numerical optimization performance and provides high-quality solutions in most cases. In addition, RL-SDOGBO is also used to solve the anchor clustering problem for small target detection, making it a more potential and competitive option. Graphical Abstract
引用
收藏
页码:12 / 33
页数:22
相关论文
共 50 条
  • [1] Direct gradient-based reinforcement learning
    Baxter, J
    Bartlett, PL
    ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL III: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 271 - 274
  • [2] Gradient-based optimizer for economic optimization of engineering problems
    Mehta, Pranav
    Yildiz, Betul Sultan
    Sait, Sadiq M.
    Yildiz, Ali Riza
    MATERIALS TESTING, 2022, 64 (05) : 690 - 696
  • [3] Gradient-based optimizer: A new metaheuristic optimization algorithm
    Ahmadianfar, Iman
    Bozorg-Haddad, Omid
    Chu, Xuefeng
    INFORMATION SCIENCES, 2020, 540 : 131 - 159
  • [4] Gradient-based learning and optimization
    Cao, XR
    PROCEEDINGS OF THE 17TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2003, : 3 - 7
  • [5] A Novel Deployment Strategy Based on Improved Gradient-Based Optimizer for BLE Anchor Nodes
    Yan, Jinjin
    Zhang, Manyu
    Gu, Fuqiang
    Li, You
    TRANSACTIONS IN GIS, 2025, 29 (01)
  • [6] Solving Optimization Problems Using an Extended Gradient-Based Optimizer
    Ewees, Ahmed A.
    MATHEMATICS, 2023, 11 (02)
  • [7] A Dynamic Multi-objective Scheduling Approach for Gradient-Based Reinforcement Learning
    Hengel, Katharina
    Wagner, Achim
    Ruskowski, Martin
    IFAC PAPERSONLINE, 2024, 58 (19): : 49 - 54
  • [8] Direct gradient-based reinforcement learning for robot behavior learning
    El-Fakdi, Andres
    Carreras, Marc
    Ridao, Pere
    INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS II, 2007, : 175 - +
  • [9] A Gradient-based reinforcement learning model of market equilibration
    He, Zhongzhi
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2023, 152
  • [10] Estimation and approximation bounds for gradient-based reinforcement learning
    Bartlett, PL
    Baxter, J
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2002, 64 (01) : 133 - 150