Deep Reinforcement Learning Based Adaptive Operator Selection for Evolutionary Multi-Objective Optimization

被引:51
|
作者
Tian, Ye [1 ,2 ]
Li, Xiaopeng [3 ]
Ma, Haiping [1 ,2 ]
Zhang, Xingyi [4 ]
Tan, Kay Chen [5 ]
Jin, Yaochu [6 ]
机构
[1] Anhui Univ Hefei, Inst Phys Sci, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Anhui, Peoples R China
[2] Anhui Univ Hefei, Inst Informat Technol, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Anhui, Peoples R China
[3] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[4] Anhui Univ, Sch Artificial Intelligence, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Anhui, Peoples R China
[5] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
[6] Bielefeld Univ, Fac Technol, D-33619 Bielefeld, Germany
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Reinforcement learning; Optimization; Convergence; Statistics; Sociology; Neural networks; Particle swarm optimization; Evolutionary algorithm; multi-objective optimization; operator selection; reinforcement learning; DIFFERENTIAL EVOLUTION; ALGORITHM; PERFORMANCE; STRATEGY; BANDITS; MOEA/D;
D O I
10.1109/TETCI.2022.3146882
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Evolutionary algorithms (EAs) have become one of the most effective techniques for multi-objective optimization, where a number of variation operators have been developed to handle the problems with various difficulties. While most EAs use a fixed operator all the time, it is a labor-intensive process to determine the best EA for a new problem. Hence, some recent studies have been dedicated to the adaptive selection of the best operators during the search process. To address the exploration versus exploitation dilemma in operator selection, this paper proposes a novel operator selection method based on reinforcement learning. In the proposed method, the decision variables are regarded as states and the candidate operators are regarded as actions. By using deep neural networks to learn a policy that estimates the $Q$ value of each action given a state, the proposed method can determine the best operator for each parent that maximizes its cumulative improvement. An EA is developed based on the proposed method, which is verified to be more effective than the state-of-the-art ones on challenging multi-objective optimization problems.
引用
收藏
页码:1051 / 1064
页数:14
相关论文
共 50 条
  • [1] Constrained Multi-Objective Optimization With Deep Reinforcement Learning Assisted Operator Selection
    Ming, Fei
    Gong, Wenyin
    Wang, Ling
    Jin, Yaochu
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (04) : 919 - 931
  • [2] Constrained Multi-Objective Optimization With Deep Reinforcement Learning Assisted Operator Selection
    Fei Ming
    Wenyin Gong
    Ling Wang
    Yaochu Jin
    [J]. IEEE/CAA Journal of Automatica Sinica, 2024, 11 (04) : 919 - 959
  • [3] Adaptive operator selection with dueling deep Q-network for evolutionary multi-objective optimization
    Yin, Shihong
    Xiang, Zhengrong
    [J]. NEUROCOMPUTING, 2024, 581
  • [4] A Multi-Objective Optimization Method for Shelter Site Selection Based on Deep Reinforcement Learning
    Zhang, Di
    Meng, Huan
    Wang, Moyang
    Xu, Xianrui
    Yan, Jianhai
    Li, Xiang
    [J]. TRANSACTIONS IN GIS, 2024,
  • [5] A Clonal Selection Adaptive Local Search Operator for multi-objective optimization evolutionary algorithm
    Li, Yong
    Wang, Yu
    Zhang, Yuxian
    An, Yuejun
    [J]. PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, : 755 - 757
  • [6] A decomposition-based multi-objective evolutionary algorithm with Q-learning for adaptive operator selection
    Xue, Fei
    Chen, Yuezheng
    Wang, Peiwen
    Ye, Yunsen
    Dong, Jinda
    Dong, Tingting
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (14): : 21229 - 21283
  • [7] Adaptive Objective Selection for Correlated Objectives in Multi-Objective Reinforcement Learning
    Brys, Tim
    Van Moffaert, Kristof
    Nowe, Ann
    Taylor, Matthew E.
    [J]. AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1349 - 1350
  • [8] A classification tree and decomposition based multi-objective evolutionary algorithm with adaptive operator selection
    Huantong Geng
    Ke Xu
    Yanqi Zhang
    Zhengli Zhou
    [J]. Complex & Intelligent Systems, 2023, 9 : 579 - 596
  • [9] A classification tree and decomposition based multi-objective evolutionary algorithm with adaptive operator selection
    Geng, Huantong
    Xu, Ke
    Zhang, Yanqi
    Zhou, Zhengli
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (01) : 579 - 596
  • [10] Deep Reinforcement Learning for Adaptive Parameter Control in Differential Evolution for Multi-Objective Optimization
    Reijnen, Robbert
    Zhang, Yingqian
    Bukhsh, Zaharah
    Guzek, Mateusz
    [J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 804 - 811