Deep Reinforcement Learning Based Adaptive Operator Selection for Evolutionary Multi-Objective Optimization

被引：51

作者：

Tian, Ye ^{[1
,2
]}

Li, Xiaopeng ^{[3
]}

Ma, Haiping ^{[1
,2
]}

Zhang, Xingyi ^{[4
]}

Tan, Kay Chen ^{[5
]}

Jin, Yaochu ^{[6
]}

机构：

[1] Anhui Univ Hefei, Inst Phys Sci, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Anhui, Peoples R China

[2] Anhui Univ Hefei, Inst Informat Technol, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Anhui, Peoples R China

[3] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China

[4] Anhui Univ, Sch Artificial Intelligence, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Anhui, Peoples R China

[5] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

[6] Bielefeld Univ, Fac Technol, D-33619 Bielefeld, Germany

来源：

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2023年 / 7卷 / 04期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Reinforcement learning; Optimization; Convergence; Statistics; Sociology; Neural networks; Particle swarm optimization; Evolutionary algorithm; multi-objective optimization; operator selection; reinforcement learning; DIFFERENTIAL EVOLUTION; ALGORITHM; PERFORMANCE; STRATEGY; BANDITS; MOEA/D;

D O I：

10.1109/TETCI.2022.3146882

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Evolutionary algorithms (EAs) have become one of the most effective techniques for multi-objective optimization, where a number of variation operators have been developed to handle the problems with various difficulties. While most EAs use a fixed operator all the time, it is a labor-intensive process to determine the best EA for a new problem. Hence, some recent studies have been dedicated to the adaptive selection of the best operators during the search process. To address the exploration versus exploitation dilemma in operator selection, this paper proposes a novel operator selection method based on reinforcement learning. In the proposed method, the decision variables are regarded as states and the candidate operators are regarded as actions. By using deep neural networks to learn a policy that estimates the $Q$ value of each action given a state, the proposed method can determine the best operator for each parent that maximizes its cumulative improvement. An EA is developed based on the proposed method, which is verified to be more effective than the state-of-the-art ones on challenging multi-objective optimization problems.

引用

页码：1051 / 1064

页数：14

共 50 条

[1] Constrained Multi-Objective Optimization With Deep Reinforcement Learning Assisted Operator Selection
Ming, Fei
Gong, Wenyin
Wang, Ling
Jin, Yaochu
[J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (04) : 919 - 931
[2] Constrained Multi-Objective Optimization With Deep Reinforcement Learning Assisted Operator Selection
Fei Ming
Wenyin Gong
Ling Wang
Yaochu Jin
[J]. IEEE/CAA Journal of Automatica Sinica, 2024, 11 (04) : 919 - 959
[3] Adaptive operator selection with dueling deep Q-network for evolutionary multi-objective optimization
Yin, Shihong
Xiang, Zhengrong
[J]. NEUROCOMPUTING, 2024, 581
[4] A Multi-Objective Optimization Method for Shelter Site Selection Based on Deep Reinforcement Learning
Zhang, Di
Meng, Huan
Wang, Moyang
Xu, Xianrui
Yan, Jianhai
Li, Xiang
[J]. TRANSACTIONS IN GIS, 2024,
[5] A Clonal Selection Adaptive Local Search Operator for multi-objective optimization evolutionary algorithm
Li, Yong
Wang, Yu
Zhang, Yuxian
An, Yuejun
[J]. PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, : 755 - 757
[6] A decomposition-based multi-objective evolutionary algorithm with Q-learning for adaptive operator selection
Xue, Fei
Chen, Yuezheng
Wang, Peiwen
Ye, Yunsen
Dong, Jinda
Dong, Tingting
[J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (14): : 21229 - 21283
[7] Adaptive Objective Selection for Correlated Objectives in Multi-Objective Reinforcement Learning
Brys, Tim
Van Moffaert, Kristof
Nowe, Ann
Taylor, Matthew E.
[J]. AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014, : 1349 - 1350
[8] A classification tree and decomposition based multi-objective evolutionary algorithm with adaptive operator selection
Huantong Geng
Ke Xu
Yanqi Zhang
Zhengli Zhou
[J]. Complex & Intelligent Systems, 2023, 9 : 579 - 596
[9] A classification tree and decomposition based multi-objective evolutionary algorithm with adaptive operator selection
Geng, Huantong
Xu, Ke
Zhang, Yanqi
Zhou, Zhengli
[J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (01) : 579 - 596
[10] Deep Reinforcement Learning for Adaptive Parameter Control in Differential Evolution for Multi-Objective Optimization
Reijnen, Robbert
Zhang, Yingqian
Bukhsh, Zaharah
Guzek, Mateusz
[J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 804 - 811

← 1 2 3 4 5 →