A reinforcement learning-based multi-objective optimization in an interval and dynamic environment

被引：3

作者：

Xu, Yue ^{[1
]}

Song, Yuxuan ^{[1
]}

Pi, Dechang ^{[1
]}

Chen, Yang ^{[1
]}

Qin, Shuo ^{[1
]}

Zhang, Xiaoge ^{[2
]}

Yang, Shengxiang ^{[3
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China

[2] Hong Kong Polytech Univ, Dept Ind & Syst Engn, Hung Hom, Hong Kong, Peoples R China

[3] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, England

来源：

KNOWLEDGE-BASED SYSTEMS | 2023年 / 280卷

关键词：

Dynamic optimization; Interval optimization; Multi-objective optimization; Q learning; Change severity detection; Change response; GENETIC ALGORITHM; NSGA-II;

D O I：

10.1016/j.knosys.2023.111019

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

There are many fields involving multi-objective optimization problems in presence of dynamic and interval environments (DI-MOPs), in which the number of objective functions is greater than one, the objectives are conflicting with each other, the problem is varying with time, and the parameters are interval-valued. Conflicts between multiple objectives make interval problems more difficult to be optimized in the dynamic environment. Recent works suffer from the lack of accuracy in change severity detection, the lack of adaptability in change response, and insufficient consideration of reducing imprecision. To tackle these issues, a novel reinforcement learning-based algorithm is proposed in this study, which has three original contributions: (1) Internal interval similarity is specially designed for the interval detection of change severity. To be specific, this operator is proposed for higher accuracy, including the hybridization between the interval similarity and point similarity, and the decision of the detection object and strategy. (2) Q learning is embedded into the optimization algorithm to select the optimal change response after the change occurs. The benefit of this operator is that the response mechanism is dynamically changed in accordance to the environments. (3) To reduce the uncertainty of problems, a new crowding distance operator is presented to guide the search to simultaneously increase diversity, speed up convergence, and decrease imprecision. The computational results from the benchmark sets demonstrate that the proposed algorithm is more efficient than other state-of-the-art algorithms, generating Pareto sets with stronger convergence, wider distribution, and less uncertainty.

引用

页数：15

共 50 条

[1] Reinforcement Learning-Based Hybrid Multi-Objective Optimization Algorithm Design
Palm, Herbert
Arndt, Lorin
[J]. INFORMATION, 2023, 14 (05)
[2] A reinforcement learning approach for dynamic multi-objective optimization
Zou, Fei
Yen, Gary G.
Tang, Lixin
Wang, Chunfeng
[J]. INFORMATION SCIENCES, 2021, 546 : 815 - 834
[3] Multi-objective reinforcement learning-based approach for pressurized water reactor optimization
Seurin, Paul
Shirvan, Koroush
[J]. ANNALS OF NUCLEAR ENERGY, 2024, 205
[4] Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems
Ebrie, Awol Seid
Kim, Young Jin
[J]. SYSTEMS, 2024, 12 (03):
[5] Reinforcement learning-based multi-objective differential evolution for wind farm layout optimization
Yu, Xiaobing
Lu, Yangchen
[J]. ENERGY, 2023, 284
[6] Reinforcement learning-based differential evolution algorithm for constrained multi-objective optimization problems
Yu, Xiaobing
Xu, Pingping
Wang, Feng
Wang, Xuming
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
[7] Multi-Objective Interval Optimization Dispatch of Microgrid via Deep Reinforcement Learning
Mu, Chaoxu
Shi, Yakun
Xu, Na
Wang, Xinying
Tang, Zhuo
Jia, Hongjie
Geng, Hua
[J]. IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (03) : 2957 - 2970
[8] An Improved Multi-objective Optimization Algorithm Based on Reinforcement Learning
Liu, Jun
Zhou, Yi
Qiu, Yimin
Li, Zhongfeng
[J]. ADVANCES IN SWARM INTELLIGENCE, ICSI 2022, PT I, 2022, : 501 - 513
[9] Genetic algorithm based multi-objective reliability optimization in interval environment
Sahoo, Laxminarayan
Bhunia, Asoke Kumar
Kapur, Parmad Kumar
[J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2012, 62 (01) : 152 - 160
[10] A Dynamic Resource Allocation Strategy with Reinforcement Learning for Multimodal Multi-objective Optimization
Dang, Qian-Long
Xu, Wei
Yuan, Yang-Fei
[J]. MACHINE INTELLIGENCE RESEARCH, 2022, 19 (02) : 138 - 152

← 1 2 3 4 5 →