A reinforcement learning-based multi-objective optimization in an interval and dynamic environment

被引:3
|
作者
Xu, Yue [1 ]
Song, Yuxuan [1 ]
Pi, Dechang [1 ]
Chen, Yang [1 ]
Qin, Shuo [1 ]
Zhang, Xiaoge [2 ]
Yang, Shengxiang [3 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Hong Kong Polytech Univ, Dept Ind & Syst Engn, Hung Hom, Hong Kong, Peoples R China
[3] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, England
关键词
Dynamic optimization; Interval optimization; Multi-objective optimization; Q learning; Change severity detection; Change response; GENETIC ALGORITHM; NSGA-II;
D O I
10.1016/j.knosys.2023.111019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are many fields involving multi-objective optimization problems in presence of dynamic and interval environments (DI-MOPs), in which the number of objective functions is greater than one, the objectives are conflicting with each other, the problem is varying with time, and the parameters are interval-valued. Conflicts between multiple objectives make interval problems more difficult to be optimized in the dynamic environment. Recent works suffer from the lack of accuracy in change severity detection, the lack of adaptability in change response, and insufficient consideration of reducing imprecision. To tackle these issues, a novel reinforcement learning-based algorithm is proposed in this study, which has three original contributions: (1) Internal interval similarity is specially designed for the interval detection of change severity. To be specific, this operator is proposed for higher accuracy, including the hybridization between the interval similarity and point similarity, and the decision of the detection object and strategy. (2) Q learning is embedded into the optimization algorithm to select the optimal change response after the change occurs. The benefit of this operator is that the response mechanism is dynamically changed in accordance to the environments. (3) To reduce the uncertainty of problems, a new crowding distance operator is presented to guide the search to simultaneously increase diversity, speed up convergence, and decrease imprecision. The computational results from the benchmark sets demonstrate that the proposed algorithm is more efficient than other state-of-the-art algorithms, generating Pareto sets with stronger convergence, wider distribution, and less uncertainty.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Reinforcement Learning-Based Hybrid Multi-Objective Optimization Algorithm Design
    Palm, Herbert
    Arndt, Lorin
    [J]. INFORMATION, 2023, 14 (05)
  • [2] A reinforcement learning approach for dynamic multi-objective optimization
    Zou, Fei
    Yen, Gary G.
    Tang, Lixin
    Wang, Chunfeng
    [J]. INFORMATION SCIENCES, 2021, 546 : 815 - 834
  • [3] Multi-objective reinforcement learning-based approach for pressurized water reactor optimization
    Seurin, Paul
    Shirvan, Koroush
    [J]. ANNALS OF NUCLEAR ENERGY, 2024, 205
  • [4] Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems
    Ebrie, Awol Seid
    Kim, Young Jin
    [J]. SYSTEMS, 2024, 12 (03):
  • [5] Reinforcement learning-based multi-objective differential evolution for wind farm layout optimization
    Yu, Xiaobing
    Lu, Yangchen
    [J]. ENERGY, 2023, 284
  • [6] Reinforcement learning-based differential evolution algorithm for constrained multi-objective optimization problems
    Yu, Xiaobing
    Xu, Pingping
    Wang, Feng
    Wang, Xuming
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
  • [7] Multi-Objective Interval Optimization Dispatch of Microgrid via Deep Reinforcement Learning
    Mu, Chaoxu
    Shi, Yakun
    Xu, Na
    Wang, Xinying
    Tang, Zhuo
    Jia, Hongjie
    Geng, Hua
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (03) : 2957 - 2970
  • [8] An Improved Multi-objective Optimization Algorithm Based on Reinforcement Learning
    Liu, Jun
    Zhou, Yi
    Qiu, Yimin
    Li, Zhongfeng
    [J]. ADVANCES IN SWARM INTELLIGENCE, ICSI 2022, PT I, 2022, : 501 - 513
  • [9] Genetic algorithm based multi-objective reliability optimization in interval environment
    Sahoo, Laxminarayan
    Bhunia, Asoke Kumar
    Kapur, Parmad Kumar
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2012, 62 (01) : 152 - 160
  • [10] A Dynamic Resource Allocation Strategy with Reinforcement Learning for Multimodal Multi-objective Optimization
    Dang, Qian-Long
    Xu, Wei
    Yuan, Yang-Fei
    [J]. MACHINE INTELLIGENCE RESEARCH, 2022, 19 (02) : 138 - 152