A new distributed reinforcement learning algorithm for multiple objective optimization problems

被引:0
|
作者
Mariano, C
Morales, E
机构
[1] Inst Mexicano Tecnol Agua, Jiutepec 62550, Morelos, Mexico
[2] ITESM, Temixco 62589, Morelos, Mexico
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a new algorithm, called MDQL, for the solution of multiple objective optimization problems. MDQL is based on a new distributed Q-learning algorithm, called DQL, which is also introduced in this paper. In DQL a family of independent agents, exploring different options, finds a common policy in a common environment. Information about action goodness is transmitted using traces over state-action pairs. MDQL extends this idea to multiple objectives, assigning a family of agents for each objective involved. A non-dominant criterion is used to construct Pareto fronts and by delaying adjustments on the rewards MDQL achieves better distributions of solutions. Furthermore, an extension for applying reinforcement learning to continuous functions is also given. Successful results of MDQL on several test-bed problems suggested in the literature are described.
引用
收藏
页码:290 / 299
页数:10
相关论文
共 50 条
  • [22] A new evolutionary algorithm for solving many-objective optimization problems
    Zou, Xiufen
    Chen, Yu
    Liu, Minzhong
    Kang, Lishan
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (05): : 1402 - 1412
  • [23] A reinforcement learning-based metaheuristic algorithm for solving global optimization problems
    Seyyedabbasi, Amir
    ADVANCES IN ENGINEERING SOFTWARE, 2023, 178
  • [24] Reinforcement learning iterated greedy algorithm for distributed assembly permutation flowshop scheduling problems
    Ying K.-C.
    Lin S.-W.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (08) : 11123 - 11138
  • [25] Q-Sorting: An Algorithm for Reinforcement Learning Problems with Multiple Cumulative Constraints
    Huang, Jianfeng
    Lu, Guoqiang
    Li, Yi
    Wu, Jiajun
    MATHEMATICS, 2024, 12 (13)
  • [26] Multiobjective optimization algorithm with objective-wise learning for continuous multiobjective problems
    Wang, Jiahai
    Zhong, Chenglin
    Zhou, Ying
    Zhou, Yalan
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2015, 6 (05) : 571 - 585
  • [27] Multiobjective optimization algorithm with objective-wise learning for continuous multiobjective problems
    Jiahai Wang
    Chenglin Zhong
    Ying Zhou
    Yalan Zhou
    Journal of Ambient Intelligence and Humanized Computing, 2015, 6 : 571 - 585
  • [28] Modified teaching-learning-based optimization algorithm for multi-objective optimization problems
    Wang, Zhi
    Song, Shufang
    Wei, Hongkui
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (06) : 6017 - 6026
  • [29] Collaborative Pareto Set Learning in Multiple Multi-Objective Optimization Problems
    Shang, Chikai
    Ye, Rongguang
    Jiang, Jiaqi
    Gu, Fangqing
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [30] Objective Extraction for Many-Objective Optimization Problems: Algorithm and Test Problems
    Cheung, Yiu-ming
    Gu, Fangqing
    Liu, Hai-Lin
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2016, 20 (05) : 755 - 772