Multi-objective Reinforcement Learning Algorithm for MOSDMP in Unknown Environment

被引:12
|
作者
Zhao, Yun [1 ]
Chen, Qingwei [1 ]
Hu, Weili [1 ]
机构
[1] Univ Sci & Technol, Dept Automat, Nanjing, Jiangsu Provinc, Peoples R China
关键词
Reinforcement learning; Markov decision processes (MDP); Fuzzy inference system; Action decision; MARKOV DECISION-PROCESSES;
D O I
10.1109/WCICA.2010.5553980
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new multi-objective reinforcement learning algorithm for multi-objective sequential decision making problems in unknown environment is proposed. The salient characters of the algorithm are: 1) decision maker's objective preference is introduced to guide learning direction; 2) a new measure of comparing action decisions under several objectives based on the fuzzy inference system is defined; 3) fast learning speed can be achieved. Simulation results demonstrate that the proposed algorithm has a good learning performance.
引用
收藏
页码:3190 / 3194
页数:5
相关论文
共 50 条
  • [1] Decomposition based Multi-Objective Evolutionary Algorithm in XCS for Multi-Objective Reinforcement Learning
    Cheng, Xiu
    Browne, Will N.
    Zhang, Mengjie
    [J]. 2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 622 - 629
  • [2] A Multi-objective Reinforcement Learning Algorithm for JS']JSSP
    Mendez-Hernandez, Beatriz M.
    Rodriguez-Bazan, Erick D.
    Martinez-Jimenez, Yailen
    Libin, Pieter
    Nowe, Ann
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 567 - 584
  • [3] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Horie, Naoto
    Matsui, Tohgoroh
    Moriyama, Koichi
    Mutoh, Atsuko
    Inuzuka, Nobuhiro
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (03) : 352 - 359
  • [4] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Naoto Horie
    Tohgoroh Matsui
    Koichi Moriyama
    Atsuko Mutoh
    Nobuhiro Inuzuka
    [J]. Artificial Life and Robotics, 2019, 24 : 352 - 359
  • [5] An XCS-based Algorithm for Multi-Objective Reinforcement Learning
    Cheng, Xiu
    Chen, Gang
    Zhang, Mengjie
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 4007 - 4014
  • [6] A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation
    Yang, Runzhe
    Sun, Xingyuan
    Narasimhan, Karthik
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] An Improved Multi-objective Optimization Algorithm Based on Reinforcement Learning
    Liu, Jun
    Zhou, Yi
    Qiu, Yimin
    Li, Zhongfeng
    [J]. ADVANCES IN SWARM INTELLIGENCE, ICSI 2022, PT I, 2022, : 501 - 513
  • [8] Multi-objective ω-Regular Reinforcement Learning
    Hahn, Ernst Moritz
    Perez, Mateo
    Schewe, Sven
    Somenzi, Fabio
    Trivedi, Ashutosh
    Wojtczak, Dominik
    [J]. FORMAL ASPECTS OF COMPUTING, 2023, 35 (02)
  • [9] Multi-strategy multi-objective differential evolutionary algorithm with reinforcement learning
    Han, Yupeng
    Peng, Hu
    Mei, Changrong
    Cao, Lianglin
    Deng, Changshou
    Wang, Hui
    Wu, Zhijian
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 277
  • [10] Federated multi-objective reinforcement learning
    Zhao, Fangyuan
    Ren, Xuebin
    Yang, Shusen
    Zhao, Peng
    Zhang, Rui
    Xu, Xinxin
    [J]. INFORMATION SCIENCES, 2023, 624 : 811 - 832