Multi-objective fuzzy Q-learning to solve continuous state-action problems

被引:4
|
作者
Asgharnia, Amirhossein [1 ]
Schwartz, Howard [1 ]
Atia, Mohamed [1 ]
机构
[1] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Reinforcement learning; Differential games; Q-learning; Multi-objective reinforcement learning;
D O I
10.1016/j.neucom.2022.10.035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many real world problems are multi-objective. Thus, the need for multi-objective learning and optimiza-tion algorithms is inevitable. Although the multi-objective optimization algorithms are well-studied, the multi-objective learning algorithms have attracted less attention. In this paper, a fuzzy multi-objective reinforcement learning algorithm is proposed, and we refer to it as the multi-objective fuzzy Q-learning (MOFQL) algorithm. The algorithm is implemented to solve a bi-objective reach-avoid game. The majority of the multi-objective reinforcement algorithms proposed address solving problems in the discrete state-action domain. However, the MOFQL algorithm can also handle problems in a contin-uous state-action domain. A fuzzy inference system (FIS) is implemented to estimate the value function for the bi-objective problem. We used a temporal difference (TD) approach to update the fuzzy rules. The proposed method isa multi-policy multi-objective algorithm and can find the non-convex regions of the Pareto front.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:115 / 132
页数:18
相关论文
共 50 条
  • [21] Cognitive networks QoS multi-objective strategy based on Q-learning algorithm
    Wang, B. (wangbowx@163.com), 1600, Advanced Institute of Convergence Information Technology, Myoungbo Bldg 3F,, Bumin-dong 1-ga, Seo-gu, Busan, 602-816, Korea, Republic of (07):
  • [22] A FUZZY APPROACH TO SOLVE MULTI-OBJECTIVE TRANSPORTATION PROBLEM
    Lohgaonkar, M. H.
    Bajaj, V. H.
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2009, 5 (02): : 443 - 452
  • [23] Evaluating Q-Learning Policies for Multi-objective Foraging Task in a Multi-agent Environment
    Yogeswaran, M.
    Ponnambalam, S. G.
    INTELLIGENT ROBOTICS AND APPLICATIONS, PT II, 2010, 6425 : 587 - 598
  • [24] learning with policy prediction in continuous state-action multi-agent decision processes
    Ghorbani, Farzaneh
    Afsharchi, Mohsen
    Derhami, Vali
    SOFT COMPUTING, 2020, 24 (02) : 901 - 918
  • [25] learning with policy prediction in continuous state-action multi-agent decision processes
    Farzaneh Ghorbani
    Mohsen Afsharchi
    Vali Derhami
    Soft Computing, 2020, 24 : 901 - 918
  • [26] A new optimization algorithm to solve multi-objective problems
    Sharifi, Mohammad Reza
    Akbarifard, Saeid
    Qaderi, Kourosh
    Madadi, Mohamad Reza
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [27] A new optimization algorithm to solve multi-objective problems
    Mohammad Reza Sharifi
    Saeid Akbarifard
    Kourosh Qaderi
    Mohamad Reza Madadi
    Scientific Reports, 11
  • [28] Multi-objective optimization of radiotherapy: distributed Q-learning and agent-based simulation
    Jalalimanesh, Ammar
    Haghighi, Hamidreza Shahabi
    Ahmadi, Abbas
    Hejazian, Hossein
    Soltani, Madjid
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2017, 29 (05) : 1071 - 1086
  • [29] Q-Learning Based Multi-objective Optimization Routing Strategy in UAVs Deterministic Network
    Zhou, Zou
    Chen, Longjie
    Hu, Yu
    Zheng, Fei
    Liang, Caisheng
    Li, Kelin
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND NETWORKS, VOL II, CENET 2023, 2024, 1126 : 399 - 408
  • [30] Using Memory-Based Learning to Solve Tasks with State-Action Constraints
    Verghese, Mrinal
    Atkeson, Christopher
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9558 - 9565