Decentralized Incremental Fuzzy Reinforcement Learning for Multi-Agent Systems

被引:1
|
作者
Hamzeloo, Sam [1 ]
Jahromi, Mansoor Zolghadri [1 ]
机构
[1] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
关键词
multi-agent systems; decentralized partially observable Markov decision processes; planning under uncertainty; fuzzy inference systems; reinforcement learning;
D O I
10.1142/S021848852050004X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new incremental fuzzy reinforcement learning algorithm to find a sub-optimal policy for infinite-horizon Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs). The algorithm addresses the high computational complexity of solving large Dec-POMDPs by generating a compact fuzzy rule-base for each agent. In our method, each agent uses its own fuzzy rule-base to make the decisions. The fuzzy rules in these rule-bases are incrementally created and tuned according to experiences of the agents. Reinforcement learning is used to tune the behavior of each agent in such a way that maximum global reward is achieved. In addition, we propose a method to construct the initial rule-base for each agent using the solution of the underlying MDP. This drastically improves the performance of the algorithm in comparison with random initialization of the rule-base. We assess the performance of our proposed method using several benchmark problems in comparison with some state-of-the-art methods. Experimental results show that our algorithm achieves better or similar reward when compared with other methods. However, from the runtime point of view, our method is superior to all previous methods. Using a compact fuzzy rule-base not only decreases the amount of memory used but also significantly speeds up the learning phase.
引用
收藏
页码:79 / 98
页数:20
相关论文
共 50 条
  • [1] Decentralized Deterministic Multi-Agent Reinforcement Learning
    Grosnit, Antoine
    Cai, Desmond
    Wynter, Laura
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1548 - 1553
  • [2] An analysis of multi-agent reinforcement learning for decentralized inventory control systems
    Mousa, Marwan
    van de Berg, Damien
    Kotecha, Niki
    Chanona, Ehecatl Antonio del Rio
    Mowbray, Max
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2024, 188
  • [3] Modular fuzzy-reinforcement learning in multi-agent systems
    Kaya, M
    Gültekin, T
    Arslan, A
    [J]. INTELLIGENT AUTONOMOUS SYSTEMS 7, 2002, : 170 - 176
  • [4] Multi-Agent Reinforcement Learning With Decentralized Distribution Correction
    Li, Kuo
    Jia, Qing-Shan
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 13
  • [5] Multi-agent reinforcement learning as a rehearsal for decentralized planning
    Kraemer, Landon
    Banerjee, Bikramjit
    [J]. NEUROCOMPUTING, 2016, 190 : 82 - 94
  • [6] Multi-agent Reinforcement Learning for Decentralized Stable Matching
    Taywade, Kshitija
    Goldsmith, Judy
    Harrison, Brent
    [J]. ALGORITHMIC DECISION THEORY, ADT 2021, 2021, 13023 : 375 - 389
  • [7] Decentralized Multi-agent Reinforcement Learning with Shared Actions
    Mishra, Rajesh K.
    Vasal, Deepanshu
    Vishwanath, Sriram
    [J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [8] Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning
    Zimmer, Matthieu
    Glanois, Claire
    Siddique, Umer
    Weng, Paul
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [9] Decentralized Multi-Agent Pursuit Using Deep Reinforcement Learning
    de Souza, Cristino, Jr.
    Newbury, Rhys
    Cosgun, Akansel
    Castillo, Pedro
    Vidolov, Boris
    Kulic, Dana
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03): : 4552 - 4559
  • [10] Multi-agent Reinforcement Learning for Decentralized Coalition Formation Games
    Taywade, Kshitija
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15738 - 15739