Fuzzy Reinforcement Learning Control for Decentralized Partially Observable Markov Decision Processes

被引:0
|
作者
Sharma, Rajneesh [1 ]
Spaan, Matthijs T. J. [2 ]
机构
[1] Netaji Subhas Inst Technol, Instrumentat & Control Div, New Delhi, India
[2] Inst Super Tecn, Inst Syst & Robot, Lisbon, Portugal
关键词
Reinforcement learning; Fuzzy systems; Cooperative multiagent systems; Decentralized POMDPs;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs) offer a powerful platform for optimizing sequential decision making in partially observable stochastic environments. However, finding optimal solutions for Dec-POMDPs is known to be intractable, necessitating approximate/suboptimal approaches. To address this problem, this work proposes a novel fuzzy reinforcement learning (RL) based game theoretic controller for Dec-POMDPs. The proposed controller implements fuzzy RL on Dec-POMDPs, which are modeled as a sequence of Bayesian games (BG). The main contributions of the work are the introduction of a game based RL paradigm in a Dec-POMDP settings, and the use of fuzzy inference systems to effectively generalize the underlying belief space. We apply the proposed technique on two benchmark problems and compare results against state-of-the-art Dec-POMDP control approach. The results validate the feasibility and effectiveness of using game theoretic RL based fuzzy control for addressing intractability of Dec-POMDPs, thus opening up a new research direction.
引用
收藏
页码:1422 / 1429
页数:8
相关论文
共 50 条
  • [31] Partially observable Markov decision processes with reward information
    Cao, XR
    Guo, XP
    2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, : 4393 - 4398
  • [32] Partially Observable Markov Decision Processes in Robotics: A Survey
    Lauri, Mikko
    Hsu, David
    Pajarinen, Joni
    IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (01) : 21 - 40
  • [33] A primer on partially observable Markov decision processes (POMDPs)
    Chades, Iadine
    Pascal, Luz V.
    Nicol, Sam
    Fletcher, Cameron S.
    Ferrer-Mestres, Jonathan
    METHODS IN ECOLOGY AND EVOLUTION, 2021, 12 (11): : 2058 - 2072
  • [34] Partially observable Markov decision processes with imprecise parameters
    Itoh, Hideaki
    Nakamura, Kiyohiko
    ARTIFICIAL INTELLIGENCE, 2007, 171 (8-9) : 453 - 490
  • [35] Minimal Disclosure in Partially Observable Markov Decision Processes
    Bertrand, Nathalie
    Genest, Blaise
    IARCS ANNUAL CONFERENCE ON FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE (FSTTCS 2011), 2011, 13 : 411 - 422
  • [36] Nonapproximability results for partially observable Markov decision processes
    Lusena, Cristopher
    Goldsmith, Judy
    Mundhenk, Martin
    1600, Morgan Kaufmann Publishers (14):
  • [37] Control limits for two-state partially observable Markov decision processes
    Grosfeld-Nir, Abraham
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 182 (01) : 300 - 304
  • [38] Optimal Control of Logically Constrained Partially Observable and Multiagent Markov Decision Processes
    Kalagarla, Krishna C.
    Kartik, Dhruva
    Shen, Dongming
    Jain, Rahul
    Nayyar, Ashutosh
    Nuzzo, Pierluigi
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2025, 70 (01) : 263 - 277
  • [39] THE PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES FRAMEWORK IN MEDICAL DECISION MAKING
    Goulionis, John E.
    Stengos, Dimitrios I.
    ADVANCES AND APPLICATIONS IN STATISTICS, 2008, 9 (02) : 205 - 232
  • [40] Decentralized control of multi-robot partially observable Markov decision processes using belief space macro-actions
    Omidshafiei, Shayegan
    Agha-Mohammadi, Ali-Akbar
    Amato, Christopher
    Liu, Shih-Yuan
    How, Jonathan P.
    Vian, John
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2017, 36 (02): : 231 - 258