Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms

被引:0
|
作者
de Nijs, Frits [1 ]
Walraven, Erwin [2 ]
de Weerdt, Mathijs M. [2 ]
Spaan, Matthijs T. J. [2 ]
机构
[1] Monash Univ, Fac IT, Dept Data Sci & AI, 20 Exhibit Walk, Clayton, Vic 3168, Australia
[2] Delft Univ Technol, Van Mourik Broekmanweg 6, NL-2628 XE Delft, Netherlands
关键词
OPTIMAL POLICIES; COMPLEXITY; CHAINS; AGENTS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear when which algorithm can be applied. In this survey we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentarily (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple. Further we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems that are directly related to the conceptualized domains, as well as in adjacent research areas.
引用
下载
收藏
页码:955 / 1001
页数:47
相关论文
共 50 条
  • [1] Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms
    de Nijs, Frits
    Walraven, Erwin
    de Weerdt, Mathijs M.
    Spaan, Matthijs T.J.
    Journal of Artificial Intelligence Research, 2021, 70 : 955 - 1001
  • [2] Learning algorithms for finite horizon constrained markov decision processes
    Mittal, A.
    Hemachandra, N.
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2007, 3 (03) : 429 - 444
  • [3] Efficient Algorithms for Budget-Constrained Markov Decision Processes
    Caramanis, Constantine
    Dimitrov, Nedialko B.
    Morton, David P.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (10) : 2813 - 2817
  • [4] On constrained Markov decision processes
    Department of Econometrics, University of Sydney, Sydney, NSW 2006, Australia
    不详
    Oper Res Lett, 1 (25-28):
  • [5] On constrained Markov decision processes
    Haviv, M
    OPERATIONS RESEARCH LETTERS, 1996, 19 (01) : 25 - 28
  • [6] Learning in Constrained Markov Decision Processes
    Singh, Rahul
    Gupta, Abhishek
    Shroff, Ness B.
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01): : 441 - 453
  • [7] Multiagent, Multitarget Path Planning in Markov Decision Processes
    Nawaz, Farhad
    Ornik, Melkior
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7560 - 7574
  • [8] Dealing with Groups of Actions in Multiagent Markov Decision Processes
    Debras, Guillaume
    Mouaddib, Abdel-Illah
    Pierre, Laurent Jean
    Le Gloannec, Simon
    PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, VOL 1: ECTA, 2016, : 49 - 58
  • [9] Dynamic programming in constrained Markov decision processes
    Piunovskiy, A. B.
    CONTROL AND CYBERNETICS, 2006, 35 (03): : 645 - 660
  • [10] Robustness of policies in constrained Markov decision processes
    Zadorojniy, A
    Shwartz, A
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (04) : 635 - 638