Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms

被引:0
|
作者
de Nijs, Frits [1 ]
Walraven, Erwin [2 ]
de Weerdt, Mathijs M. [2 ]
Spaan, Matthijs T. J. [2 ]
机构
[1] Monash Univ, Fac IT, Dept Data Sci & AI, 20 Exhibit Walk, Clayton, Vic 3168, Australia
[2] Delft Univ Technol, Van Mourik Broekmanweg 6, NL-2628 XE Delft, Netherlands
关键词
OPTIMAL POLICIES; COMPLEXITY; CHAINS; AGENTS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear when which algorithm can be applied. In this survey we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentarily (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple. Further we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems that are directly related to the conceptualized domains, as well as in adjacent research areas.
引用
收藏
页码:955 / 1001
页数:47
相关论文
共 50 条
  • [21] A taxonomy for similarity metrics between Markov decision processes
    Garcia, Javier
    Visus, Alvaro
    Fernandez, Fernando
    MACHINE LEARNING, 2022, 111 (11) : 4217 - 4247
  • [22] A taxonomy for similarity metrics between Markov decision processes
    Javier García
    Álvaro Visús
    Fernando Fernández
    Machine Learning, 2022, 111 : 4217 - 4247
  • [23] Solving Multiagent Markov Decision Processes: A Forest Management Example
    Chades, Iadine
    Bouteiller, Bertrand
    MODSIM 2005: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING, 2005, : 1594 - 1600
  • [24] LEARNING ALGORITHMS FOR MARKOV DECISION-PROCESSES
    KURANO, M
    JOURNAL OF APPLIED PROBABILITY, 1987, 24 (01) : 270 - 276
  • [25] Polynomial Classification Algorithms for Markov Decision Processes
    Feinberg, Eugene A.
    Yang, Fenghsu
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 4485 - 4490
  • [26] Joint chance-constrained Markov decision processes
    Varagapriya, V.
    Singh, Vikas Vikram
    Lisser, Abdel
    ANNALS OF OPERATIONS RESEARCH, 2023, 322 (02) : 1013 - 1035
  • [27] Strict-sense constrained Markov decision processes
    Hsu, SP
    Arapostathis, A
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 194 - 199
  • [28] HCMDP: a Hierarchical Solution to Constrained Markov Decision Processes
    Feyzabadi, Seyedshams
    Carpin, Stefano
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 3971 - 3978
  • [29] Constrained discounted Markov decision processes and Hamiltonian cycles
    Feinberg, EA
    PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 2821 - 2826
  • [30] Constrained discounted Markov decision processes and Hamiltonian Cycles
    Feinberg, EA
    MATHEMATICS OF OPERATIONS RESEARCH, 2000, 25 (01) : 130 - 140