Planning, learning and coordination in multiagent decision processes

被引:0
|
作者
Boutilier, C [1 ]
机构
[1] UNIV BRITISH COLUMBIA,DEPT COMP SCI,VANCOUVER,BC V6T 1Z4,CANADA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There has been a growing interest in AI in the design of multiagent systems, especially in multiagent cooperative planning. In this paper, we investigate the extent to which methods from single-agent planning and learning can be applied in multiagent settings. We survey a number of different techniques from decision-theoretic planning and reinforcement learning and describe a number of interesting issues that arise with regard to coordinating the policies of individual agents. To this end, we describe multiagent Markov decision processes as a general model in which to frame this discussion. These are special n-person cooperative games in which agents share the same utility function. We discuss coordination mechanisms based on imposed conventions (or social laws) as well as learning methods for coordination. Our focus is on the decomposition of sequential decision processes so that coordination can be learned (or imposed) locally, at the level of individual states. We also discuss the use of structured problem representations and their role in the generalization of learned conventions and in approximation.
引用
收藏
页码:195 / 210
页数:16
相关论文
共 50 条
  • [31] Multiagent cooperative learning based on coordination of boundary samples
    Han, Wei
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2008, 21 (01): : 111 - 115
  • [32] A layered approach to learning coordination knowledge in multiagent environments
    Guray Erus
    Faruk Polat
    Applied Intelligence, 2007, 27 : 249 - 267
  • [33] Learning spatial and expertise distribution coordination in multiagent systems
    El-Telbany, ME
    Abdel-Wahab, AH
    Shaheen, SI
    PROCEEDINGS OF THE 44TH IEEE 2001 MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1 AND 2, 2001, : 636 - 640
  • [34] DECISION THEORY AND PLANNING PROCESSES
    NORDBERG, L
    EKONOMISKA SAMFUNDETS TIDSKRIFT, 1972, 25 (04): : 254 - 260
  • [36] Decision-Theoretic Planning with Communication in Open Multiagent Systems
    Kakarlapudi, Anirudh
    Anil, Gayathri
    Eck, Adam
    Doshi, Prashant
    Soh, Leen-Kiat
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 938 - 948
  • [37] Recursive Reductions of Action Dependencies for Coordination-Based Multiagent Planning
    Tozicka, Jan
    Jakubuv, Jan
    Komenda, Antonin
    TRANSACTIONS ON COMPUTATIONAL COLLECTIVE INTELLIGENCE XXVIII, 2018, 10780 : 66 - 92
  • [38] Consistent epistemic planning for multiagent deep reinforcement learning
    Wu, Peiliang
    Luo, Shicheng
    Tian, Liqiang
    Mao, Bingyi
    Chen, Wenbai
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (05) : 1663 - 1675
  • [39] Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms
    de Nijs, Frits
    Walraven, Erwin
    de Weerdt, Mathijs M.
    Spaan, Matthijs T. J.
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2021, 70 : 955 - 1001
  • [40] Solving Multiagent Markov Decision Processes: A Forest Management Example
    Chades, Iadine
    Bouteiller, Bertrand
    MODSIM 2005: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING, 2005, : 1594 - 1600