Planning, learning and coordination in multiagent decision processes

被引：0

作者：

Boutilier, C ^{[1
]}

机构：

[1] UNIV BRITISH COLUMBIA,DEPT COMP SCI,VANCOUVER,BC V6T 1Z4,CANADA

来源：

THEORETICAL ASPECTS OF RATIONALITY AND KNOWLEDGE | 1996年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

There has been a growing interest in AI in the design of multiagent systems, especially in multiagent cooperative planning. In this paper, we investigate the extent to which methods from single-agent planning and learning can be applied in multiagent settings. We survey a number of different techniques from decision-theoretic planning and reinforcement learning and describe a number of interesting issues that arise with regard to coordinating the policies of individual agents. To this end, we describe multiagent Markov decision processes as a general model in which to frame this discussion. These are special n-person cooperative games in which agents share the same utility function. We discuss coordination mechanisms based on imposed conventions (or social laws) as well as learning methods for coordination. Our focus is on the decomposition of sequential decision processes so that coordination can be learned (or imposed) locally, at the level of individual states. We also discuss the use of structured problem representations and their role in the generalization of learned conventions and in approximation.

引用

页码：195 / 210

页数：16

共 50 条

[31] Multiagent cooperative learning based on coordination of boundary samples
Han, Wei
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2008, 21 (01): : 111 - 115
[32] A layered approach to learning coordination knowledge in multiagent environments
Guray Erus
Faruk Polat
Applied Intelligence, 2007, 27 : 249 - 267
[33] Learning spatial and expertise distribution coordination in multiagent systems
El-Telbany, ME
Abdel-Wahab, AH
Shaheen, SI
PROCEEDINGS OF THE 44TH IEEE 2001 MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1 AND 2, 2001, : 636 - 640
[34] DECISION THEORY AND PLANNING PROCESSES
NORDBERG, L
EKONOMISKA SAMFUNDETS TIDSKRIFT, 1972, 25 (04): : 254 - 260
[35] Multiagent systems - Challenges and opportunities for decision-theoretic planning
Boutilier, C
AI MAGAZINE, 1999, 20 (04) : 35 - 43
[36] Decision-Theoretic Planning with Communication in Open Multiagent Systems
Kakarlapudi, Anirudh
Anil, Gayathri
Eck, Adam
Doshi, Prashant
Soh, Leen-Kiat
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 938 - 948
[37] Recursive Reductions of Action Dependencies for Coordination-Based Multiagent Planning
Tozicka, Jan
Jakubuv, Jan
Komenda, Antonin
TRANSACTIONS ON COMPUTATIONAL COLLECTIVE INTELLIGENCE XXVIII, 2018, 10780 : 66 - 92
[38] Consistent epistemic planning for multiagent deep reinforcement learning
Wu, Peiliang
Luo, Shicheng
Tian, Liqiang
Mao, Bingyi
Chen, Wenbai
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (05) : 1663 - 1675
[39] Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms
de Nijs, Frits
Walraven, Erwin
de Weerdt, Mathijs M.
Spaan, Matthijs T. J.
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2021, 70 : 955 - 1001
[40] Solving Multiagent Markov Decision Processes: A Forest Management Example
Chades, Iadine
Bouteiller, Bertrand
MODSIM 2005: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING, 2005, : 1594 - 1600

← 1 2 3 4 5 →