Large-Scale Multi-Agent Deep FBSDEs

Cited by: 0

Authors:

Chen, Tianrong [1]

Wang, Ziyi [2]

Exarchos, Ioannis [3]

Theodorou, Evangelos A. [2,4]

Affiliations:

[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

[2] Georgia Inst Technol, Ctr Machine Learning, Atlanta, GA 30332 USA

[3] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

[4] Georgia Inst Technol, Sch Aerosp Engn, Atlanta, GA 30332 USA

Source:

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139 | 2021 / Vol. 139

Keywords:

STOCHASTIC DIFFERENTIAL-GAMES;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper we present a scalable deep learning framework for finding Markovian Nash equilibria in multi-agent stochastic games using fictitious play. The framework is motivated by theoretical analysis of Forward-Backward Stochastic Differential Equations (FBSDEs) and their implementation in a deep learning setting, which is the source of our algorithm's improved sample efficiency. By exploiting the permutation invariance of agents in symmetric games, scalability and performance are further improved significantly. On an inter-bank lending/borrowing problem, our framework outperforms the state-of-the-art deep fictitious play algorithm on multiple metrics. More importantly, our approach scales up to 3000 agents in simulation, a scale which, to the best of our knowledge, represents a new state of the art. We also demonstrate the applicability of our framework in robotics on a belief-space autonomous racing problem.


Pages: 9

