Large-Scale Multi-Agent Deep FBSDEs

被引:0
|
作者
Chen, Tianrong [1 ]
Wang, Ziyi [2 ]
Exarchos, Ioannis [3 ]
Theodorou, Evangelos A. [2 ,4 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Georgia Inst Technol, Ctr Machine Learning, Atlanta, GA 30332 USA
[3] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[4] Georgia Inst Technol, Sch Aerosp Engn, Atlanta, GA 30332 USA
关键词
STOCHASTIC DIFFERENTIAL-GAMES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a scalable deep learning framework for finding Markovian Nash Equilibria in multi-agent stochastic games using fictitious play. The motivation is inspired by theoretical analysis of Forward Backward Stochastic Differential Equations (FBSDE) and their implementation in a deep learning setting, which is the source of our algorithm's sample efficiency improvement. By taking advantage of the permutation-invariant property of agents in symmetric games, the scalability and performance is further enhanced significantly. We showcase superior performance of our framework over the state-of-the-art deep fictitious play algorithm on an inter-bank lending/borrowing problem in terms of multiple metrics. More importantly, our approach scales up to 3000 agents in simulation, a scale which, to the best of our knowledge, represents a new state-of-the-art. We also demonstrate the applicability of our framework in robotics on a belief space autonomous racing problem.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Large-scale multi-agent transportation simulations
    Cetin, N
    Nagel, K
    Raney, B
    Voellmy, A
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2002, 147 (1-2) : 559 - 564
  • [2] Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control
    Chu, Tianshu
    Wang, Jie
    Codeca, Lara
    Li, Zhaojian
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1086 - 1095
  • [3] Adaptive agent selection in large-scale multi-agent systems
    Sugawara, Toshiharu
    Fukuda, Kensuke
    Hirotsu, Toshio
    Sato, Shin-ya
    Kurihara, Satoshi
    [J]. PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 818 - 822
  • [4] Requirements engineering for large-scale multi-agent systems
    Cysneiros, LM
    Yu, E
    [J]. SOFTWARE ENGINEERING FOR LARGE-SCALE MULTI-AGENT SYSTEMS: RESEARCH ISSUES AND PRACTICAL APPLICATIONS, 2003, 2603 : 39 - 56
  • [5] Detecting disagreements in large-scale multi-agent teams
    Kaminka, Gal A.
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) : 501 - 525
  • [6] Multi-Agent Decision Making in Large-Scale Systems
    ZHU Shijing
    WANG Shuning
    CHEN Ting Institute of Systems Engineering
    [J]. Journal of Systems Science and Systems Engineering, 1994, (03) : 211 - 217
  • [7] Organizational Metamodel for Large-Scale Multi-Agent Systems
    Duric, Bogdan Okresa
    [J]. TRENDS IN PRACTICAL APPLICATIONS OF SCALABLE MULTI-AGENT SYSTEMS, THE PAAMS COLLECTION, 2016, 473 : 387 - 390
  • [8] Macroscopic Observation of Large-scale Multi-agent Systems
    Lamarche-Perrin, Robin
    Demazeau, Yves
    Vincent, Jean-Marc
    [J]. 2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 121 - 127
  • [9] Multi-agent large-scale parallel crowd simulation
    Malinowski, Artur
    Czarnul, Pawel
    Czurylo, Krzysztof
    Maciejewski, Maciej
    Skowron, Pawel
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 917 - 926
  • [10] Towards reliable large-scale multi-agent systems
    Guessoum, Z
    Faci, N
    [J]. MULTI-AGENT SYSTEMS AND APPLICATIONS IV, PROCEEDINGS, 2005, 3690 : 430 - 439