Multi-Agent Adversarial Inverse Reinforcement Learning

被引:0
|
作者
Yu, Lantao [1 ]
Song, Jiaming [1 ]
Ermon, Stefano [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning agents are prone to undesired behaviors due to reward mis-specification. Finding a set of reward functions to properly guide agent behaviors is particularly challenging in multi-agent scenarios. Inverse reinforcement learning provides a framework to automatically acquire suitable reward functions from expert demonstrations. Its extension to multi-agent settings, however, is difficult due to the more complex notions of rational behaviors. In this paper, we propose MA-AIRL, a new framework for multi-agent inverse reinforcement learning, which is effective and scalable for Markov games with high-dimensional state-action space and unknown dynamics We derive our algorithm based on a new solution concept and maximum pseudolikelihood estimation within an adversarial reward learning framework. In the experiments, we demonstrate that MA-AIRL can recover reward functions that are highly correlated with ground truth ones, and significantly outperforms prior methods in terms of policy imitation.
引用
下载
收藏
页数:8
相关论文
共 50 条
  • [41] Learning structured communication for multi-agent reinforcement learning
    Sheng, Junjie
    Wang, Xiangfeng
    Jin, Bo
    Yan, Junchi
    Li, Wenhao
    Chang, Tsung-Hui
    Wang, Jun
    Zha, Hongyuan
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (02)
  • [42] Generalized learning automata for multi-agent reinforcement learning
    De Hauwere, Yann-Michael
    Vrancx, Peter
    Nowe, Ann
    AI COMMUNICATIONS, 2010, 23 (04) : 311 - 324
  • [43] Multi-agent Reinforcement Learning Aided Sampling Algorithms for a Class of Multiscale Inverse Problems
    Chung, Eric
    Leung, Wing Tat
    Pun, Sai-Mang
    Zhang, Zecheng
    JOURNAL OF SCIENTIFIC COMPUTING, 2023, 96 (02)
  • [44] Multi-agent Inverse Reinforcement Learning for Certain General-Sum Stochastic Games
    Lin, Xiaomin
    Adams, Stephen C.
    Beling, Peter A.
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 66 : 473 - 502
  • [45] Multi-agent Reinforcement Learning Aided Sampling Algorithms for a Class of Multiscale Inverse Problems
    Eric Chung
    Wing Tat Leung
    Sai-Mang Pun
    Zecheng Zhang
    Journal of Scientific Computing, 2023, 96
  • [46] Reinforcement learning of multi-agent communicative acts
    Hoet S.
    Sabouret N.
    Revue d'Intelligence Artificielle, 2010, 24 (02) : 159 - 188
  • [47] Multi-agent reinforcement learning for character control
    Li, Cheng
    Fussell, Levi
    Komura, Taku
    VISUAL COMPUTER, 2021, 37 (12): : 3115 - 3123
  • [48] Parallel and distributed multi-agent reinforcement learning
    Kaya, M
    Arslan, A
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, 2001, : 437 - 441
  • [49] Multi-agent Reinforcement Learning for Service Composition
    Lei, Yu
    Yu, Philip S.
    PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2016), 2016, : 790 - 793
  • [50] Coding for Distributed Multi-Agent Reinforcement Learning
    Wang, Baoqian
    Xie, Junfei
    Atanasov, Nikolay
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 10625 - 10631