Multi-Agent Adversarial Inverse Reinforcement Learning

被引:0
|
作者
Yu, Lantao [1 ]
Song, Jiaming [1 ]
Ermon, Stefano [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning agents are prone to undesired behaviors due to reward mis-specification. Finding a set of reward functions to properly guide agent behaviors is particularly challenging in multi-agent scenarios. Inverse reinforcement learning provides a framework to automatically acquire suitable reward functions from expert demonstrations. Its extension to multi-agent settings, however, is difficult due to the more complex notions of rational behaviors. In this paper, we propose MA-AIRL, a new framework for multi-agent inverse reinforcement learning, which is effective and scalable for Markov games with high-dimensional state-action space and unknown dynamics We derive our algorithm based on a new solution concept and maximum pseudolikelihood estimation within an adversarial reward learning framework. In the experiments, we demonstrate that MA-AIRL can recover reward functions that are highly correlated with ground truth ones, and significantly outperforms prior methods in terms of policy imitation.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Predicting Driver Behavior on the Highway with Multi-Agent Adversarial Inverse Reinforcement Learning
    Radtke, Henrik
    Bey, Henrik
    Sackmann, Moritz
    Schoen, Torsten
    [J]. 2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [2] Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning
    Liu, Guanlin
    Lai, Lifeng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Distributed hierarchical reinforcement learning in multi-agent adversarial environments
    Naderializadeh, Navid
    Soleyman, Sean
    Hung, Fan
    Khosla, Deepak
    Chen, Yang
    Fadaie, Joshua G.
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS IV, 2022, 12113
  • [4] Resilient Multi-Agent Reinforcement Learning with Adversarial Value Decomposition
    Phan, Thomy
    Belzner, Lenz
    Gabor, Thomas
    Sedlmeier, Andreas
    Ritz, Fabian
    Linnhoff-Popien, Claudia
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11308 - 11316
  • [5] Optimal Robust Formation of Multi-Agent Systems as Adversarial Graphical Apprentice Games With Inverse Reinforcement Learning
    Golmisheh, Fatemeh Mahdavi
    Shamaghdari, Saeed
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024,
  • [6] Distributed Inverse Constrained Reinforcement Learning for Multi-agent Systems
    Liu, Shicheng
    Zhu, Minghui
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [7] Evaluating strategic structures in multi-agent inverse reinforcement learning
    Fu, Justin
    Tacchetti, Andrea
    Perolat, Julien
    Bachrach, Yoram
    [J]. Journal of Artificial Intelligence Research, 2021, 71 (04): : 925 - 951
  • [8] Developing multi-agent adversarial environment using reinforcement learning and imitation learning
    Ziyao Han
    Yupeng Liang
    Kazuhiro Ohkura
    [J]. Artificial Life and Robotics, 2023, 28 : 703 - 709
  • [9] Developing multi-agent adversarial environment using reinforcement learning and imitation learning
    Han, Ziyao
    Liang, Yupeng
    Ohkura, Kazuhiro
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2023, 28 (04) : 703 - 709
  • [10] Multi-Agent Reinforcement Learning for Wireless Networks Against Adversarial Communications
    Lv, Zefang
    Chen, Yifan
    Xiao, Liang
    Yang, Helin
    Ji, Xiangyang
    [J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 3409 - 3414