Multi-Agent Adversarial Inverse Reinforcement Learning

被引:0
|
作者
Yu, Lantao [1 ]
Song, Jiaming [1 ]
Ermon, Stefano [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning agents are prone to undesired behaviors due to reward mis-specification. Finding a set of reward functions to properly guide agent behaviors is particularly challenging in multi-agent scenarios. Inverse reinforcement learning provides a framework to automatically acquire suitable reward functions from expert demonstrations. Its extension to multi-agent settings, however, is difficult due to the more complex notions of rational behaviors. In this paper, we propose MA-AIRL, a new framework for multi-agent inverse reinforcement learning, which is effective and scalable for Markov games with high-dimensional state-action space and unknown dynamics We derive our algorithm based on a new solution concept and maximum pseudolikelihood estimation within an adversarial reward learning framework. In the experiments, we demonstrate that MA-AIRL can recover reward functions that are highly correlated with ground truth ones, and significantly outperforms prior methods in terms of policy imitation.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Quantum Multi-Agent Reinforcement Learning for Inverse Radiotherapy Treatment Planning Optimization
    Jamaluddin, J.
    Polan, D.
    Matrosic, C. K.
    Niraula, D.
    Epelman, M. A.
    Allen, S.
    Jolly, S.
    Jarema, D.
    Matuszak, M. M.
    El Naqa, I. M.
    MEDICAL PHYSICS, 2024, 51 (10) : 7701 - 7701
  • [32] Competitive Multi-agent Inverse Reinforcement Learning with Sub-optimal Demonstrations
    Wang, Xingyu
    Klabjan, Diego
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [33] ADVERSARIAL MULTI-AGENT REINFORCEMENT LEARNING ALGORITHM FOR ANOMALY NETWORK INTRUSION DETECTION SYSTEM
    Mohamed, Safa
    Ejbali, Ridha
    INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2021, 13 (03): : 87 - 102
  • [34] MACS: Multi-agent Adversarial Reinforcement Learning for Finding Diverse Critical Driving Scenarios
    Kang, Shuting
    Dong, Qian
    Xue, Yunzhi
    Wu, Yanjun
    2024 IEEE CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION, ICST 2024, 2024, : 1 - 12
  • [35] Adversarial Deep Reinforcement Learning for Improving the Robustness of Multi-agent Autonomous Driving Policies
    Sharif, Aizaz
    Marijan, Dusica
    2022 29TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC, 2022, : 61 - 70
  • [36] Multi-Agent Generative Adversarial Imitation Learning
    Song, Jiaming
    Ren, Hongyu
    Sadigh, Dorsa
    Ermon, Stefano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [37] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [38] Multi-agent inverse reinforcement learning with parallel coordinate descent method for improving learning speed
    Namikoshi K.
    Arai S.
    Transactions of the Japanese Society for Artificial Intelligence, 2021, 36 (05):
  • [39] TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING
    Cassano, Lucas
    Alghunaim, Sulaiman A.
    Sayed, Ali H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3062 - 3066
  • [40] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
    Xu, Dongsheng
    Qiao, Peng
    Dou, Yong
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551