Multi-agent Reinforcement Learning in Spatial Domain Tasks using Inter Subtask Empowerment Rewards

Citations: 0
Authors
Pateria, Shubham [1]
Subagdja, Budhitama [2]
Tan, Ah-Hwee [1]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] Nanyang Technol Univ, ST Engn NTU Corp Lab, Singapore, Singapore
Keywords
Multi-agent Coordination; Reinforcement Learning; Search and Rescue
DOI
10.1109/ssci44817.2019.9002777
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In complex multi-agent tasks, agents must cooperate by distributing the relevant subtasks among themselves to achieve joint task objectives. An agent's choice of subtask changes over time as the state of the task environment changes. Multi-agent Hierarchical Reinforcement Learning (MAHRL) provides an approach for learning to select subtasks in response to environment states, using the joint task rewards to train the agents. When the joint task involves complex inter-agent dependencies, only a subset of agents might be capable of reaching the rewarding task states, while the other agents take precursory or intermediate roles. In such tasks, the delayed task reward might not be sufficient for learning the coordinating policies of the agents. In this paper, we introduce a novel MAHRL approach called Inter-Subtask Empowerment based Multi-agent Options (ISEMO), in which an Inter-Subtask Empowerment Reward (ISER) is given to an agent that enables the precondition(s) of other agents' subtasks. ISER is given in addition to the domain task reward in order to improve inter-agent coordination. ISEMO also incorporates an options model that can learn parameterized subtask termination functions, relaxing the limitations posed by hand-crafted termination conditions. Experiments in a spatial Search and Rescue domain show that ISEMO can learn subtask selection policies grounded in the inter-dependencies among the agents, learn the subtask termination conditions, and perform better than the standard MAHRL technique.
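The reward idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the symbolic boolean state, the `Subtask` class, the `iser_bonus` and `shaped_reward` helpers, and the fixed `bonus` value are all hypothetical names and simplifications introduced here. The sketch assumes that an agent earns an extra reward whenever a transition it caused turns the precondition of another agent's subtask from false to true, and that this bonus is simply added to the domain task reward.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical symbolic state: named boolean facts about the environment.
State = Dict[str, bool]


@dataclass
class Subtask:
    name: str
    owner: str                              # agent that would execute this subtask
    precondition: Callable[[State], bool]   # True when the subtask can start


def iser_bonus(agent: str, prev_state: State, next_state: State,
               subtasks: List[Subtask], bonus: float = 1.0) -> float:
    """Sum a bonus for every other-agent subtask whose precondition
    was false before the transition and is true after it."""
    total = 0.0
    for subtask in subtasks:
        if subtask.owner == agent:
            continue  # only enabling *other* agents' subtasks is rewarded
        if not subtask.precondition(prev_state) and subtask.precondition(next_state):
            total += bonus
    return total


def shaped_reward(task_reward: float, agent: str, prev_state: State,
                  next_state: State, subtasks: List[Subtask],
                  bonus: float = 1.0) -> float:
    """Domain task reward plus the empowerment-style bonus (assumed additive)."""
    return task_reward + iser_bonus(agent, prev_state, next_state, subtasks, bonus)


# Toy search-and-rescue example: a scout locates a victim, which enables
# the rescuer's "rescue" subtask even though no task reward arrives yet.
before = {"victim_located": False}
after = {"victim_located": True}
subtasks = [Subtask("rescue", "rescuer", lambda s: s["victim_located"])]

print(shaped_reward(0.0, "scout", before, after, subtasks))    # 1.0
print(shaped_reward(0.0, "rescuer", before, after, subtasks))  # 0.0
```

In this toy case the scout receives a non-zero learning signal for its precursory role, while the rescuer gets no bonus for a precondition of its own subtask. The sketch credits the transition to whichever agent is passed in; how the actual method attributes enabling events to agents is not specified in this record.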
Pages: 86 - 93
Number of pages: 8
Related papers
50 in total
  • [21] Multi-agent Polygon Formation using Reinforcement Learning
    Prasad, B. K. Swathi
    Manjunath, Aditya G.
    Ramasangu, Hariharan
    ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2017, : 159 - 165
  • [22] Multi-agent Reinforcement Learning using strategies and voting
    Partalas, Ioannis
    Feneris, Ioannis
    Vlahavas, Ioannis
    19TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, VOL II, PROCEEDINGS, 2007, : 318 - 324
  • [23] Cranes control using multi-agent reinforcement learning
    Arai, S
    Miyazaki, K
    Kobayashi, S
    INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 335 - 342
  • [24] Extended Markov Games to Learn Multiple Tasks in Multi-Agent Reinforcement Learning
    Leon, Borja G.
    Belardinelli, Francesco
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 139 - 146
  • [25] Credit assignment in heterogeneous multi-agent reinforcement learning for fully cooperative tasks
    Jiang, Kun
    Liu, Wenzhang
    Wang, Yuanda
    Dong, Lu
    Sun, Changyin
    APPLIED INTELLIGENCE, 2023, 53 (23) : 29205 - 29222
  • [26] Credit assignment in heterogeneous multi-agent reinforcement learning for fully cooperative tasks
    Kun Jiang
    Wenzhang Liu
    Yuanda Wang
    Lu Dong
    Changyin Sun
    Applied Intelligence, 2023, 53 : 29205 - 29222
  • [27] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [28] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [29] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
  • [30] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229