Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination

被引:3
|
作者
Han, Dongge [1 ]
Bohmer, Wendelin [1 ]
Wooldridge, Michael [1 ]
Rogers, Alex [1 ]
机构
[1] Univ Oxford, Dept Comp Sci, Oxford, England
关键词
Multi-agent Learning; Hierarchcial reinforcement learning;
D O I
10.1007/978-3-030-29911-8_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a multi-agent system, an agent's optimal policy will typically depend on the policies chosen by others. Therefore, a key issue in multi-agent systems research is that of predicting the behaviours of others, and responding promptly to changes in such behaviours. One obvious possibility is for each agent to broadcast their current intention, for example, the currently executed option in a hierarchical reinforcement learning framework. However, this approach results in inflexibility of agents if options have an extended duration and are dynamic. While adjusting the executed option at each step improves flexibility from a single-agent perspective, frequent changes in options can induce inconsistency between an agent's actual behaviour and its broadcast intention. In order to balance flexibility and predictability, we propose a dynamic termination Bellman equation that allows the agents to flexibly terminate their options. We evaluate our models empirically on a set of multi-agent pursuit and taxi tasks, and show that our agents learn to adapt flexibly across scenarios that require different termination behaviours.
引用
收藏
页码:80 / 92
页数:13
相关论文
共 50 条
  • [1] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Boehmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
  • [2] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    [J]. Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [3] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
  • [4] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
    Ma, Aaron
    Ouimet, Michael
    Cortes, Jorge
    [J]. AUTONOMOUS ROBOTS, 2020, 44 (3-4) : 485 - 503
  • [5] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
    Aaron Ma
    Michael Ouimet
    Jorge Cortés
    [J]. Autonomous Robots, 2020, 44 : 485 - 503
  • [6] Studies on hierarchical reinforcement learning in multi-agent environment
    Yu Lasheng
    Marin, Alonso
    Hong Fei
    Lin Jian
    [J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 1714 - 1720
  • [7] Multi-agent hierarchical reinforcement learning for energy management
    Jendoubi, Imen
    Bouffard, Francois
    [J]. APPLIED ENERGY, 2023, 332
  • [8] HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy
    Zhang, Hancheng
    Li, Guozheng
    Liu, Chi Harold
    Wang, Guoren
    Tang, Jian
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3239 - 3248
  • [9] Hierarchical Architecture for Multi-Agent Reinforcement Learning in Intelligent Game
    Li, Bin
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation
    Ding, Wenhao
    Li, Shuaijun
    Qian, Huihuan
    Chen, Yongquan
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 237 - 242