Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination

被引：3

作者：

Han, Dongge ^{[1
]}

Bohmer, Wendelin ^{[1
]}

Wooldridge, Michael ^{[1
]}

Rogers, Alex ^{[1
]}

机构：

[1] Univ Oxford, Dept Comp Sci, Oxford, England

来源：

PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II | 2019年 / 11671卷

关键词：

Multi-agent Learning; Hierarchcial reinforcement learning;

D O I：

10.1007/978-3-030-29911-8_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In a multi-agent system, an agent's optimal policy will typically depend on the policies chosen by others. Therefore, a key issue in multi-agent systems research is that of predicting the behaviours of others, and responding promptly to changes in such behaviours. One obvious possibility is for each agent to broadcast their current intention, for example, the currently executed option in a hierarchical reinforcement learning framework. However, this approach results in inflexibility of agents if options have an extended duration and are dynamic. While adjusting the executed option at each step improves flexibility from a single-agent perspective, frequent changes in options can induce inconsistency between an agent's actual behaviour and its broadcast intention. In order to balance flexibility and predictability, we propose a dynamic termination Bellman equation that allows the agents to flexibly terminate their options. We evaluate our models empirically on a set of multi-agent pursuit and taxi tasks, and show that our agents learn to adapt flexibly across scenarios that require different termination behaviours.

引用

页码：80 / 92

页数：13

共 50 条

[1] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
Han, Dongge
Boehmer, Wendelin
Wooldridge, Michael
Rogers, Alex
[J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
[2] Hierarchical multi-agent reinforcement learning
Mohammad Ghavamzadeh
Sridhar Mahadevan
Rajbala Makar
[J]. Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
[3] Hierarchical multi-agent reinforcement learning
Ghavamzadeh, Mohammad
Mahadevan, Sridhar
Makar, Rajbala
[J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
[4] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
Ma, Aaron
Ouimet, Michael
Cortes, Jorge
[J]. AUTONOMOUS ROBOTS, 2020, 44 (3-4) : 485 - 503
[5] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
Aaron Ma
Michael Ouimet
Jorge Cortés
[J]. Autonomous Robots, 2020, 44 : 485 - 503
[6] Studies on hierarchical reinforcement learning in multi-agent environment
Yu Lasheng
Marin, Alonso
Hong Fei
Lin Jian
[J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 1714 - 1720
[7] Multi-agent hierarchical reinforcement learning for energy management
Jendoubi, Imen
Bouffard, Francois
[J]. APPLIED ENERGY, 2023, 332
[8] HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy
Zhang, Hancheng
Li, Guozheng
Liu, Chi Harold
Wang, Guoren
Tang, Jian
[J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3239 - 3248
[9] Hierarchical Architecture for Multi-Agent Reinforcement Learning in Intelligent Game
Li, Bin
[J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[10] Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation
Ding, Wenhao
Li, Shuaijun
Qian, Huihuan
Chen, Yongquan
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 237 - 242

← 1 2 3 4 5 →