Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning

被引：19

作者：

Ma, Aaron ^{[1
]}

Ouimet, Michael ^{[2
]}

Cortes, Jorge ^{[1
]}

机构：

[1] Univ Calif San Diego, Dept Mech & Aerosp Engn, La Jolla, CA 92093 USA

[2] Naval Informat Warfare Ctr Pacific, San Diego, CA USA

来源：

AUTONOMOUS ROBOTS | 2020年 / 44卷 / 3-4期

关键词：

Reinforcement learning; Multi-agent planning; Distributed robotics; Semi-Markov decision processes; Markov decision processes; Upper confidence bound tree search; Hierarchical planning; Hierarchical Markov decision processes; Model-based reinforcement learning; Swarm robotics; Dynamic domain reduction; Submodularity; POMDPS;

D O I：

10.1007/s10514-019-09871-2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider scenarios where a swarm of unmanned vehicles (UxVs) seek to satisfy a number of diverse, spatially distributed objectives. The UxVs strive to determine an efficient plan to service the objectives while operating in a coordinated fashion. We focus on developing autonomous high-level planning, where low-level controls are leveraged from previous work in distributed motion, target tracking, localization, and communication. We rely on the use of state and action abstractions in a Markov decision processes framework to introduce a hierarchical algorithm, Dynamic Domain Reduction for Multi-Agent Planning, that enables multi-agent planning for large multi-objective environments. Our analysis establishes the correctness of our search procedure within specific subsets of the environments, termed 'sub-environment' and characterizes the algorithm performance with respect to the optimal trajectories in single-agent and sequential multi-agent deployment scenarios using tools from submodularity. Simulated results show significant improvement over using a standard Monte Carlo tree search in an environment with large state and action spaces.

引用

页码：485 / 503

页数：19

共 50 条

[41] Dynamic Multichannel Access via Multi-Agent Reinforcement Learning: Throughput and Fairness Guarantees
Sohaib, Muhammad
Jeong, Jongjin
Jeon, Sang-Woon
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (06) : 3994 - 4008
[42] Dynamic Multichannel Access via Multi-agent Reinforcement Learning: Throughput and Fairness Guarantees
Sohaib, Muhammad
Jeong, Jongjin
Jeon, Sang-Woon
[J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
[43] Hierarchical Heterogeneous Multi-Agent Cross-Domain Search Method Based on Deep Reinforcement Learning
Dong, Shangqun
Liu, Meiqin
Dong, Shanling
Zheng, Ronghao
Wei, Ping
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024,
[44] Hierarchical Reinforcement Learning with Opponent Modeling for Distributed Multi-agent Cooperation
Liang, Zhixuan
Cao, Jiannong
Jiang, Shan
Saxena, Divya
Xu, Huafeng
[J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 884 - 894
[45] Hierarchical reinforcement learning based on multi-agent cooperation game theory
Tang, Hengliang
Dong, Chengang
[J]. International Journal of Wireless and Mobile Computing, 2019, 16 (04): : 369 - 376
[46] Hierarchical Control of Multi-Agent Systems using Online Reinforcement Learning
Bai, He
George, Jemin
Chakrabortty, Aranya
[J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 340 - 345
[47] AHAC: Actor Hierarchical Attention Critic for Multi-Agent Reinforcement Learning
Wang, Yajie
Shi, Dianxi
Xue, Chao
Jiang, Hao
Wang, Gongju
Gong, Peng
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3013 - 3020
[48] Hierarchical graph multi-agent reinforcement learning for traffic signal control
Yang, Shantian
[J]. INFORMATION SCIENCES, 2023, 634 : 55 - 72
[49] Target-Oriented Multi-Agent Coordination with Hierarchical Reinforcement Learning
Yu, Yuekang
Zhai, Zhongyi
Li, Weikun
Ma, Jianyu
[J]. APPLIED SCIENCES-BASEL, 2024, 14 (16):
[50] HCTA:Hierarchical Cooperative Task Allocation in Multi-Agent Reinforcement Learning
Wang, Mengke
Xie, Shaorong
Luo, Xiangfeng
Li, Yang
Zhang, Han
Yu, Hang
[J]. 2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 934 - 941

← 1 2 3 4 5 →