Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning

被引:19
|
作者
Ma, Aaron [1 ]
Ouimet, Michael [2 ]
Cortes, Jorge [1 ]
机构
[1] Univ Calif San Diego, Dept Mech & Aerosp Engn, La Jolla, CA 92093 USA
[2] Naval Informat Warfare Ctr Pacific, San Diego, CA USA
关键词
Reinforcement learning; Multi-agent planning; Distributed robotics; Semi-Markov decision processes; Markov decision processes; Upper confidence bound tree search; Hierarchical planning; Hierarchical Markov decision processes; Model-based reinforcement learning; Swarm robotics; Dynamic domain reduction; Submodularity; POMDPS;
D O I
10.1007/s10514-019-09871-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider scenarios where a swarm of unmanned vehicles (UxVs) seek to satisfy a number of diverse, spatially distributed objectives. The UxVs strive to determine an efficient plan to service the objectives while operating in a coordinated fashion. We focus on developing autonomous high-level planning, where low-level controls are leveraged from previous work in distributed motion, target tracking, localization, and communication. We rely on the use of state and action abstractions in a Markov decision processes framework to introduce a hierarchical algorithm, Dynamic Domain Reduction for Multi-Agent Planning, that enables multi-agent planning for large multi-objective environments. Our analysis establishes the correctness of our search procedure within specific subsets of the environments, termed 'sub-environment' and characterizes the algorithm performance with respect to the optimal trajectories in single-agent and sequential multi-agent deployment scenarios using tools from submodularity. Simulated results show significant improvement over using a standard Monte Carlo tree search in an environment with large state and action spaces.
引用
收藏
页码:485 / 503
页数:19
相关论文
共 50 条
  • [41] Dynamic Multichannel Access via Multi-Agent Reinforcement Learning: Throughput and Fairness Guarantees
    Sohaib, Muhammad
    Jeong, Jongjin
    Jeon, Sang-Woon
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (06) : 3994 - 4008
  • [42] Dynamic Multichannel Access via Multi-agent Reinforcement Learning: Throughput and Fairness Guarantees
    Sohaib, Muhammad
    Jeong, Jongjin
    Jeon, Sang-Woon
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [43] Hierarchical Heterogeneous Multi-Agent Cross-Domain Search Method Based on Deep Reinforcement Learning
    Dong, Shangqun
    Liu, Meiqin
    Dong, Shanling
    Zheng, Ronghao
    Wei, Ping
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024,
  • [44] Hierarchical Reinforcement Learning with Opponent Modeling for Distributed Multi-agent Cooperation
    Liang, Zhixuan
    Cao, Jiannong
    Jiang, Shan
    Saxena, Divya
    Xu, Huafeng
    [J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 884 - 894
  • [45] Hierarchical reinforcement learning based on multi-agent cooperation game theory
    Tang, Hengliang
    Dong, Chengang
    [J]. International Journal of Wireless and Mobile Computing, 2019, 16 (04): : 369 - 376
  • [46] Hierarchical Control of Multi-Agent Systems using Online Reinforcement Learning
    Bai, He
    George, Jemin
    Chakrabortty, Aranya
    [J]. 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 340 - 345
  • [47] AHAC: Actor Hierarchical Attention Critic for Multi-Agent Reinforcement Learning
    Wang, Yajie
    Shi, Dianxi
    Xue, Chao
    Jiang, Hao
    Wang, Gongju
    Gong, Peng
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3013 - 3020
  • [48] Hierarchical graph multi-agent reinforcement learning for traffic signal control
    Yang, Shantian
    [J]. INFORMATION SCIENCES, 2023, 634 : 55 - 72
  • [49] Target-Oriented Multi-Agent Coordination with Hierarchical Reinforcement Learning
    Yu, Yuekang
    Zhai, Zhongyi
    Li, Weikun
    Ma, Jianyu
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [50] HCTA:Hierarchical Cooperative Task Allocation in Multi-Agent Reinforcement Learning
    Wang, Mengke
    Xie, Shaorong
    Luo, Xiangfeng
    Li, Yang
    Zhang, Han
    Yu, Hang
    [J]. 2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 934 - 941