FTPSG: Feature mixture transformer and potential-based subgoal generation for hierarchical multi-agent reinforcement learning

被引:0
|
作者
Nicholaus, Isack Thomas [1 ]
Kang, Dae-Ki [1 ]
机构
[1] Dongseo Univ, Dept Comp Engn, Busan 47011, South Korea
基金
新加坡国家研究基金会;
关键词
Hierarchical reinforcement learning; Subgoal generation; Multi-agent reinforcement learning;
D O I
10.1016/j.eswa.2025.126540
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical multi-agent reinforcement learning (HMAR) presents a promising approach for addressing complex multi-agent tasks. However, HMAR faces the challenge of identifying potential states or skills-subgoals that agents can efficiently solve. Our paper introduces a novel approach to subgoal generation within HMAR in response to learning signals in sparse delayed reward environments. We propose a Feature Mixture Transformer and Potential-based Subgoal Generation (FTPSG) as an efficient method for automatically generating promising subgoals by extracting and combining relevant features across past observations within a trajectory. Also, FTPSG utilizes a potential function to assess the probability of each subgoal leading agents to the ultimate goal. We design our potential function to rank these subgoals to achieve an actual goal and provide meaningful learning signals. Subgoals are then grouped based on their potential, prioritizing those with high potential as more crucial. This grouping enables agents to concentrate on the most important subgoals initially. We investigate the effectiveness of our proposed method across various multi-agent tasks, and the results consistently show that FTPSG outperforms state-of-the-art methods across all evaluated tasks. These findings affirm FTPSG's promising role in subgoal generation within the HMAR framework.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Hierarchical Reinforcement Learning with Opponent Modeling for Distributed Multi-agent Cooperation
    Liang, Zhixuan
    Cao, Jiannong
    Jiang, Shan
    Saxena, Divya
    Xu, Huafeng
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 884 - 894
  • [32] Mitigating Bus Bunching via Hierarchical Multi-Agent Reinforcement Learning
    Yu, Mengdi
    Yang, Tao
    Li, Chunxiao
    Jin, Yaohui
    Xu, Yanyan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 9675 - 9692
  • [33] Hierarchical Control of Multi-Agent Systems using Online Reinforcement Learning
    Bai, He
    George, Jemin
    Chakrabortty, Aranya
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 340 - 345
  • [34] AHAC: Actor Hierarchical Attention Critic for Multi-Agent Reinforcement Learning
    Wang, Yajie
    Shi, Dianxi
    Xue, Chao
    Jiang, Hao
    Wang, Gongju
    Gong, Peng
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3013 - 3020
  • [35] Hierarchical graph multi-agent reinforcement learning for traffic signal control
    Yang, Shantian
    INFORMATION SCIENCES, 2023, 634 : 55 - 72
  • [36] HCTA:Hierarchical Cooperative Task Allocation in Multi-Agent Reinforcement Learning
    Wang, Mengke
    Xie, Shaorong
    Luo, Xiangfeng
    Li, Yang
    Zhang, Han
    Yu, Hang
    2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 934 - 941
  • [37] Target-Oriented Multi-Agent Coordination with Hierarchical Reinforcement Learning
    Yu, Yuekang
    Zhai, Zhongyi
    Li, Weikun
    Ma, Jianyu
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [38] Limitations and improvements on potential-based connectivity preservation of multi-agent systems
    Fan, Yuan
    Song, Cheng
    2013 IEEE 3RD ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL AND INTELLIGENT SYSTEMS (CYBER), 2013, : 349 - +
  • [39] Image Feature Classification Based on Multi-Agent Deep Reinforcement
    Zhang, Zewei
    Zhang, Jianxun
    Zou, Hang
    Li, Lin
    Nan, Hai
    Computer Engineering and Applications, 60 (07): : 222 - 228
  • [40] Automating Feature Subspace Exploration via Multi-Agent Reinforcement Learning
    Liu, Kunpeng
    Fu, Yanjie
    Wang, Pengfei
    Wu, Le
    Bo, Rui
    Li, Xiaolin
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 207 - 215