FTPSG: Feature mixture transformer and potential-based subgoal generation for hierarchical multi-agent reinforcement learning

被引:0
|
作者
Nicholaus, Isack Thomas [1 ]
Kang, Dae-Ki [1 ]
机构
[1] Dongseo Univ, Dept Comp Engn, Busan 47011, South Korea
基金
新加坡国家研究基金会;
关键词
Hierarchical reinforcement learning; Subgoal generation; Multi-agent reinforcement learning;
D O I
10.1016/j.eswa.2025.126540
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical multi-agent reinforcement learning (HMAR) presents a promising approach for addressing complex multi-agent tasks. However, HMAR faces the challenge of identifying potential states or skills-subgoals that agents can efficiently solve. Our paper introduces a novel approach to subgoal generation within HMAR in response to learning signals in sparse delayed reward environments. We propose a Feature Mixture Transformer and Potential-based Subgoal Generation (FTPSG) as an efficient method for automatically generating promising subgoals by extracting and combining relevant features across past observations within a trajectory. Also, FTPSG utilizes a potential function to assess the probability of each subgoal leading agents to the ultimate goal. We design our potential function to rank these subgoals to achieve an actual goal and provide meaningful learning signals. Subgoals are then grouped based on their potential, prioritizing those with high potential as more crucial. This grouping enables agents to concentrate on the most important subgoals initially. We investigate the effectiveness of our proposed method across various multi-agent tasks, and the results consistently show that FTPSG outperforms state-of-the-art methods across all evaluated tasks. These findings affirm FTPSG's promising role in subgoal generation within the HMAR framework.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Reinforcement learning based on multi-agent in RoboCup
    Zhang, W
    Li, JG
    Ruan, XG
    ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 967 - 975
  • [22] Multi-AGV Scheduling based on Hierarchical Intrinsically Rewarded Multi-Agent Reinforcement Learning
    Zhang, Jiangshan
    Guo, Bin
    Sun, Zhuo
    Li, Mengyuan
    Liu, Jiaqi
    Yu, Zhiwen
    Fan, Xiaopeng
    2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022), 2022, : 155 - 161
  • [23] Multi-agent deep reinforcement learning with type-based hierarchical group communication
    Hao Jiang
    Dianxi Shi
    Chao Xue
    Yajie Wang
    Gongju Wang
    Yongjun Zhang
    Applied Intelligence, 2021, 51 : 5793 - 5808
  • [24] Reinforcement Learning Based Hierarchical Multi-Agent Robotic Search Team in Uncertain Environment
    Hamid, Shahzaib
    Nasir, Ali
    Saleem, Yasir
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2021, 40 (03) : 645 - 662
  • [25] A Study on Multi-Agent Reinforcement Learning Problem Based on Hierarchical Modular Fuzzy Model
    Watanabe, Toshihiko
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 2041 - 2046
  • [26] Hierarchical Policy Network with Multi-agent for Knowledge Graph Reasoning Based on Reinforcement Learning
    Zheng, Mingming
    Zhou, Yanquan
    Cui, Qingyao
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 445 - 457
  • [27] Multi-agent deep reinforcement learning with type-based hierarchical group communication
    Jiang, Hao
    Shi, Dianxi
    Xue, Chao
    Wang, Yajie
    Wang, Gongju
    Zhang, Yongjun
    APPLIED INTELLIGENCE, 2021, 51 (08) : 5793 - 5808
  • [28] GHGC: Goal-based Hierarchical Group Communication in Multi-Agent Reinforcement Learning
    Jiang, Hao
    Shi, Dianxi
    Xue, Chao
    Wang, Yajie
    Wang, Gongju
    Zhang, Yongjun
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3507 - 3514
  • [29] Feature Selection Method Using Multi-Agent Reinforcement Learning Based on Guide Agents
    Kim, Minwoo
    Bae, Jinhee
    Wang, Bohyun
    Ko, Hansol
    Lim, Joon S.
    SENSORS, 2023, 23 (01)
  • [30] Hierarchical Reinforcement Learning with Opponent Modeling for Distributed Multi-agent Cooperation
    Liang, Zhixuan
    Cao, Jiannong
    Jiang, Shan
    Saxena, Divya
    Xu, Huafeng
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 884 - 894