A Local-and-Global Attention Reinforcement Learning Algorithm for Multiagent Cooperative Navigation

被引:7
|
作者
Song, Chunwei [1 ]
He, Zichen
Dong, Lu [2 ]
机构
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[2] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China
基金
中国国家自然科学基金;
关键词
Index Terms-Cooperative navigation; multiagent reinforcement learning (MARL); multiagent systems; self-attention; COLLISION-AVOIDANCE;
D O I
10.1109/TNNLS.2022.3220798
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The cooperative navigation algorithm is the crucial technology for multirobot systems to accomplish autonomous collaborative operations, and it is still a challenge for researchers. In this work, we propose a new multiagent reinforcement learning algorithm called multiagent local-and-global attention actor-critic (MLGA2C) for multiagent cooperative navigation. Inspired by the attention mechanism, we design the local-and-global attention module to dynamically extract and encode critical environmental features. Meanwhile, based on the centralized training and decentralized execution (CTDE) paradigm, we extend a new actor-critic method to handle feature encoding and make navigation decisions. We also evaluate the proposed algorithm in two cooperative navigation scenarios: static target navigation and dynamic pedestrian target tracking. The multiple experimental results show that our algorithm performs well in cooperative navigation tasks with increasing agents.
引用
收藏
页码:7767 / 7777
页数:11
相关论文
共 50 条
  • [1] Hierarchical and Stable Multiagent Reinforcement Learning for Cooperative Navigation Control
    Jin, Yue
    Wei, Shuangqing
    Yuan, Jian
    Zhang, Xudong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) : 90 - 103
  • [2] A symbiotic genetic algorithm with local-and-global mapping search for reinforcement fuzzy control
    Juang, Chia-Feng
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2008, 19 (02) : 103 - 114
  • [3] A symbiotic genetic algorithm with local-and-global mapping search for reinforcement fuzzy control
    Juang, Chia-Feng
    [J]. Journal of Intelligent and Fuzzy Systems, 2008, 19 (02): : 103 - 114
  • [4] A REINFORCEMENT LEARNING APPROACH FOR MULTIAGENT NAVIGATION
    Martinez-Gil, Francisco
    Barber, Fernando
    Lozano, Miguel
    Grimaldo, Francisco
    Fernandez, Fernando
    [J]. ICAART 2010: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1: ARTIFICIAL INTELLIGENCE, 2010, : 607 - 610
  • [5] Learning to Teach in Cooperative Multiagent Reinforcement Learning
    Omidshafiei, Shayegan
    Kim, Dong-Ki
    Liu, Miao
    Tesauro, Gerald
    Riemer, Matthew
    Amato, Christopher
    Campbell, Murray
    How, Jonathan P.
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6128 - 6136
  • [6] Learning Cooperative Behaviours in Multiagent Reinforcement Learning
    Phon-Amnuaisuk, Somnuk
    [J]. NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 570 - 579
  • [7] FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks
    Zhang, Zhen
    Zhao, Dongbin
    Gao, Junwei
    Wang, Dongqing
    Dai, Yujie
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (06) : 1367 - 1379
  • [8] A multiagent cooperative learning algorithm
    Liu, Fei
    Zeng, Guangzhou
    [J]. COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN III, 2007, 4402 : 739 - +
  • [9] Cooperative Multiagent Reinforcement Learning With Partial Observations
    Zhang, Yan
    Zavlanos, Michael M.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (02) : 968 - 981
  • [10] The dynamics of reinforcement learning in cooperative multiagent systems
    Claus, C
    Boutilier, C
    [J]. FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 746 - 752