Stigmergic Independent Reinforcement Learning for Multiagent Collaboration

被引:16
|
作者
Xu, Xing [1 ]
Li, Rongpeng [1 ]
Zhao, Zhifeng [2 ]
Zhang, Honggang [1 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Collaboration; Training; Shape; Insects; Task analysis; Intelligent agents; Wireless communication; Artificial intelligence; collective intelligence; multiagent collaboration; reinforcement learning; stigmergy; ANT COLONY; COORDINATION; ENVIRONMENT;
D O I
10.1109/TNNLS.2021.3056418
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid evolution of wireless mobile devices, there emerges an increased need to design effective collaboration mechanisms between intelligent agents to gradually approach the final collective objective by continuously learning from the environment based on their individual observations. In this regard, independent reinforcement learning (IRL) is often deployed in multiagent collaboration to alleviate the problem of a nonstationary learning environment. However, behavioral strategies of intelligent agents in IRL can be formulated only upon their local individual observations of the global environment, and appropriate communication mechanisms must be introduced to reduce their behavioral localities. In this article, we address the problem of communication between intelligent agents in IRL by jointly adopting mechanisms with two different scales. For the large scale, we introduce the stigmergy mechanism as an indirect communication bridge between independent learning agents, and carefully design a mathematical method to indicate the impact of digital pheromone. For the small scale, we propose a conflict-avoidance mechanism between adjacent agents by implementing an additionally embedded neural network to provide more opportunities for participants with higher action priorities. In addition, we present a federal training method to effectively optimize the neural network of each agent in a decentralized manner. Finally, we establish a simulation scenario in which a number of mobile agents in a certain area move automatically to form a specified target shape. Extensive simulations demonstrate the effectiveness of our proposed method.
引用
收藏
页码:4285 / 4299
页数:15
相关论文
共 50 条
  • [1] Trustable Policy Collaboration Scheme for Multi-Agent Stigmergic Reinforcement Learning
    Xu, Xing
    Li, Rongpeng
    Zhao, Zhifeng
    Zhang, Honggang
    [J]. IEEE COMMUNICATIONS LETTERS, 2022, 26 (04) : 823 - 827
  • [2] Satisficing Paths and Independent Multiagent Reinforcement Learning in Stochastic Games
    Yongacoglu, Bora
    Arslan, Gurdal
    Yuksel, Serdar
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2023, 5 (03): : 745 - 773
  • [3] Multiagent Collaboration for Emergency Evacuation Using Reinforcement Learning for Transportation Systems
    Yang, Yupeng
    Yu, Jiahao
    Liu, Dahai
    Lee, Sang-A
    Namilae, Sirish
    Islam, Sabique
    Gou, Huaxing
    Park, Hyoshin
    Song, Houbing
    [J]. IEEE Journal on Miniaturization for Air and Space Systems, 2022, 3 (04): : 232 - 241
  • [4] Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem
    Zhang, Chengwei
    Jin, Shan
    Xue, Wanli
    Xie, Xiaofei
    Chen, Shengyong
    Chen, Rong
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) : 7426 - 7436
  • [5] A multiagent reinforcement learning algorithm to solve the maximum independent set problem
    Alipour, Mir Mohammad
    Abdolhosseinzadeh, Mohsen
    [J]. MULTIAGENT AND GRID SYSTEMS, 2020, 16 (01) : 101 - 115
  • [6] Asymmetric multiagent reinforcement learning
    Könönen, V
    [J]. IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 336 - 342
  • [7] Lateral Transfer Learning for Multiagent Reinforcement Learning
    Shi, Haobin
    Li, Jingchen
    Mao, Jiahui
    Hwang, Kao-Shing
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1699 - 1711
  • [8] Learning to Teach in Cooperative Multiagent Reinforcement Learning
    Omidshafiei, Shayegan
    Kim, Dong-Ki
    Liu, Miao
    Tesauro, Gerald
    Riemer, Matthew
    Amato, Christopher
    Campbell, Murray
    How, Jonathan P.
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 6128 - 6136
  • [9] Simultaneously Learning and Advising in Multiagent Reinforcement Learning
    da Silva, Felipe Leno
    Glatt, Ruben
    Reali Costa, Anna Helena
    [J]. AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1100 - 1108
  • [10] Learning Cooperative Behaviours in Multiagent Reinforcement Learning
    Phon-Amnuaisuk, Somnuk
    [J]. NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 570 - 579