A Projection-based Exploration Method for Multi-Agent Coordination

被引:0
|
作者
Tang, Hainan [1 ]
Liu, Juntao [1 ]
Wang, Zhenjie [1 ]
Gao, Ziwen [1 ]
Li, You [2 ]
机构
[1] Wuhan Digital Engn Inst, Wuhan, Hubei, Peoples R China
[2] Hubei Univ, Wuhan, Hubei, Peoples R China
来源
PROCEEDINGS OF THE 2024 3RD INTERNATIONAL SYMPOSIUM ON INTELLIGENT UNMANNED SYSTEMS AND ARTIFICIAL INTELLIGENCE, SIUSAI 2024 | 2024年
关键词
Projection Exploration; Multi-agent Coordination; Maximum distribution entropy;
D O I
10.1145/3669721.3669723
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-agent reinforcement learning (MARL), states with high exploration value are difficult to be identified and coordinately visited, resulting in low learning efficiency. To this end, a projection-based exploration method for multi-agent coordination (PEMAC) is proposed. Goal states are selected using the count-based approach in the optimal projection space, of which the entropy of state distribution is maximal. Then, by reshaping the rewards in the replay buffer, agents are trained to visit those high-value states in a coordinated manner. In order to verify the effectiveness of the proposed method, comparative experiments are conducted in the multi-particle environment (MPE), in which dense-reward and sparse-reward settings are all both considered. Corresponding results suggest that PEMAC can effectively improve learning efficiency.
引用
收藏
页码:8 / 14
页数:7
相关论文
共 50 条
  • [21] Multi-agent Coordination Based On Contract Net Protocol
    Sun, Defu
    Wu, Juhua
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT UBIQUITOUS COMPUTING AND EDUCATION, 2009, : 353 - +
  • [22] Multi-Agent Coordination with Event-based Communication
    Teixeira, Pedro V.
    Dimarogonas, Dimos V.
    Johansson, Karl H.
    Sousa, Joao
    2010 AMERICAN CONTROL CONFERENCE, 2010, : 824 - 829
  • [23] Microgrids Coordination Based On Heterogeneous Multi-Agent Systems
    Toro, Vladimir
    Mojica-Nava, Eduardo
    2015 IEEE 2ND COLOMBIAN CONFERENCE ON AUTOMATIC CONTROL (CCAC), 2015,
  • [24] An organisation infrastructure for Multi-Agent Systems based on agent coordination contexts
    Viroli, M
    Omicini, A
    Ricci, A
    AI*IA2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3673 : 198 - 211
  • [25] Research on the multi-Agent modeling and simulating method of CAS and the Agent coordination model
    Ni, JJ
    Ma, XP
    Xu, LZ
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES A-MATHEMATICAL ANALYSIS, 2006, 13 : 1426 - 1435
  • [26] Multi-agent cooperation policy gradient method based on enhanced exploration for cooperative tasks
    Li-yang Zhao
    Tian-qing Chang
    Lei Zhang
    Xin-lu Zhang
    Jiang-feng Wang
    International Journal of Machine Learning and Cybernetics, 2024, 15 : 1431 - 1452
  • [27] Optimal Distributed Controllers Based on Gradient-Flow Method for Multi-Agent Coordination
    Sakurama, Kazunori
    Azuma, Shun-ichi
    Sugie, Toshiharu
    2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 2133 - 2138
  • [28] Multi-agent cooperation policy gradient method based on enhanced exploration for cooperative tasks
    Zhao, Li-yang
    Chang, Tian-qing
    Zhang, Lei
    Zhang, Xin-lu
    Wang, Jiang-feng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (04) : 1431 - 1452
  • [29] Distributed Coordination Method of Microgrid Economic Operation Optimization Based on Multi-Agent System
    Luo, Kui
    Shi, Wenhui
    2014 INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY (POWERCON), 2014, : 3135 - 3140
  • [30] A Projection-Based Method for Shape Measurement
    Thanh Phuong Nguyen
    Xuan Son Nguyen
    Mohamed Anouar Borgi
    M. K. Nguyen
    Journal of Mathematical Imaging and Vision, 2020, 62 : 489 - 504