A Projection-based Exploration Method for Multi-Agent Coordination

Cited by: 0

Authors:

Tang, Hainan [1]

Liu, Juntao [1]

Wang, Zhenjie [1]

Gao, Ziwen [1]

Li, You [2]

Affiliations:

[1] Wuhan Digital Engn Inst, Wuhan, Hubei, Peoples R China

[2] Hubei Univ, Wuhan, Hubei, Peoples R China

Source:

PROCEEDINGS OF THE 2024 3RD INTERNATIONAL SYMPOSIUM ON INTELLIGENT UNMANNED SYSTEMS AND ARTIFICIAL INTELLIGENCE, SIUSAI 2024 | 2024

Keywords:

Projection Exploration; Multi-agent Coordination; Maximum distribution entropy;

DOI:

10.1145/3669721.3669723

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In multi-agent reinforcement learning (MARL), states with high exploration value are difficult to identify and to visit in a coordinated way, which results in low learning efficiency. To this end, a projection-based exploration method for multi-agent coordination (PEMAC) is proposed. Goal states are selected using a count-based approach in the optimal projection space, that is, the projection space in which the entropy of the state distribution is maximal. Then, by reshaping the rewards in the replay buffer, agents are trained to visit those high-value states in a coordinated manner. To verify the effectiveness of the proposed method, comparative experiments are conducted in the multi-particle environment (MPE), in which both dense-reward and sparse-reward settings are considered. The results suggest that PEMAC can effectively improve learning efficiency.
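The abstract names three steps: pick the projection space with maximal state-distribution entropy, select a goal state there by visit counts, and reshape rewards in the replay buffer. The following is only a minimal sketch of how such a pipeline could look, not the authors' implementation. Everything specific in it is an assumption made for illustration: projections are taken to be axis-aligned subsets of state dimensions, states are discretized with a fixed `bin_width`, the goal is the least-visited projected cell, and the reshaping adds a constant `bonus`. All function names are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def project(state, dims, bin_width=1.0):
    """Project a state onto the chosen dimensions and discretize into a cell."""
    return tuple(math.floor(state[d] / bin_width) for d in dims)


def entropy(counts):
    """Shannon entropy (in nats) of the empirical distribution given by counts."""
    total = sum(counts.values())
    return -sum((c / total) * math.log(c / total) for c in counts.values())


def best_projection(states, k, bin_width=1.0):
    """Assumed search: try every k-dimensional axis-aligned projection and
    keep the one whose visit distribution has the highest entropy."""
    best_dims, best_counts, best_h = None, None, -1.0
    for dims in combinations(range(len(states[0])), k):
        counts = Counter(project(s, dims, bin_width) for s in states)
        h = entropy(counts)
        if h > best_h:
            best_dims, best_counts, best_h = dims, counts, h
    return best_dims, best_counts


def select_goal(states, dims, counts, bin_width=1.0):
    """Count-based selection: return a visited state whose projected cell
    has the lowest visit count (ties go to the earliest state)."""
    return min(states, key=lambda s: counts[project(s, dims, bin_width)])


def reshape_rewards(buffer, goal, dims, bonus=1.0, bin_width=1.0):
    """Assumed reshaping: add a constant bonus to transitions whose next
    state falls into the goal's projected cell."""
    goal_cell = project(goal, dims, bin_width)
    reshaped = []
    for state, action, reward, next_state in buffer:
        if project(next_state, dims, bin_width) == goal_cell:
            reward = reward + bonus
        reshaped.append((state, action, reward, next_state))
    return reshaped


# Toy usage with 2-D states: only the second dimension varies.
visited = [(0.0, 0.0), (0.0, 1.0), (0.0, 2.0), (0.0, 2.0)]
dims, counts = best_projection(visited, k=1)
goal = select_goal(visited, dims, counts)
replay = [
    ((0.0, 2.0), 0, 0.0, (0.0, 0.0)),
    ((0.0, 0.0), 0, 0.5, (0.0, 2.0)),
]
new_replay = reshape_rewards(replay, goal, dims)
```

In this toy run the projection onto the second dimension is chosen because the first dimension never changes and so carries zero entropy. The method described in the abstract may define the projection family, the goal criterion, and the reward reshaping differently.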

Pages: 8-14

Page count: 7

50 items in total

[21] Multi-agent Coordination Based On Contract Net Protocol
Sun, Defu
Wu, Juhua
2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT UBIQUITOUS COMPUTING AND EDUCATION, 2009, : 353 - +
[22] Multi-Agent Coordination with Event-based Communication
Teixeira, Pedro V.
Dimarogonas, Dimos V.
Johansson, Karl H.
Sousa, Joao
2010 AMERICAN CONTROL CONFERENCE, 2010, : 824 - 829
[23] Microgrids Coordination Based On Heterogeneous Multi-Agent Systems
Toro, Vladimir
Mojica-Nava, Eduardo
2015 IEEE 2ND COLOMBIAN CONFERENCE ON AUTOMATIC CONTROL (CCAC), 2015,
[24] An organisation infrastructure for Multi-Agent Systems based on agent coordination contexts
Viroli, M
Omicini, A
Ricci, A
AI*IA2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3673 : 198 - 211
[25] Research on the multi-Agent modeling and simulating method of CAS and the Agent coordination model
Ni, JJ
Ma, XP
Xu, LZ
DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES A-MATHEMATICAL ANALYSIS, 2006, 13 : 1426 - 1435
[26] Multi-agent cooperation policy gradient method based on enhanced exploration for cooperative tasks
Li-yang Zhao
Tian-qing Chang
Lei Zhang
Xin-lu Zhang
Jiang-feng Wang
International Journal of Machine Learning and Cybernetics, 2024, 15 : 1431 - 1452
[27] Optimal Distributed Controllers Based on Gradient-Flow Method for Multi-Agent Coordination
Sakurama, Kazunori
Azuma, Shun-ichi
Sugie, Toshiharu
2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 2133 - 2138
[28] Multi-agent cooperation policy gradient method based on enhanced exploration for cooperative tasks
Zhao, Li-yang
Chang, Tian-qing
Zhang, Lei
Zhang, Xin-lu
Wang, Jiang-feng
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (04) : 1431 - 1452
[29] Distributed Coordination Method of Microgrid Economic Operation Optimization Based on Multi-Agent System
Luo, Kui
Shi, Wenhui
2014 INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY (POWERCON), 2014, : 3135 - 3140
[30] A Projection-Based Method for Shape Measurement
Thanh Phuong Nguyen
Xuan Son Nguyen
Mohamed Anouar Borgi
M. K. Nguyen
Journal of Mathematical Imaging and Vision, 2020, 62 : 489 - 504