Reward-based epigenetic learning algorithm for a decentralised multi-agent system

被引：3

作者：

Mukhlish, Faqihza ^{[1
]}

Page, John ^{[1
]}

Bain, Michael ^{[2
]}

机构：

[1] Univ New South Wales, Sch Mech & Mfg Engn, Sydney, NSW, Australia

[2] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia

来源：

INTERNATIONAL JOURNAL OF INTELLIGENT UNMANNED SYSTEMS | 2020年 / 8卷 / 03期

关键词：

Swarm robotics; Evolutionary algorithm; Multi-agent learning; Reinforcement learning; Epigenetic; SWARM ROBOTICS; DECISION-MAKING;

D O I：

10.1108/IJIUS-12-2018-0036

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Purpose This paper aims to propose a novel epigenetic learning (EpiLearn) algorithm, which is designed specifically for a decentralised multi-agent system such as swarm robotics. Design/methodology/approach First, this paper begins with overview of swarm robotics and the challenges in designing swarm behaviour automatically. This should indicate the direction of improvements required to enhance an automatic swarm design. Second, the evolutionary learning (EpiLearn) algorithm for a swarm system using an epigenetic layer is formulated and discussed. The algorithm is then tested through various test functions to investigate its performance. Finally, the results are discussed along with possible future research directions. Findings Through various test functions, the algorithm can solve non-local and many local minima problems. This article also shows that by using a reward system, the algorithm can handle the deceptive problem which often occurs in dynamic problems. Moreover, utilization of rewards from the environment in the form of a methylation process on the epigenetic layer improves the performance of traditional evolutionary algorithms applied to automatic swarm design. Finally, this article shows that a regeneration process that embeds an epigenetic layer in the inheritance process performs better than a traditional crossover operator in a swarm system. Originality/value This paper proposes a novel method for automatic swarm design by taking into account the importance of multi-agent settings and environmental characteristics surrounding the swarm. The novel evolutionary learning (EpiLearn) algorithm using an epigenetic layer gives the swarm the ability to perform co-evolution and co-learning.

引用

页码：201 / 224

页数：24

共 50 条

[1] Multi-agent Reward-Based Intruder Capture
Grimaldi, Michele
Herpson, Cedric
[J]. INTELLIGENT DISTRIBUTED COMPUTING XVI, IDC 2023, 2024, 1138 : 251 - 266
[2] Probabilistic Reward-Based Reinforcement Learning for Multi-Agent Pursuit and Evasion
Zhang, Bo-Kun
Hu, Bin
Chen, Long
Zhang, Ding-Xue
Cheng, Xin-Ming
Guan, Zhi-Hong
[J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3352 - 3357
[3] MuDE: Multi-agent decomposed reward-based exploration
Yoo, Byunghyun
Yi, Sungwon
Kim, Hyunwoo
Shin, Younghwan
Han, Ran
Seo, Seungwoo
Song, Hwa Jeon
Chung, Euisok
Yang, Jeongmin
[J]. NEURAL NETWORKS, 2024, 179
[4] Decentralised coordination of a multi-agent system based on intermittent data
DeLellis, Pietro
Garofalo, Franco
Lo Iudice, Francesco
Mancini, Giovanni
[J]. INTERNATIONAL JOURNAL OF CONTROL, 2015, 88 (08) : 1523 - 1532
[5] Decentralised coordinated control of microgrid based on multi-agent system
Dou, Chunxia
Lv, Mengfei
Zhao, Tianyu
Ji, Yeping
Li, Heng
[J]. IET GENERATION TRANSMISSION & DISTRIBUTION, 2015, 9 (16) : 2474 - 2484
[6] A Design of Reward Function Based on Knowledge in Multi-agent Learning
Fan, Bo
Pu, Jiexin
[J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 596 - 603
[7] A novel multi-agent Q-learning algorithm in cooperative multi-agent system
Ou, HT
Zhang, WD
Zhang, WY
Xu, XM
[J]. PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 272 - 276
[8] Multi-Agent Reinforcement Learning with Reward Delays
Zhang, Yuyang
Zhang, Runyu
Gu, Yuantao
Li, Na
[J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
[9] Direct reward and indirect reward in multi-agent reinforcement learning
Ohta, M
[J]. ROBOCUP 2002: ROBOT SOCCER WORLD CUP VI, 2003, 2752 : 359 - 366
[10] A Collaborative Learning System based on Multi-agent
Wang, Yuanzhi
Zhang, Fei
Chen, Liwei
Hu, Guosheng
Jiang, Shanhe
[J]. ALPIT 2008: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 353 - 357

← 1 2 3 4 5 →