Reward-based epigenetic learning algorithm for a decentralised multi-agent system

被引:3
|
作者
Mukhlish, Faqihza [1 ]
Page, John [1 ]
Bain, Michael [2 ]
机构
[1] Univ New South Wales, Sch Mech & Mfg Engn, Sydney, NSW, Australia
[2] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
关键词
Swarm robotics; Evolutionary algorithm; Multi-agent learning; Reinforcement learning; Epigenetic; SWARM ROBOTICS; DECISION-MAKING;
D O I
10.1108/IJIUS-12-2018-0036
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Purpose This paper aims to propose a novel epigenetic learning (EpiLearn) algorithm, which is designed specifically for a decentralised multi-agent system such as swarm robotics. Design/methodology/approach First, this paper begins with overview of swarm robotics and the challenges in designing swarm behaviour automatically. This should indicate the direction of improvements required to enhance an automatic swarm design. Second, the evolutionary learning (EpiLearn) algorithm for a swarm system using an epigenetic layer is formulated and discussed. The algorithm is then tested through various test functions to investigate its performance. Finally, the results are discussed along with possible future research directions. Findings Through various test functions, the algorithm can solve non-local and many local minima problems. This article also shows that by using a reward system, the algorithm can handle the deceptive problem which often occurs in dynamic problems. Moreover, utilization of rewards from the environment in the form of a methylation process on the epigenetic layer improves the performance of traditional evolutionary algorithms applied to automatic swarm design. Finally, this article shows that a regeneration process that embeds an epigenetic layer in the inheritance process performs better than a traditional crossover operator in a swarm system. Originality/value This paper proposes a novel method for automatic swarm design by taking into account the importance of multi-agent settings and environmental characteristics surrounding the swarm. The novel evolutionary learning (EpiLearn) algorithm using an epigenetic layer gives the swarm the ability to perform co-evolution and co-learning.
引用
收藏
页码:201 / 224
页数:24
相关论文
共 50 条
  • [1] Multi-agent Reward-Based Intruder Capture
    Grimaldi, Michele
    Herpson, Cedric
    [J]. INTELLIGENT DISTRIBUTED COMPUTING XVI, IDC 2023, 2024, 1138 : 251 - 266
  • [2] Probabilistic Reward-Based Reinforcement Learning for Multi-Agent Pursuit and Evasion
    Zhang, Bo-Kun
    Hu, Bin
    Chen, Long
    Zhang, Ding-Xue
    Cheng, Xin-Ming
    Guan, Zhi-Hong
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3352 - 3357
  • [3] MuDE: Multi-agent decomposed reward-based exploration
    Yoo, Byunghyun
    Yi, Sungwon
    Kim, Hyunwoo
    Shin, Younghwan
    Han, Ran
    Seo, Seungwoo
    Song, Hwa Jeon
    Chung, Euisok
    Yang, Jeongmin
    [J]. NEURAL NETWORKS, 2024, 179
  • [4] Decentralised coordination of a multi-agent system based on intermittent data
    DeLellis, Pietro
    Garofalo, Franco
    Lo Iudice, Francesco
    Mancini, Giovanni
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2015, 88 (08) : 1523 - 1532
  • [5] Decentralised coordinated control of microgrid based on multi-agent system
    Dou, Chunxia
    Lv, Mengfei
    Zhao, Tianyu
    Ji, Yeping
    Li, Heng
    [J]. IET GENERATION TRANSMISSION & DISTRIBUTION, 2015, 9 (16) : 2474 - 2484
  • [6] A Design of Reward Function Based on Knowledge in Multi-agent Learning
    Fan, Bo
    Pu, Jiexin
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2008, 5139 : 596 - 603
  • [7] A novel multi-agent Q-learning algorithm in cooperative multi-agent system
    Ou, HT
    Zhang, WD
    Zhang, WY
    Xu, XM
    [J]. PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 272 - 276
  • [8] Multi-Agent Reinforcement Learning with Reward Delays
    Zhang, Yuyang
    Zhang, Runyu
    Gu, Yuantao
    Li, Na
    [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [9] Direct reward and indirect reward in multi-agent reinforcement learning
    Ohta, M
    [J]. ROBOCUP 2002: ROBOT SOCCER WORLD CUP VI, 2003, 2752 : 359 - 366
  • [10] A Collaborative Learning System based on Multi-agent
    Wang, Yuanzhi
    Zhang, Fei
    Chen, Liwei
    Hu, Guosheng
    Jiang, Shanhe
    [J]. ALPIT 2008: SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 353 - 357