Emotion-Based Heterogeneous Multi-agent Reinforcement Learning with Sparse Reward

被引:0
|
作者
Fang, Baofu [1 ,2 ]
Ma, Yunting [1 ,2 ]
Wang, Zaijun [3 ]
Wang, Hao [1 ,2 ]
机构
[1] School of Computer Science and Information Engineering, Hefei University of Technology, Hefei,230601, China
[2] Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei,230601, China
[3] Key Laboratory of Flight Techniques and Flight Safety, Civil Aviation Flight University of China, Guanghan,618307, China
基金
中国国家自然科学基金;
关键词
Simulation platform - Learning systems - Learning algorithms - Multi agent systems;
D O I
10.16451/j.cnki.issn1003-6059.202103004
中图分类号
学科分类号
摘要
In reinforcement learning, the convergence speed and efficiency of the agent are greatly reduced due to its inability to acquire effective experience in an sparse reward distribution environment. Aiming at this kind of sparse reward problem, a method of emotion-based heterogeneous multi-agent reinforcement learning with sparse reward is proposed in this paper. Firstly, the emotion model based on personality is established to provide incentive mechanism for multiple heterogeneous agents as an effective supplement to external rewards. Then, based on this mechanism, a deep deterministic strategy gradient reinforcement learning algorithm based on intrinsic emotional incentive mechanism under sparse rewards is proposed to accelerate the convergence speed of agents. Finally, multi-robot pursuit is used as a simulation experiment platform to construct sparse reward scenarios with different difficulty levels, and the effectiveness and superiority of the proposed method in pursuit success rate and convergence speed are verified. © 2021, Science Press. All right reserved.
引用
收藏
页码:223 / 231
相关论文
共 50 条
  • [1] An Emotion-Based Approach to Reinforcement Learning Reward Design
    Yu, Haixu
    Yang, Pei
    [J]. PROCEEDINGS OF THE 2019 IEEE 16TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2019), 2019, : 346 - 351
  • [2] Multi-Agent Reinforcement Learning with Reward Delays
    Zhang, Yuyang
    Zhang, Runyu
    Gu, Yuantao
    Li, Na
    [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [3] Direct reward and indirect reward in multi-agent reinforcement learning
    Ohta, M
    [J]. ROBOCUP 2002: ROBOT SOCCER WORLD CUP VI, 2003, 2752 : 359 - 366
  • [4] Underexplored Subspace Mining for Sparse-Reward Cooperative Multi-Agent Reinforcement Learning
    Yu, Yang
    Yin, Qiyue
    Zhang, Junge
    Chen, Hao
    Huang, Kaiqi
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [5] Plan-based reward shaping for multi-agent reinforcement learning
    Devlin, Sam
    Kudenko, Daniel
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2016, 31 (01): : 44 - 58
  • [6] Rationality of reward sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    [J]. NEW GENERATION COMPUTING, 2001, 19 (02) : 157 - 172
  • [7] Rationality of reward sharing in multi-agent reinforcement learning
    Kazuteru Miyazaki
    Shigenobu Kobayashi
    [J]. New Generation Computing, 2001, 19 : 157 - 172
  • [8] Individual Reward Assisted Multi-Agent Reinforcement Learning
    Wang, Li
    Zhang, Yupeng
    Hu, Yujing
    Wang, Weixun
    Zhang, Chongjie
    Gao, Yang
    Hao, Jianye
    Lv, Tangjie
    Fan, Changjie
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [9] Multi-agent Cooperation Algorithm Based on Individual Gap Emotion in Sparse Reward Scenarios
    Wang, Hao
    Wang, Jing
    Fang, Baofu
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (05): : 451 - 460
  • [10] Autonomous learning of reward distribution for each agent in multi-agent reinforcement learning
    Shibata, K
    Ito, K
    [J]. INTELLIGENT AUTONOMOUS SYSTEMS 6, 2000, : 495 - 502