Particle swarm optimization based multi-task parallel reinforcement learning algorithm

Cited by: 3
Authors
Duan Junhua [1]
Zhu Yi-an [1]
Zhong Dong [1]
Zhang Lixiang [1]
Zhang Lin [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp, 127 West Youyi Rd, Xian 710072, Shaanxi, Peoples R China
Keywords
Multi-task reinforcement learning; parallel reinforcement learning; particle swarm optimization; transfer learning
DOI
10.3233/JIFS-190209
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Transfer learning has been shown to speed up machine learning in many areas. In multi-task reinforcement learning, it supports the transfer of experience between different tasks. The research in this article focuses on two aspects. First, multi-task parallel transfer learning can improve the learning speed of tasks that are learned in parallel. Second, learning from the current optimal experience helps propagate the rewards at the target point back to the starting point, and this self-learning also accelerates the convergence of reinforcement learning. Building on these two aspects, this paper uses the idea of particle swarm optimization (PSO) to conduct self-learning and interactive learning in multi-task parallel learning. A new multi-task learning algorithm named PSO-MTPRL (Multi-Task Parallel Reinforcement Learning based on PSO) is proposed. Following the idea of the PSO algorithm, the Boltzmann strategy, the Self-Learning Process (SLP) and the Interactive Learning Process (ILP) are selected probabilistically. Based on the characteristics of reinforcement learning, a segmented learning model is recommended. In the early learning stage, the complete Boltzmann exploration strategy is applied; the B-SLP-ILP (Boltzmann-SLP-ILP) learning procedure is conducted exclusively in the middle stage; in the late learning stage, Boltzmann exploration is used again. The segmented learning model helps balance exploration and exploitation and ensures that all tasks converge.
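The segmented learning model described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the stage boundaries (`early_frac`, `late_frac`), the selection probabilities (`probs`), the function names, and the reading of the late stage as Boltzmann-only are all assumptions made for illustration, since the abstract gives no concrete values.

```python
import math
import random


def boltzmann_action(q_values, temperature, rng):
    """Sample an action index with probability proportional to exp(Q / T)."""
    max_q = max(q_values)  # subtract the maximum for numerical stability
    weights = [math.exp((q - max_q) / temperature) for q in q_values]
    return rng.choices(range(len(q_values)), weights=weights, k=1)[0]


def select_procedure(episode, total_episodes, rng,
                     early_frac=0.2, late_frac=0.8,
                     probs=(0.4, 0.3, 0.3)):
    """Pick the learning procedure for one episode.

    Early and late stages: Boltzmann exploration only (assumed reading).
    Middle stage: Boltzmann, SLP or ILP chosen probabilistically.
    All thresholds and probabilities are hypothetical placeholders.
    """
    progress = episode / total_episodes
    if progress < early_frac or progress >= late_frac:
        return "boltzmann"
    return rng.choices(("boltzmann", "slp", "ilp"), weights=probs, k=1)[0]


rng = random.Random(0)
schedule = [select_procedure(e, 100, rng) for e in range(100)]
```

In this sketch, `schedule` holds only `"boltzmann"` entries for the first 20 and last 20 episodes, while the middle 60 episodes mix all three procedures according to `probs`.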
Pages: 8567-8575
Number of pages: 9