Particle swarm optimization based multi-task parallel reinforcement learning algorithm

Cited by: 3

Authors:

Duan Junhua [1]

Zhu Yi-an [1]

Zhong Dong [1]

Zhang Lixiang [1]

Zhang Lin [1]

Affiliation:

[1] Northwestern Polytech Univ, Sch Comp, 127 West Youyi Rd, Xian 710072, Shaanxi, Peoples R China

Source:

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2019 / Vol. 37 / Issue 06

Keywords:

Multi-task reinforcement learning; parallel reinforcement learning; particle swarm optimization; transfer learning;

DOI:

10.3233/JIFS-190209

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Transfer learning is known to speed up machine learning in many areas. In multi-task reinforcement learning, it helps transfer experience between different tasks. This article focuses on two aspects. First, multi-task parallel transfer learning can improve the learning speed of tasks learned in parallel. Second, learning from the current optimal experience helps propagate the target-point rewards back to the starting point, and this self-learning also accelerates the convergence of reinforcement learning. Building on these two aspects, the paper uses the idea of particle swarm optimization (PSO) to carry out self-learning and interactive learning in multi-task parallel learning, and proposes a new multi-task learning algorithm named PSO-MTPRL (Multi-Task Parallel Reinforcement Learning based on PSO). Following the idea of the PSO algorithm, the Boltzmann strategy, the Self-Learning Process (SLP), and the Interactive Learning Process (ILP) are selected probabilistically. Based on the characteristics of reinforcement learning, a segmented learning model is recommended. In the early learning stage, the complete Boltzmann exploration strategy is applied; the B-SLP-ILP (Boltzmann-SLP-ILP) learning procedure is conducted exclusively in the middle stage; and in the late learning stage, Boltzmann exploration is involved again. The segmented learning model helps ensure a balance between exploration and exploitation while also ensuring that all tasks converge.
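
The abstract gives no formulas, so the following is only a minimal sketch of two ideas it names: Boltzmann (softmax) action selection over Q-values, and a segmented schedule that picks a learning procedure per episode. The function names, the stage boundaries (`early_frac`, `late_frac`), the selection weights, and the reading of the late stage as pure Boltzmann exploration are all assumptions made for illustration; they are not taken from the paper, and SLP and ILP themselves are left unimplemented because the abstract does not define them.

```python
import math
import random


def boltzmann_probs(q_values, temperature):
    """Standard Boltzmann (softmax) distribution over a list of Q-values."""
    m = max(q_values)  # subtract the max for numerical stability
    exps = [math.exp((q - m) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def boltzmann_action(q_values, temperature, rng):
    """Sample an action index according to the Boltzmann distribution."""
    probs = boltzmann_probs(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


def choose_procedure(episode, total_episodes, rng,
                     early_frac=0.2, late_frac=0.8,
                     weights=(0.4, 0.3, 0.3)):
    """Pick the learning procedure for one episode.

    Hypothetical schedule: pure Boltzmann exploration in the early and
    late stages, and a probabilistic choice among Boltzmann, SLP and ILP
    in the middle stage. All numeric defaults are illustrative guesses.
    """
    progress = episode / total_episodes
    if progress < early_frac or progress >= late_frac:
        return "boltzmann"
    return rng.choices(("boltzmann", "slp", "ilp"), weights=weights, k=1)[0]


if __name__ == "__main__":
    rng = random.Random(0)
    q = [0.1, 0.5, 0.2]
    for episode in (0, 50, 95):
        procedure = choose_procedure(episode, 100, rng)
        action = boltzmann_action(q, temperature=0.5, rng=rng)
        print(episode, procedure, action)
```

A lower `temperature` concentrates probability on the highest-valued action, while a higher one makes the choice closer to uniform; how the paper actually sets the temperature and the selection probabilities is not stated in the abstract.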


Pages: 8567-8575

Number of pages: 9
