Particle swarm optimization based multi-task parallel reinforcement learning algorithm

被引：3

作者：

Duan Junhua ^{[1
]}

Zhu Yi-an ^{[1
]}

Zhong Dong ^{[1
]}

Zhang Lixiang ^{[1
]}

Zhang Lin ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp, 127 West Youyi Rd, Xian 710072, Shaanxi, Peoples R China

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2019年 / 37卷 / 06期

关键词：

Multi-task reinforcement learning; parallel reinforcement learning; particle swarm optimization; transfer learning;

D O I：

10.3233/JIFS-190209

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Transfer learning has been identified as conducive to improving the speed of machine learning in many areas. In multi-task reinforcement learning, transfer learning can assist the transfer of experiences between different tasks. The research conducted in this article is focused on two aspects. On the one hand, multi-task parallel transfer learning can improve the learning speed of parallel learning tasks. On the other hand, the learning of the current optimal experience can help the target point rewards to be transmitted to the starting point. The value of this self-learning can also accelerate the convergence speed of the reinforcement learning. According to the research into these two aspects, this paper uses the idea of particle swarm optimization (PSO) to conduct self-learning and interactive learning in multi-task parallel learning. In this paper, a new multi-task learning algorithm named PSO-MTPRL (Multi-Task Parallel Reinforcement Learning based on PSO) is proposed. Based on the idea of PSO algorithm, the Boltzmann strategy, Self-Learning Process (SLP) and Interactive Learning Process (ILP) are selected probabilistically. Based on the characteristic exhibited by reinforcement learning, segmented learning model is recommended. In the early learning stages, the complete Boltzmann exploration strategy is applied, and B-SLP-ILP (Boltzmann-SLP- ILP) learning procedure is conducted exclusively in the middle stage of the learning. In the late learning stages, Boltzmann exploration is involved again. The segmented learning model can help ensure the balance of the exploration and exploitation, in addition to ensuring that all tasks convergence.

引用

下载

页码：8567 / 8575

页数：9

共 50 条

[31] Optimization of Multi-core Task Scheduling based on Improved Particle Swarm Optimization Algorithm
Cheng, Xiaohui
Chi, Jinqiu
2019 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING (ICIIP 2019), 2019, : 438 - 444
[32] IMPROVED PARTICLE SWARM ALGORITHM FOR COOPERATIVE MULTI-TASK ALLOCATION OF HETEROGENEOUS UAVs
Lu, Qilin
Chen, Yu
Qi, Xiaogang
Liu, Lifang
MECHATRONIC SYSTEMS AND CONTROL, 2023, 51 (01): : 42 - 52
[33] Swarm Reinforcement Learning Algorithm Based on Particle Swarm Optimization Whose Personal Bests Have Lifespans
Iima, Hitoshi
Kuroe, Yasuaki
NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 169 - 178
[34] A multi-task learning model with reinforcement optimization for ASD comorbidity discrimination
Dong, Heyou
Chen, Dan
Chen, Yukang
Tang, Yunbo
Yin, Dingze
Li, Xiaoli
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 243
[35] An Adaptive Online Parameter Control Algorithm for Particle Swarm Optimization Based on Reinforcement Learning
Liu, Yaxian
Lu, Hui
Cheng, Shi
Shi, Yuhui
2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 815 - 822
[36] Unsupervised Task Clustering for Multi-task Reinforcement Learning
Ackermann, Johannes
Richter, Oliver
Wattenhofer, Roger
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 222 - 237
[37] Multi-objective multi-task particle swarm optimization based on objective space division and adaptive transfer
Liang, Zhengping
Yan, Jiabiao
Zheng, Fan
Wang, Jigang
Liu, Ling
Zhu, Zexuan
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
[38] A parallel particle swarm optimization algorithm
Ma, Yan
Sun, Jun
Xu, Wenbo
DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 61 - 64
[39] Curriculum-Based Asymmetric Multi-Task Reinforcement Learning
Huang, Hanchi
Ye, Deheng
Shen, Li
Liu, Wei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7258 - 7269
[40] Multi-Task Reinforcement Learning with Context-based Representations
Sodhani, Shagun
Zhang, Amy
Pineau, Joelle
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139

← 1 2 3 4 5 →