Deep reinforcement learning driven trajectory-based meta-heuristic for distributed heterogeneous flexible job shop scheduling problem

Cited by: 1

Authors:

Zhang, Qichen [1]

Shao, Weishi [1,3,4]

Shao, Zhongshi [2]

Pi, Dechang [4]

Gao, Jiaquan [1,3]

Affiliations:

[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Sch Artificial Intelligence, Nanjing, Peoples R China

[2] Shaanxi Normal Univ, Sch Comp Sci, Xian, Peoples R China

[3] Minist Educ, Key Lab Numer Simulat Large Scale Complex Syst, Beijing, Peoples R China

[4] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China

Source:

SWARM AND EVOLUTIONARY COMPUTATION | 2024 / Vol. 91

Funding:

China Postdoctoral Science Foundation;

Keywords:

Distributed heterogeneous flexible job shop scheduling problem; Deep Q network; Variable neighborhood search; Makespan; Critical path; ALGORITHM; SEARCH;

DOI:

10.1016/j.swevo.2024.101753

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

As the production environment evolves, distributed manufacturing exhibits heterogeneous characteristics, including diverse machines, workers, and production processes. This paper examines a distributed heterogeneous flexible job shop scheduling problem (DHFJSP) with varying processing times. A mixed integer linear programming (MILP) model of the DHFJSP is formulated with the objective of minimizing the makespan. To solve the DHFJSP, we propose a deep Q network-aided automatic design of a variable neighborhood search algorithm (DQN-VNS). By analyzing schedules, sixty-one types of scheduling features are extracted. These features serve as the states, and six shaking strategies serve as the actions. A DHFJSP environment simulator is developed to train the deep Q network. The well-trained DQN then generates the shaking procedure for VNS. Additionally, a greedy initialization method is proposed to enhance the quality of the initial solution. Seven efficient critical path-based neighborhood structures with a three-vector encoding scheme are introduced to improve local search. Numerical experiments on instances of various scales validate the effectiveness of the MILP model and the DQN-VNS algorithm. The results show that the DQN-VNS algorithm achieves an average relative percentage deviation (ARPD) of 3.2%, a reduction of approximately 88.45% relative to the best-performing of the six compared algorithms, whose ARPD is 27.7%. This significant reduction in ARPD highlights the superior stability and performance of the proposed DQN-VNS algorithm.
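The 88.45% figure quoted in the abstract can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the commonly used definition of ARPD (the mean, over instances, of the percentage gap between an algorithm's makespan and the best-known makespan), which the abstract does not spell out, and the function names `rpd`, `arpd`, and `relative_reduction` are hypothetical helpers, not taken from the paper.

```python
from statistics import mean


def rpd(makespan: float, best_known: float) -> float:
    """Relative percentage deviation of one makespan from the best-known value.

    Assumed definition: 100 * (C - C_best) / C_best.
    """
    return 100.0 * (makespan - best_known) / best_known


def arpd(makespans: list[float], best_knowns: list[float]) -> float:
    """Average RPD over a set of instances (assumed definition of ARPD)."""
    return mean(rpd(c, b) for c, b in zip(makespans, best_knowns))


def relative_reduction(baseline: float, proposed: float) -> float:
    """Percentage by which `proposed` is lower than `baseline`."""
    return 100.0 * (baseline - proposed) / baseline


# ARPD values reported in the abstract: 27.7% (best competitor) vs 3.2% (DQN-VNS).
print(round(relative_reduction(27.7, 3.2), 2))  # prints 88.45
```

Under these assumptions, the reported reduction is (27.7 − 3.2) / 27.7 ≈ 88.45%, which matches the abstract.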

Pages: 23