Deep reinforcement learning driven trajectory-based meta-heuristic for distributed heterogeneous flexible job shop scheduling problem

被引：1

作者：

Zhang, Qichen ^{[1
]}

Shao, Weishi ^{[1
,3
,4
]}

Shao, Zhongshi ^{[2
]}

Pi, Dechang ^{[4
]}

Gao, Jiaquan ^{[1
,3
]}

机构：

[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Sch Artificial Intelligence, Nanjing, Peoples R China

[2] Shaanxi Normal Univ, Sch Comp Sci, Xian, Peoples R China

[3] Minist Educ, Key Lab Numer Simulat Large Scale Complex Syst, Beijing, Peoples R China

[4] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China

来源：

SWARM AND EVOLUTIONARY COMPUTATION | 2024年 / 91卷

基金：

中国博士后科学基金;

关键词：

Distributed heterogeneous flexible job shop; scheduling problem; Deep Q network; Variable neighborhood search; Makespan; Critical path; ALGORITHM; SEARCH;

D O I：

10.1016/j.swevo.2024.101753

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As the production environment evolves, distributed manufacturing exhibits heterogeneous characteristics, including diverse machines, workers, and production processes. This paper examines a distributed heterogeneous flexible job shop scheduling problem (DHFJSP) with varying processing times. A mixed integer linear programming (MILP) model of the DHFJSP is formulated with the objective of minimizing the makespan. To solve the DHFJSP, we propose a deep Q network-aided automatic design of a variable neighborhood search algorithm (DQN-VNS). By analyzing schedules, sixty-one types of scheduling features are extracted. These features, along with six shaking strategies, are used as states and actions. A DHFJSP environment simulator is developed to train the deep Q network. The well-trained DQN then generates the shaking procedure for VNS. Additionally, a greedy initialization method is proposed to enhance the quality of the initial solution. Seven efficient critical path-based neighborhood structures with three-vector encoding scheme are introduced to improve local search. Numerical experiments on various scales of instances validate the effectiveness of the MILP model and the DQN-VNS algorithm. The results show that the DQN-VNS algorithm achieves an average relative percentage deviation (ARPD) of 3.2%, which represents an approximately 88.45% reduction compared to the best-performing algorithm among the six compared, with an ARPD of 27.7%. This significant reduction in ARPD highlights the superior stability and performance of the proposed DQN-VNS algorithm.

引用

页数：23

共 50 条

[31] Reinforcement Learning-Based Estimation of Distribution Algorithm for Energy-Efficient Distributed Heterogeneous Flexible Job Shop Scheduling Problem
Zhao, Fuqing
Li, Mengjie
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14862 : 183 - 195
[32] Two meta-heuristic algorithms for the dual-resource constrained flexible job-shop scheduling problem
Yazdani, M.
Zandieh, M.
Tavakkoli-Moghaddam, R.
Jolai, F.
SCIENTIA IRANICA, 2015, 22 (03) : 1242 - 1257
[33] A heuristic-assisted deep reinforcement learning algorithm for flexible job shop scheduling with transport constraints
Dong, Xiaoting
Wan, Guangxi
Zeng, Peng
COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (05)
[34] A meta-heuristic to solve the just-in-time job-shop scheduling problem
Ahmadian, Mohammad Mahdi
Salehipour, Amir
Cheng, T. C. E.
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 288 (01) : 14 - 29
[35] Solving a new robust reverse job shop scheduling problem by meta-heuristic algorithms
Dehghan-Sanej, K.
Eghbali-Zarch, M.
Tavakkoli-Moghaddam, R.
Sajadi, S. M.
Sadjadi, S. J.
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 101
[36] Deep Reinforcement Learning Based on Graph Neural Network for Flexible Job Shop Scheduling Problem with Lot Streaming
He, Junchao
Li, Junqing
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14879 : 85 - 95
[37] A hybrid meta-heuristic for the flexible flow shop scheduling with blocking
Wang, Yan
Jia, Zhao-hong
Zhang, Xing-yi
SWARM AND EVOLUTIONARY COMPUTATION, 2022, 75
[38] A Two-Phase Meta-Heuristic for Multiobjective Flexible Job Shop Scheduling Problem With Total Energy Consumption Threshold
Lei, Deming
Li, Ming
Wang, Ling
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (03) : 1097 - 1109
[39] A novel hybrid meta-heuristic algorithm for solving multi objective flexible job shop scheduling
Shahsavari-Pour, Nasser
Ghasemishabankareh, Behrooz
JOURNAL OF MANUFACTURING SYSTEMS, 2013, 32 (04) : 771 - 780
[40] Deep Reinforcement Learning for Dynamic Flexible Job Shop Scheduling with Random Job Arrival
Chang, Jingru
Yu, Dong
Hu, Yi
He, Wuwei
Yu, Haoyu
PROCESSES, 2022, 10 (04)

← 1 2 3 4 5 →