Deep reinforcement learning driven trajectory-based meta-heuristic for distributed heterogeneous flexible job shop scheduling problem

被引:1
|
作者
Zhang, Qichen [1 ]
Shao, Weishi [1 ,3 ,4 ]
Shao, Zhongshi [2 ]
Pi, Dechang [4 ]
Gao, Jiaquan [1 ,3 ]
机构
[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Sch Artificial Intelligence, Nanjing, Peoples R China
[2] Shaanxi Normal Univ, Sch Comp Sci, Xian, Peoples R China
[3] Minist Educ, Key Lab Numer Simulat Large Scale Complex Syst, Beijing, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
基金
中国博士后科学基金;
关键词
Distributed heterogeneous flexible job shop; scheduling problem; Deep Q network; Variable neighborhood search; Makespan; Critical path; ALGORITHM; SEARCH;
D O I
10.1016/j.swevo.2024.101753
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the production environment evolves, distributed manufacturing exhibits heterogeneous characteristics, including diverse machines, workers, and production processes. This paper examines a distributed heterogeneous flexible job shop scheduling problem (DHFJSP) with varying processing times. A mixed integer linear programming (MILP) model of the DHFJSP is formulated with the objective of minimizing the makespan. To solve the DHFJSP, we propose a deep Q network-aided automatic design of a variable neighborhood search algorithm (DQN-VNS). By analyzing schedules, sixty-one types of scheduling features are extracted. These features, along with six shaking strategies, are used as states and actions. A DHFJSP environment simulator is developed to train the deep Q network. The well-trained DQN then generates the shaking procedure for VNS. Additionally, a greedy initialization method is proposed to enhance the quality of the initial solution. Seven efficient critical path-based neighborhood structures with three-vector encoding scheme are introduced to improve local search. Numerical experiments on various scales of instances validate the effectiveness of the MILP model and the DQN-VNS algorithm. The results show that the DQN-VNS algorithm achieves an average relative percentage deviation (ARPD) of 3.2%, which represents an approximately 88.45% reduction compared to the best-performing algorithm among the six compared, with an ARPD of 27.7%. This significant reduction in ARPD highlights the superior stability and performance of the proposed DQN-VNS algorithm.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] A DEEP REINFORCEMENT LEARNING BASED SOLUTION FOR FLEXIBLE JOB SHOP SCHEDULING PROBLEM
    Han, B. A.
    Yang, J. J.
    INTERNATIONAL JOURNAL OF SIMULATION MODELLING, 2021, 20 (02) : 375 - 386
  • [2] Mathematical modelling and a meta-heuristic for flexible job shop scheduling
    Roshanaei, V.
    Azab, Ahmed
    ElMaraghy, H.
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2013, 51 (20) : 6247 - 6274
  • [3] Preference learning based deep reinforcement learning for flexible job shop scheduling problem
    Liu, Xinning
    Han, Li
    Kang, Ling
    Liu, Jiannan
    Miao, Huadong
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (02)
  • [4] Deep reinforcement learning for flexible assembly job shop scheduling problem
    Hu Y.
    Zhang L.
    Bai X.
    Tang Q.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (02): : 153 - 160
  • [5] Job Shop Scheduling Problem Based on Deep Reinforcement Learning
    Li, Baoshuai
    Ye, Chunming
    Computer Engineering and Applications, 2024, 57 (23) : 248 - 254
  • [6] Linguistic-based meta-heuristic optimization model for flexible job shop scheduling
    Baykasoglu, A
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2002, 40 (17) : 4523 - 4543
  • [7] An Efficient Meta-Heuristic for Multi-Objective Flexible Job Shop Inverse Scheduling Problem
    Wu, Rui
    Li, Yibing
    Guo, Shunsheng
    Li, Xixing
    IEEE ACCESS, 2018, 6 : 59515 - 59527
  • [8] META-HEURISTIC APPROACH BASED ON GENETIC AND GREEDY ALGORITHMS TO SOLVE FLEXIBLE JOB-SHOP SCHEDULING PROBLEM
    Rezaeipanah, Amin
    Sarhangnia, Fariba
    Abdollahi, Mohammad Javad
    COMPUTER SCIENCE-AGH, 2021, 22 (04): : 463 - 488
  • [9] Deep reinforcement learning for dynamic distributed job shop scheduling problem with transfers
    Lei, Yong
    Deng, Qianwang
    Liao, Mengqi
    Gao, Shuocheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [10] Dynamic flexible job shop scheduling based on deep reinforcement learning
    Yang, Dan
    Shu, Xiantao
    Yu, Zhen
    Lu, Guangtao
    Ji, Songlin
    Wang, Jiabing
    He, Kongde
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2024,