Deep reinforcement learning driven trajectory-based meta-heuristic for distributed heterogeneous flexible job shop scheduling problem

被引:1
|
作者
Zhang, Qichen [1 ]
Shao, Weishi [1 ,3 ,4 ]
Shao, Zhongshi [2 ]
Pi, Dechang [4 ]
Gao, Jiaquan [1 ,3 ]
机构
[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Sch Artificial Intelligence, Nanjing, Peoples R China
[2] Shaanxi Normal Univ, Sch Comp Sci, Xian, Peoples R China
[3] Minist Educ, Key Lab Numer Simulat Large Scale Complex Syst, Beijing, Peoples R China
[4] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
基金
中国博士后科学基金;
关键词
Distributed heterogeneous flexible job shop; scheduling problem; Deep Q network; Variable neighborhood search; Makespan; Critical path; ALGORITHM; SEARCH;
D O I
10.1016/j.swevo.2024.101753
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the production environment evolves, distributed manufacturing exhibits heterogeneous characteristics, including diverse machines, workers, and production processes. This paper examines a distributed heterogeneous flexible job shop scheduling problem (DHFJSP) with varying processing times. A mixed integer linear programming (MILP) model of the DHFJSP is formulated with the objective of minimizing the makespan. To solve the DHFJSP, we propose a deep Q network-aided automatic design of a variable neighborhood search algorithm (DQN-VNS). By analyzing schedules, sixty-one types of scheduling features are extracted. These features, along with six shaking strategies, are used as states and actions. A DHFJSP environment simulator is developed to train the deep Q network. The well-trained DQN then generates the shaking procedure for VNS. Additionally, a greedy initialization method is proposed to enhance the quality of the initial solution. Seven efficient critical path-based neighborhood structures with three-vector encoding scheme are introduced to improve local search. Numerical experiments on various scales of instances validate the effectiveness of the MILP model and the DQN-VNS algorithm. The results show that the DQN-VNS algorithm achieves an average relative percentage deviation (ARPD) of 3.2%, which represents an approximately 88.45% reduction compared to the best-performing algorithm among the six compared, with an ARPD of 27.7%. This significant reduction in ARPD highlights the superior stability and performance of the proposed DQN-VNS algorithm.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] Reinforcement Learning-Based Estimation of Distribution Algorithm for Energy-Efficient Distributed Heterogeneous Flexible Job Shop Scheduling Problem
    Zhao, Fuqing
    Li, Mengjie
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024, 2024, 14862 : 183 - 195
  • [32] Two meta-heuristic algorithms for the dual-resource constrained flexible job-shop scheduling problem
    Yazdani, M.
    Zandieh, M.
    Tavakkoli-Moghaddam, R.
    Jolai, F.
    SCIENTIA IRANICA, 2015, 22 (03) : 1242 - 1257
  • [33] A heuristic-assisted deep reinforcement learning algorithm for flexible job shop scheduling with transport constraints
    Dong, Xiaoting
    Wan, Guangxi
    Zeng, Peng
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (05)
  • [34] A meta-heuristic to solve the just-in-time job-shop scheduling problem
    Ahmadian, Mohammad Mahdi
    Salehipour, Amir
    Cheng, T. C. E.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 288 (01) : 14 - 29
  • [35] Solving a new robust reverse job shop scheduling problem by meta-heuristic algorithms
    Dehghan-Sanej, K.
    Eghbali-Zarch, M.
    Tavakkoli-Moghaddam, R.
    Sajadi, S. M.
    Sadjadi, S. J.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 101
  • [36] Deep Reinforcement Learning Based on Graph Neural Network for Flexible Job Shop Scheduling Problem with Lot Streaming
    He, Junchao
    Li, Junqing
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14879 : 85 - 95
  • [37] A hybrid meta-heuristic for the flexible flow shop scheduling with blocking
    Wang, Yan
    Jia, Zhao-hong
    Zhang, Xing-yi
    SWARM AND EVOLUTIONARY COMPUTATION, 2022, 75
  • [38] A Two-Phase Meta-Heuristic for Multiobjective Flexible Job Shop Scheduling Problem With Total Energy Consumption Threshold
    Lei, Deming
    Li, Ming
    Wang, Ling
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (03) : 1097 - 1109
  • [39] A novel hybrid meta-heuristic algorithm for solving multi objective flexible job shop scheduling
    Shahsavari-Pour, Nasser
    Ghasemishabankareh, Behrooz
    JOURNAL OF MANUFACTURING SYSTEMS, 2013, 32 (04) : 771 - 780
  • [40] Deep Reinforcement Learning for Dynamic Flexible Job Shop Scheduling with Random Job Arrival
    Chang, Jingru
    Yu, Dong
    Hu, Yi
    He, Wuwei
    Yu, Haoyu
    PROCESSES, 2022, 10 (04)