Real-time scheduling for distributed permutation flowshops with dynamic job arrivals using deep reinforcement learning

Cited by: 31
Authors
Yang, Shengluo [1]
Wang, Junyi [2, 3, 4]
Xu, Zhigang [2, 3, 4]
Affiliations
[1] Univ Shanghai Sci & Technol, Sch Mech Engn, Shanghai 200093, Peoples R China
[2] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
[3] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China
[4] 135 Chuangxin Rd, Shenyang, Liaoning, Peoples R China
Keywords
Distributed flowshop scheduling; Deep reinforcement learning; Real-time scheduling; Dynamic job arrivals; Intelligent scheduling; Deep Q-network; ITERATED GREEDY ALGORITHM; SHOP; METAHEURISTICS; SEARCH
DOI
10.1016/j.aei.2022.101776
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Distributed manufacturing helps large-scale companies reduce production and transportation costs for globalized orders. However, assigning dynamic orders to distributed workshops properly and in real time is a challenging problem. To provide real-time, intelligent scheduling decisions for distributed flowshops, we studied the distributed permutation flowshop scheduling problem (DPFSP) with dynamic job arrivals using deep reinforcement learning (DRL). The objective is to minimize the total tardiness cost of all jobs. We provided the training and execution procedures of DRL-based intelligent scheduling for the dynamic DPFSP. In addition, we established a DRL-based scheduling model for distributed flowshops by designing a suitable reward function, scheduling actions, and state features. A novel reward function is designed to relate directly to the objective. Various problem-specific dispatching rules are introduced to provide efficient actions for different production states. Furthermore, four efficient DRL algorithms, namely deep Q-network (DQN), double DQN (DbDQN), dueling DQN (DlDQN), and advantage actor-critic (A2C), are adapted to train the scheduling agent. The training curves show that the agent learned to generate better solutions effectively and validate that the system design is reasonable. After training, all DRL algorithms outperform traditional meta-heuristics and well-known priority dispatching rules (PDRs) by a large margin in terms of solution quality and computation efficiency. This work shows the effectiveness of DRL for the real-time scheduling of dynamic DPFSP.
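To make the stated objective concrete, the following is a minimal sketch of how total tardiness cost could be evaluated for a distributed permutation flowshop schedule. It is not the authors' implementation: the function names, the data layout (one job sequence per factory), the per-job release times, and the optional unit tardiness costs are all assumptions introduced for illustration.

```python
from typing import Dict, List, Optional, Sequence


def completion_times(
    sequence: Sequence[int],
    proc: Dict[int, List[float]],
    release: Dict[int, float],
) -> Dict[int, float]:
    """Completion time on the last machine for each job in one flowshop.

    Jobs visit machines 0..m-1 in the same order (permutation flowshop).
    `release` holds assumed arrival times of the dynamically arriving jobs.
    """
    if not sequence:
        return {}
    n_machines = len(proc[sequence[0]])
    machine_free = [0.0] * n_machines  # time each machine becomes idle
    done: Dict[int, float] = {}
    for job in sequence:
        t = release[job]  # job cannot start before it arrives
        for m in range(n_machines):
            start = max(machine_free[m], t)
            t = start + proc[job][m]
            machine_free[m] = t
        done[job] = t
    return done


def total_tardiness_cost(
    schedule: Sequence[Sequence[int]],
    proc: Dict[int, List[float]],
    due: Dict[int, float],
    release: Dict[int, float],
    unit_cost: Optional[Dict[int, float]] = None,
) -> float:
    """Sum of unit_cost[j] * max(0, C_j - d_j) over all factories.

    `schedule[f]` is the job sequence assigned to factory f.
    A unit cost of 1 per job is assumed when `unit_cost` is omitted.
    """
    total = 0.0
    for sequence in schedule:
        for job, c in completion_times(sequence, proc, release).items():
            weight = 1.0 if unit_cost is None else unit_cost[job]
            total += weight * max(0.0, c - due[job])
    return total


# Hypothetical instance: 3 jobs, 2 machines per factory, 2 factories.
proc = {0: [3.0, 2.0], 1: [1.0, 4.0], 2: [2.0, 2.0]}
due = {0: 5.0, 1: 6.0, 2: 4.0}
release = {0: 0.0, 1: 0.0, 2: 0.0}

cost_a = total_tardiness_cost([[0, 1], [2]], proc, due, release)  # 3.0
cost_b = total_tardiness_cost([[1, 0], [2]], proc, due, release)  # 2.0
```

In this toy instance, swapping the order of jobs 0 and 1 in the first factory lowers the cost from 3.0 to 2.0. A dispatching rule chosen by a scheduling agent would in effect be selecting among such orderings and factory assignments.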
Pages: 16