Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing

被引:34
|
作者
Yang, Shengluo [1 ,2 ,3 ,4 ]
Xu, Zhigang [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang, Peoples R China
[2] Chinese Acad Sci, Inst Robot, Shenyang, Peoples R China
[3] Inst Intelligent Mfg, Shenyang, Peoples R China
[4] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; dynamic scheduling and reconfiguration; A2C; reconfigurable manufacturing system (RMS); intelligent scheduling; dynamic job arrival; ITERATED GREEDY ALGORITHM; PERMUTATION FLOW-SHOP; TOTAL TARDINESS; OPTIMIZATION; MINIMIZATION; HEURISTICS; EARLINESS;
D O I
10.1080/00207543.2021.1943037
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
To realise the intelligent decision-making of dynamic scheduling and reconfiguration, we studied the intelligent scheduling and reconfiguration with dynamic job arrival for a reconfigurable flow line (RFL) using deep reinforcement learning (DRL), for the first time. The system architecture of intelligent scheduling and reconfiguration in smart manufacturing is proposed, and the mathematical model is established to minimise total tardiness cost. In addition, a DRL system of scheduling and reconfiguration is proposed by designing state features, actions, and rewards for scheduling and reconfiguration agents. Moreover, the advantage actor-critic (A2C) is adapted to solve the studied problem. The training curve shows the A2C-based agents have effectively learned to generate better solutions for unseen instances. The test results show that the A2C-based approach outperforms two traditional meta-heuristics, iterated greedy (IG) and genetic algorithm (GA), in solution quality and CPU times by a large margin. Specifically, the A2C-based approach outperforms IG and GA by 57.43% and 88.30%, using only 0.46 parts per thousand and 2.20 parts per thousand CPU times of IG and GA. The trained model can generate a scheduling or reconfiguration decision within 1.47 ms, which is almost instantaneous and can satisfy real-time optimisation. Our work shows a promising prospect of using DRL for intelligent scheduling and reconfiguration.
引用
下载
收藏
页码:4936 / 4953
页数:18
相关论文
共 50 条
  • [41] Solving job shop scheduling problems via deep reinforcement learning
    Yuan, Erdong
    Cheng, Shuli
    Wang, Liejun
    Song, Shiji
    Wu, Fang
    APPLIED SOFT COMPUTING, 2023, 143
  • [42] Scheduling Algorithm for Raw Material Transportation Via Deep Reinforcement Learning
    Zhang, Yi
    Chen, Yang-Yang
    Zhang, Faxiang
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2218 - 2223
  • [43] Deep Reinforcement Learning for Intelligent Communications
    Tan J.-J.
    Liang Y.-C.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2020, 49 (02): : 169 - 181
  • [44] An Intelligent Task Scheduling Mechanism for Autonomous Vehicles via Deep Learning
    Balasekaran, Gomatheeshwari
    Jayakumar, Selvakumar
    Perez de Prado, Rocio
    ENERGIES, 2021, 14 (06)
  • [45] Deep Reinforcement Learning-Based Dynamic Reconfiguration Planning for Digital Twin-Driven Smart Manufacturing Systems With Reconfigurable Machine Tools
    Huang, Jintang
    Huang, Sihan
    Moghaddam, Shokraneh K.
    Lu, Yuqian
    Wang, Guoxin
    Yan, Yan
    Shi, Xuejiang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, : 13135 - 13146
  • [46] An improved deep reinforcement learning-based scheduling approach for dynamic task scheduling in cloud manufacturing
    Wang, Xiaohan
    Zhang, Lin
    Liu, Yongkui
    Laili, Yuanjun
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024, 62 (11) : 4014 - 4030
  • [47] Intelligent scheduling of double-deck traversable cranes based on deep reinforcement learning
    Xu, Zhenyu
    Chang, Daofang
    Luo, Tian
    Gao, Yinping
    ENGINEERING OPTIMIZATION, 2023, 55 (12) : 2034 - 2050
  • [48] Intelligent deep reinforcement learning-based scheduling in relay-based HetNets
    Chen, Chao
    Wu, Zhengyang
    Yu, Xiaohan
    Ma, Bo
    Li, Chuanhuang
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2023, 2023 (01)
  • [49] Intelligent deep reinforcement learning-based scheduling in relay-based HetNets
    Chao Chen
    Zhengyang Wu
    Xiaohan Yu
    Bo Ma
    Chuanhuang Li
    EURASIP Journal on Wireless Communications and Networking, 2023
  • [50] Distribution Network Reconfiguration Using Deep Reinforcement Learning
    Gautam, Mukesh
    Benidris, Mohammed
    2022 17TH INTERNATIONAL CONFERENCE ON PROBABILISTIC METHODS APPLIED TO POWER SYSTEMS (PMAPS), 2022,