Manufacturing Resource Scheduling Based on Deep Q-Network

被引:1
|
作者
ZHANG Yufei [1 ]
ZOU Yuanhao [1 ]
ZHAO Xiaodong [1 ]
机构
[1] School of Electronic and Information Engineering, Tongji University
关键词
D O I
暂无
中图分类号
TH186 [生产技术管理]; TP183 [人工神经网络与计算];
学科分类号
0802 ; 081104 ; 0812 ; 0835 ; 1405 ;
摘要
To optimize machine allocation and task dispatching in smart manufacturing factories, this paper proposes a manufacturing resource scheduling framework based on reinforcement learning(RL). The framework formulates the entire scheduling process as a multi-stage sequential decision problem, and further obtains the scheduling order by the combination of deep convolutional neural network(CNN) and improved deep Q-network(DQN). Specifically, with respect to the representation of the Markov decision process(MDP), the feature matrix is considered as the state space and a set of heuristic dispatching rules are denoted as the action space. In addition, the deep CNN is employed to approximate the state-action values, and the double dueling deep Qnetwork with prioritized experience replay and noisy network(D3QPN2) is adopted to determine the appropriate action according to the current state. In the experiments, compared with the traditional heuristic method, the proposed method is able to learn high-quality scheduling policy and achieve shorter makespan on the standard public datasets.
引用
收藏
页码:531 / 538
页数:8
相关论文
共 50 条
  • [31] Dueling Double Deep Q-Network Based Computation Offloading and Resource Allocation Scheme for Internet of Vehicles
    Jiang, Fan
    Li, Yan
    Sun, Changyin
    Wang, Chaowei
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [32] Deep Q-Network based resource allocation for UAV-assisted Ultra-Dense Networks
    Chen, Xin
    Liu, Xu
    Chen, Ying
    Jiao, Libo
    Min, Geyong
    COMPUTER NETWORKS, 2021, 196
  • [33] A Hybrid Deep Q-Network for the SVM Lagrangian
    Kim, Chayoung
    Kim, Hye-young
    INFORMATION SCIENCE AND APPLICATIONS 2018, ICISA 2018, 2019, 514 : 643 - 651
  • [34] Deep Recurrent Q-Network with Truncated History
    Oh, Hyunwoo
    Kaneko, Tomoyuki
    2018 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2018, : 34 - 39
  • [35] Dynamic fusion for ensemble of deep Q-network
    Patrick P. K. Chan
    Meng Xiao
    Xinran Qin
    Natasha Kees
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 1031 - 1040
  • [36] Optimal Evacuation Route Prediction in Fpso Based on Deep Q-Network
    Hong, Seokyoung
    Jang, Kyojin
    Lee, Jiheon
    Yoon, Hyungjoon
    Cho, Hyungtae
    Moon, Il
    30TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, PTS A-C, 2020, 48 : 1867 - 1872
  • [37] Twice Sampling Method in Deep Q-network
    Zhao Y.-N.
    Liu P.
    Zhao W.
    Tang X.-L.
    Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (10): : 1870 - 1882
  • [38] Dynamic fusion for ensemble of deep Q-network
    Chan, Patrick P. K.
    Xiao, Meng
    Qin, Xinran
    Kees, Natasha
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (04) : 1031 - 1040
  • [39] Trax Solver on Zynq with Deep Q-Network
    Sugimoto, Naru
    Mitsuishi, Takuji
    Kaneda, Takahiro
    Tsuruta, Chiharu
    Sakai, Ryotaro
    Shimura, Hideki
    Amano, Hideharu
    2015 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (FPT), 2015, : 272 - 275
  • [40] Deep Q-Network Using Reward Distribution
    Nakaya, Yuta
    Osana, Yuko
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2018, PT I, 2018, 10841 : 160 - 169