Manufacturing Resource Scheduling Based on Deep Q-Network

被引：1

作者：

ZHANG Yufei ^{[1
]}

ZOU Yuanhao ^{[1
]}

ZHAO Xiaodong ^{[1
]}

机构：

[1] School of Electronic and Information Engineering, Tongji University

来源：

Wuhan University Journal of Natural Sciences | 2022年 / 27卷 / 06期

关键词：

D O I：

暂无

中图分类号：

TH186 [生产技术管理]; TP183 [人工神经网络与计算];

学科分类号：

0802 ; 081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To optimize machine allocation and task dispatching in smart manufacturing factories, this paper proposes a manufacturing resource scheduling framework based on reinforcement learning(RL). The framework formulates the entire scheduling process as a multi-stage sequential decision problem, and further obtains the scheduling order by the combination of deep convolutional neural network(CNN) and improved deep Q-network(DQN). Specifically, with respect to the representation of the Markov decision process(MDP), the feature matrix is considered as the state space and a set of heuristic dispatching rules are denoted as the action space. In addition, the deep CNN is employed to approximate the state-action values, and the double dueling deep Qnetwork with prioritized experience replay and noisy network(D3QPN2) is adopted to determine the appropriate action according to the current state. In the experiments, compared with the traditional heuristic method, the proposed method is able to learn high-quality scheduling policy and achieve shorter makespan on the standard public datasets.

引用

页码：531 / 538

页数：8

共 50 条

[31] Dueling Double Deep Q-Network Based Computation Offloading and Resource Allocation Scheme for Internet of Vehicles
Jiang, Fan
Li, Yan
Sun, Changyin
Wang, Chaowei
2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
[32] Deep Q-Network based resource allocation for UAV-assisted Ultra-Dense Networks
Chen, Xin
Liu, Xu
Chen, Ying
Jiao, Libo
Min, Geyong
COMPUTER NETWORKS, 2021, 196
[33] A Hybrid Deep Q-Network for the SVM Lagrangian
Kim, Chayoung
Kim, Hye-young
INFORMATION SCIENCE AND APPLICATIONS 2018, ICISA 2018, 2019, 514 : 643 - 651
[34] Deep Recurrent Q-Network with Truncated History
Oh, Hyunwoo
Kaneko, Tomoyuki
2018 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2018, : 34 - 39
[35] Dynamic fusion for ensemble of deep Q-network
Patrick P. K. Chan
Meng Xiao
Xinran Qin
Natasha Kees
International Journal of Machine Learning and Cybernetics, 2021, 12 : 1031 - 1040
[36] Optimal Evacuation Route Prediction in Fpso Based on Deep Q-Network
Hong, Seokyoung
Jang, Kyojin
Lee, Jiheon
Yoon, Hyungjoon
Cho, Hyungtae
Moon, Il
30TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, PTS A-C, 2020, 48 : 1867 - 1872
[37] Twice Sampling Method in Deep Q-network
Zhao Y.-N.
Liu P.
Zhao W.
Tang X.-L.
Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (10): : 1870 - 1882
[38] Dynamic fusion for ensemble of deep Q-network
Chan, Patrick P. K.
Xiao, Meng
Qin, Xinran
Kees, Natasha
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (04) : 1031 - 1040
[39] Trax Solver on Zynq with Deep Q-Network
Sugimoto, Naru
Mitsuishi, Takuji
Kaneda, Takahiro
Tsuruta, Chiharu
Sakai, Ryotaro
Shimura, Hideki
Amano, Hideharu
2015 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (FPT), 2015, : 272 - 275
[40] Deep Q-Network Using Reward Distribution
Nakaya, Yuta
Osana, Yuko
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2018, PT I, 2018, 10841 : 160 - 169

← 1 2 3 4 5 →