Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning

Cited by: 0

Authors:

Zhicong Zhang

Li Zheng

Michael X. Weng

Affiliations:

[1] Tsinghua University, Department of Industrial Engineering

[2] University of South Florida, Department of Industrial and Management Systems Engineering

Source:

The International Journal of Advanced Manufacturing Technology | 2007 / Volume 34

Keywords:

Scheduling; Parallel machine; Reinforcement learning; Q-Learning

DOI:

Not available

Abstract:

In this paper, we discuss a dynamic unrelated parallel machine scheduling problem with sequence-dependent setup times and machine–job qualification considerations. To apply the Q-Learning algorithm, we convert the scheduling problem into a reinforcement learning problem by constructing a semi-Markov decision process (SMDP), including the definition of the state representation, the actions and the reward function. We use five heuristics, WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT, as actions and prove the equivalence of the reward function and the scheduling objective: minimisation of mean weighted tardiness. We carry out computational experiments to examine the performance of the Q-Learning algorithm and the heuristics. Experimental results show that Q-Learning consistently outperforms all five heuristics by a wide margin. Averaged over all test problems, the Q-Learning algorithm improved on WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT by 61.38%, 60.82%, 56.23%, 57.48% and 66.22%, respectively.
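The ingredients named in the abstract can be illustrated with a minimal Python sketch. It shows the mean weighted tardiness objective, the WSPT rule as one example of a heuristic used as an action, and a textbook one-step tabular Q-learning update over the five heuristic labels. This is not the authors' implementation: the state labels, the reward values, the learning parameters `alpha` and `gamma`, and the job dictionary keys are all hypothetical, and the update shown is the plain Q-learning form rather than the SMDP variant the paper constructs.

```python
from collections import defaultdict

# Action labels taken from the abstract; only WSPT is sketched below.
ACTIONS = ["WSPT", "WMDD", "WCOVERT", "RATCS", "LFJ-WCOVERT"]


def mean_weighted_tardiness(jobs):
    """Average of w * max(0, C - d) over (weight, completion, due_date) tuples."""
    jobs = list(jobs)
    total = sum(w * max(0.0, c - d) for w, c, d in jobs)
    return total / len(jobs)


def wspt_pick(queue):
    """WSPT: choose the waiting job with the largest weight / processing-time ratio.

    queue: list of dicts with hypothetical keys 'weight' and 'proc_time'.
    """
    return max(queue, key=lambda job: job["weight"] / job["proc_time"])


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One-step tabular Q-learning update (textbook form, not the SMDP variant)."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    target = reward + gamma * best_next
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


def greedy_action(q, state):
    """Return the heuristic label with the highest current Q-value in this state."""
    return max(ACTIONS, key=lambda a: q[(state, a)])


q_table = defaultdict(float)
# Hypothetical transitions: the reward is a made-up negative tardiness penalty.
q_update(q_table, state="s0", action="WMDD", reward=-2.0, next_state="s1")
q_update(q_table, state="s0", action="WSPT", reward=-0.5, next_state="s1")
```

Because rewards here are non-positive and the table starts at zero, untried heuristics look best to `greedy_action` at first, which pushes the agent to try each rule at least once. A real experiment would also need a shop-floor simulator and an exploration policy, neither of which is shown.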

Pages: 968-980

Number of pages: 12

50 items in total

[1] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
Zhang, Zhicong
Zheng, Li
Weng, Michael X.
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 34 (9-10) : 968 - 980
[2] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
Zhang, Zhicong
Zheng, Li
Weng, Michael X.
International Journal of Advanced Manufacturing Technology, 2007, 34 (9-10) : 968 - 980
[3] Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning
Zhang, Zhicong
Zheng, Li
Li, Na
Wang, Weiping
Zhong, Shouyan
Hu, Kaishun
COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (07) : 1315 - 1324
[4] Unrelated parallel machine scheduling with setup times and a total weighted tardiness objective
Kim, DW
Na, DG
Chen, FF
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2003, 19 (1-2) : 173 - 181
[5] Dynamic single machine scheduling using Q-learning agent
Kong, LF
Wu, J
Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005 : 3237 - 3241
[6] Unrelated parallel machine scheduling to minimize total weighted tardiness and earliness
Sophia University
J. Jpn Ind. Manage. Assoc., 2007, 5 (333-341):
[7] Weighted earliness/tardiness parallel machine scheduling problem with a common due date
Arik, Oguzhan Ahmet
Schutten, Marco
Topan, Engin
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
[8] Dynamic scheduling with fuzzy clustering based Q-learning
Wang, Guo-Lei
Lin, Lin
Zhong, Shi-Sheng
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2009, 15 (04) : 751 - 757
[9] Comparisons of metaheuristic algorithms for unrelated parallel machine weighted earliness/tardiness scheduling problems
Oğuzhan Ahmet Arık
Evolutionary Intelligence, 2020, 13 : 415 - 425
[10] Genetic algorithm for parallel-machine batching and scheduling to minimize total weighted tardiness
Chou, Fuh-Der
Wang, Hui-Mei
INFORMATION TECHNOLOGY FOR MANUFACTURING SYSTEMS II, PTS 1-3, 2011, 58-60 : 1142 - +
