Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning

Cited by: 0
Authors
Zhicong Zhang
Li Zheng
Michael X. Weng
Affiliations
[1] Tsinghua University, Department of Industrial Engineering
[2] University of South Florida, Department of Industrial and Management Systems Engineering
Keywords
Scheduling; Parallel machine; Reinforcement learning; Q-Learning
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
In this paper, we discuss a dynamic unrelated parallel machine scheduling problem with sequence-dependent setup times and machine–job qualification considerations. To apply the Q-Learning algorithm, we convert the scheduling problem into a reinforcement learning problem by constructing a semi-Markov decision process (SMDP), which involves defining the state representation, the actions and the reward function. We use five heuristics, WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT, as actions and prove that the reward function is equivalent to the scheduling objective: minimisation of mean weighted tardiness. We carry out computational experiments to examine the performance of the Q-Learning algorithm and the heuristics. The experimental results show that Q-Learning always outperforms all five heuristics by a wide margin. Averaged over all test problems, the Q-Learning algorithm improved on WSPT, WMDD, WCOVERT, RATCS and LFJ-WCOVERT by 61.38%, 60.82%, 56.23%, 57.48% and 66.22%, respectively.
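The ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: only the five heuristic names and the objective come from the abstract. The state labels, the reward value, the learning parameters and the helper names (`mean_weighted_tardiness`, `wspt_order`, `HeuristicSelector`) are assumptions made for illustration, and the discount `gamma ** elapsed` is one common way to handle the variable time between decisions in a semi-Markov setting.

```python
import random
from collections import defaultdict

# Action names are taken from the abstract; everything else is assumed.
HEURISTICS = ["WSPT", "WMDD", "WCOVERT", "RATCS", "LFJ-WCOVERT"]


def mean_weighted_tardiness(jobs):
    """Average of w * max(0, C - d) over (weight, completion, due_date) triples."""
    return sum(w * max(0.0, c - d) for w, c, d in jobs) / len(jobs)


def wspt_order(jobs):
    """WSPT rule: sequence (weight, processing_time) jobs by decreasing w / p."""
    return sorted(jobs, key=lambda job: job[0] / job[1], reverse=True)


class HeuristicSelector:
    """Tabular Q-learning whose actions are dispatching rules (hypothetical class)."""

    def __init__(self, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)  # (state, action) -> estimated value
        self.rng = random.Random(seed)

    def choose(self, state):
        """Epsilon-greedy choice among the five heuristics."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(HEURISTICS)
        return max(HEURISTICS, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state, elapsed=1.0):
        """One Q-learning step; `elapsed` is the time between two decisions."""
        best_next = max(self.q[(next_state, a)] for a in HEURISTICS)
        target = reward + (self.gamma ** elapsed) * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


# Usage: score a finished schedule, then apply one learning step.
jobs = [(2.0, 10.0, 8.0), (1.0, 5.0, 7.0), (3.0, 12.0, 9.0)]
print(mean_weighted_tardiness(jobs))
selector = HeuristicSelector(epsilon=0.0)
selector.update("s0", "WMDD", reward=-4.0, next_state="s1", elapsed=2.0)
print(selector.choose("s0"))
```

In this sketch a negative reward stands in for tardiness accrued after a decision, so that maximising return corresponds to reducing tardiness; the abstract states that the paper proves such an equivalence for its own reward function, which is not reproduced here.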
Pages: 968 - 980
Page count: 12
Related papers
50 in total
  • [1] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
    Zhang, Zhicong
    Zheng, Li
    Weng, Michael X.
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2007, 34 (9-10) : 968 - 980
  • [2] Dynamic parallel machine scheduling with mean weighted tardiness objective by Q-Learning
    Zhang, Zhicong
    Zheng, Li
    Weng, Michael X.
    International Journal of Advanced Manufacturing Technology, 2007, 34 (9-10) : 968 - 980
  • [3] Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning
    Zhang, Zhicong
    Zheng, Li
    Li, Na
    Wang, Weiping
    Zhong, Shouyan
    Hu, Kaishun
    COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (07) : 1315 - 1324
  • [4] Unrelated parallel machine scheduling with setup times and a total weighted tardiness objective
    Kim, DW
    Na, DG
    Chen, FF
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2003, 19 (1-2) : 173 - 181
  • [5] Dynamic single machine scheduling using Q-learning agent
    Kong, LF
    Wu, J
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005 : 3237 - 3241
  • [6] Unrelated parallel machine scheduling to minimize total weighted tardiness and earliness
    Sophia University
    J. Jpn Ind. Manage. Assoc., 2007, 5 (333-341):
  • [7] Weighted earliness/tardiness parallel machine scheduling problem with a common due date
    Arik, Oguzhan Ahmet
    Schutten, Marco
    Topan, Engin
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [8] Dynamic scheduling with fuzzy clustering based Q-learning
    Wang, Guo-Lei
    Lin, Lin
    Zhong, Shi-Sheng
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2009, 15 (04) : 751 - 757
  • [9] Comparisons of metaheuristic algorithms for unrelated parallel machine weighted earliness/tardiness scheduling problems
    Oğuzhan Ahmet Arık
    Evolutionary Intelligence, 2020, 13 : 415 - 425
  • [10] Genetic algorithm for parallel-machine batching and scheduling to minimize total weighted tardiness
    Chou, Fuh-Der
    Wang, Hui-Mei
    INFORMATION TECHNOLOGY FOR MANUFACTURING SYSTEMS II, PTS 1-3, 2011, 58-60 : 1142 - +