Reinforcement learning and stochastic dynamic programming for jointly scheduling jobs and preventive maintenance on a single machine to minimise earliness-tardiness

被引:3
|
作者
Sabri, Abderrazzak [1 ,2 ,3 ]
Allaoui, Hamid [1 ]
Souissi, Omar [2 ]
机构
[1] Univ Artois, Lab Genie Informat & Automatique Artois, Bethune, France
[2] Inst Natl postes Telecommun, Rabat, Morocco
[3] 801 Rue Horlogerie, F-62400 Bethune, France
关键词
Advanced planning and scheduling systems; machine learning; dynamic programming; stochastic programming; COMMON;
D O I
10.1080/00207543.2023.2172472
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper addresses the problem of stochastic jointly scheduling of resumable jobs and preventive maintenance on a single machine, subject to random breakdowns, to minimise the earliness tardiness cost. The main objective is to investigate using trending machine learning-based methods compared to stochastic optimisation approaches. We propose two different methods from both fields as we solve the same problem firstly with a stochastic dynamic programming model in an approximation way, then with an attention-based deep reinforcement learning model. We conduct a detailed experimental study according to solution quality, run time, and robustness to analyse their performances compared to those of an existing approach in the literature as a baseline. Both algorithms outperform the baseline. Moreover, the machine learning-based algorithm outperforms the stochastic dynamic programming-based heuristic as we report up to 30.5% saving in total cost, a reduction of computational time from 67 min to less than 1s on big instances, and a better robustness. These facts highlight clearly its potential for solving such problems.
引用
收藏
页码:705 / 719
页数:15
相关论文
共 50 条