Performance of deep reinforcement learning algorithms in two-echelon inventory control systems

被引:0
|
作者
Stranieri, Francesco [1 ,2 ]
Stella, Fabio [1 ]
Kouki, Chaaben [3 ]
机构
[1] Univ Milano Bicocca, Dept Informat Syst & Commun DISCo, Viale Sarca 336, I-20126 Milan, Italy
[2] Politecn Torino, Dept Control & Comp Engn DAUIN, Corso Duca Abruzzi 24, I-10129 Turin, Italy
[3] ESSCA Sch Management, Dept Operat Management & Decis Sci OMDS, Angers, France
关键词
inventory management; inventory control systems; inventory control policies; artificial intelligence; deep learning; reinforcement learning; LEVEL; GAME;
D O I
10.1080/00207543.2024.2311180
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This study conducts a comprehensive analysis of deep reinforcement learning (DRL) algorithms applied to supply chain inventory management (SCIM), which consists of a sequential decision-making problem focussed on determining the optimal quantity of products to produce and ship across multiple capacitated local warehouses over a specific time horizon. In detail, we formulate the problem as a Markov decision process for a divergent two-echelon inventory control system characterised by stochastic and seasonal demand, also presenting a balanced allocation rule designed to prevent backorders in the first echelon. Through numerical experiments, we evaluate the performance of state-of-the-art DRL algorithms and static inventory policies in terms of both cost minimisation and training time while varying the number of local warehouses and product types and the length of replenishment lead times. Our results reveal that the Proximal Policy Optimization algorithm consistently outperforms other algorithms across all experiments, proving to be a robust method for tackling the SCIM problem. Furthermore, the (s, Q)-policy stands as a solid alternative, offering a compromise in performance and computational efficiency. Lastly, this study presents an open-source software library that provides a customisable simulation environment for addressing the SCIM problem, utilising a wide range of DRL algorithms and static inventory policies.
引用
收藏
页码:6211 / 6226
页数:16
相关论文
共 50 条