Use of Proximal Policy Optimization for the Joint Replenishment Problem

被引:57
|
作者
Vanvuchelen, Nathalie [1 ]
Gijsbrechts, Joren [1 ]
Boute, Robert [1 ,2 ]
机构
[1] Katholieke Univ Leuven, Res Ctr Operat Management, Fac Business & Econ, Naamsestr 69,Box 3500, B-3000 Leuven, Belgium
[2] Vlerick Business Sch, Technol & Operat Management Area, Ghent, Belgium
关键词
Collaborative Shipping; Physical Internet; Joint Replenishment Problem; Machine Learning; Deep Reinforcement Learning; Proximal Policy Optimization; INVENTORY CONTROL; ORDER; TRANSPORTATION;
D O I
10.1016/j.compind.2020.103239
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Deep reinforcement learning has been coined as a promising research avenue to solve sequential decision-making problems, especially if few is known about the optimal policy structure. We apply the proximal policy optimization algorithm to the intractable joint replenishment problem. We demonstrate how the algorithm approaches the optimal policy structure and outperforms two other heuristics. Its deployment in supply chain control towers can orchestrate and facilitate collaborative shipping in the Physical Internet. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] The submodular joint replenishment problem
    Cheung, Maurice
    Elmachtoub, Adam N.
    Levi, Retsef
    Shmoys, David B.
    [J]. MATHEMATICAL PROGRAMMING, 2016, 158 (1-2) : 207 - 233
  • [12] The submodular joint replenishment problem
    Maurice Cheung
    Adam N. Elmachtoub
    Retsef Levi
    David B. Shmoys
    [J]. Mathematical Programming, 2016, 158 : 207 - 233
  • [13] An analytical study of the Q(s, S) policy applied to the joint replenishment problem
    Nielsen, C
    Larsen, C
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2005, 163 (03) : 721 - 732
  • [14] Can-order policy for the periodic-review joint replenishment problem
    Johansen, SG
    Melchiors, P
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2003, 54 (03) : 283 - 290
  • [15] A replenishment policy based on joint optimization in a downstream pharmaceutical supply chain: centralized vs. decentralized replenishment
    Baboli, Armand
    Fondrevelle, Julien
    Tavakkoli-Moghaddam, Reza
    Mehrabi, Ali
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2011, 57 (1-4): : 367 - 378
  • [16] A replenishment policy based on joint optimization in a downstream pharmaceutical supply chain: centralized vs. decentralized replenishment
    Armand Baboli
    Julien Fondrevelle
    Reza Tavakkoli-Moghaddam
    Ali Mehrabi
    [J]. The International Journal of Advanced Manufacturing Technology, 2011, 57 : 367 - 378
  • [17] Modeling and Optimization of Stochastic Joint Replenishment and Delivery Scheduling Problem with Uncertain Costs
    Wang, Lin
    Qu, Hui
    Li, Yanhui
    He, Jing
    [J]. DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2013, 2013
  • [18] On the Complexity of the Collaborative Joint Replenishment Problem
    Otero-Palencia, Carlos
    Montoya-Torres, Jairo R.
    Amaya-Mier, Rene
    [J]. SERVICE ORIENTED, HOLONIC AND MULTI-AGENT MANUFACTURING SYSTEMS FOR INDUSTRY OF THE FUTURE, SOHOMA LATIN AMERICA 2021, 2021, 987 : 311 - 318
  • [19] Joint replenishment problem for deteriorating item
    Li, Cheng-Yan
    Xu, Xiao-Fei
    Zhan, De-Chen
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2009, 15 (02): : 412 - 416
  • [20] On optimal algorithms for the joint replenishment problem
    Viswanathan, S
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2002, 53 (11) : 1286 - 1290