Solving the online batching problem using deep reinforcement learning

Cited by: 18
Authors
Cals, Bram [1]
Zhang, Yingqian [1]
Dijkman, Remco [1]
van Dorst, Claudy [2]
Affiliations
[1] Eindhoven Univ Technol, Sch Ind Engn, POB 513, NL-5600 MB Eindhoven, Netherlands
[2] Vanderlande Ind BV, POB 18, NL-5460 AA Veghel, Netherlands
Keywords
Deep reinforcement learning; Order batching; Sequential decision making; Machine learning; Warehousing; E-commerce; Order picking; Multiple pickers; Tardiness; Time
DOI
10.1016/j.cie.2021.107221
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
In e-commerce markets, on-time delivery is of great importance to customer satisfaction. In this paper, we present a Deep Reinforcement Learning (DRL) approach, together with a heuristic, for deciding how and when arrived orders should be batched and picked in a warehouse to minimize the number of tardy orders. In particular, the technique facilitates making decisions on whether an order should be picked individually (pick-by-order) or picked in a batch with other orders (pick-by-batch), and if so, with which other orders. We approach the problem by formulating it as a semi-Markov decision process and developing a vector-based state representation that includes the characteristics of the warehouse system. This allows us to create a deep reinforcement learning solution that learns a strategy by interacting with the environment and solve the problem with a proximal policy optimization algorithm. We evaluate the performance of the proposed DRL approach by comparing it with several batching and sequencing heuristics in different problem settings. The results show that the DRL approach can develop a strategy that produces consistent, good solutions and performs better than the proposed heuristics in most of the tested cases. We show that the strategy learned by DRL is different from the hand-crafted heuristics. In this paper, we demonstrate that the benefits from recent advancements of Deep Reinforcement Learning can be transferred to solve sequential decision-making problems in warehousing operations.
Pages: 15