Solving the online batching problem using deep reinforcement learning

被引:18
|
作者
Cals, Bram [1 ]
Zhang, Yingqian [1 ]
Dijkman, Remco [1 ]
van Dorst, Claudy [2 ]
机构
[1] Eindhoven Univ Technol, Sch Ind Engn, POB 513, NL-5600 MB Eindhoven, Netherlands
[2] Vanderlande Ind BV, POB 18, NL-5460 AA Veghel, Netherlands
关键词
Deep reinforcement learning; Order batching; Sequential decision making; Machine learning; Warehousing; E-commerce; ORDER PICKING; MULTIPLE PICKERS; TARDINESS; TIME;
D O I
10.1016/j.cie.2021.107221
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In e-commerce markets, on-time delivery is of great importance to customer satisfaction. In this paper, we present a Deep Reinforcement Learning (DRL) approach, together with a heuristic, for deciding how and when arrived orders should be batched and picked in a warehouse to minimize the number of tardy orders. In particular, the technique facilitates making decisions on whether an order should be picked individually (pick-by-order) or picked in a batch with other orders (pick-by-batch), and if so, with which other orders. We approach the problem by formulating it as a semi-Markov decision process and developing a vector-based state representation that includes the characteristics of the warehouse system. This allows us to create a deep reinforcement learning solution that learns a strategy by interacting with the environment and solve the problem with a proximal policy optimization algorithm. We evaluate the performance of the proposed DRL approach by comparing it with several batching and sequencing heuristics in different problem settings. The results show that the DRL approach can develop a strategy that produces consistent, good solutions and performs better than the proposed heuristics in most of the tested cases. We show that the strategy learned by DRL is different from the hand-crafted heuristics. In this paper, we demonstrate that the benefits from recent advancements of Deep Reinforcement Learning can be transferred to solve sequential decision-making problems in warehousing operations.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Deep reinforcement learning for a color-batching resequencing problem
    Leng, Jinling
    Jin, Chun
    Vogl, Alexander
    Liu, Huiyu
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2020, 56 : 175 - 187
  • [2] Deep Reinforcement Learning for Solving AGVs Routing Problem
    Lu, Chengxuan
    Long, Jinjun
    Xing, Zichao
    Wu, Weimin
    Gu, Yong
    Luo, Jiliang
    Huang, Yisheng
    [J]. VERIFICATION AND EVALUATION OF COMPUTER AND COMMUNICATION SYSTEMS, VECOS 2020, 2020, 12519 : 222 - 236
  • [3] Solving Permutation Flowshop Problem with Deep Reinforcement Learning
    Pan, Ruyuan
    Dong, Xingye
    Han, Sheng
    [J]. 2020 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-BESANCON 2020), 2020, : 349 - 353
  • [4] Deep Graph Reinforcement Learning for Solving Multicut Problem
    Li, Zhenchen
    Yang, Xu
    Zhang, Yanchao
    Zeng, Shaofeng
    Yuan, Jingbin
    Liu, Jiazheng
    Liu, Zhiyong
    Han, Hua
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [5] Using deep reinforcement learning approach for solving the multiple sequence alignment problem
    Jafari, Reza
    Javidi, Mohammad Masoud
    Rafsanjani, Marjan Kuchaki
    [J]. SN APPLIED SCIENCES, 2019, 1 (06):
  • [6] Cherrypick: Solving the Steiner Tree Problem in Graphs using Deep Reinforcement Learning
    Yan, Zong
    Du, Haizhou
    Zhang, Jiahao
    Li, Guoqing
    [J]. PROCEEDINGS OF THE 2021 IEEE 16TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2021), 2021, : 35 - 40
  • [7] Using deep reinforcement learning approach for solving the multiple sequence alignment problem
    Reza Jafari
    Mohammad Masoud Javidi
    Marjan Kuchaki Rafsanjani
    [J]. SN Applied Sciences, 2019, 1
  • [8] Deep reinforcement learning for solving the single container loading problem
    Hajlaoui, Yakin
    Jaoua, Amel
    Layeb, Safa Bhar
    [J]. ENGINEERING OPTIMIZATION, 2023, 55 (04) : 668 - 684
  • [9] Solving the train dispatching problem via deep reinforcement learning
    Agasucci, Valerio
    Grani, Giorgio
    Lamorgese, Leonardo
    [J]. JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2023, 26
  • [10] Solving the Deadlock Problem with Deep Reinforcement Learning Using Information from Multiple Vehicles
    Goto, Tsuyoshi
    Itaya, Hidenori
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    [J]. 2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1026 - 1032