Solving the online batching problem using deep reinforcement learning

被引：18

作者：

Cals, Bram ^{[1
]}

Zhang, Yingqian ^{[1
]}

Dijkman, Remco ^{[1
]}

van Dorst, Claudy ^{[2
]}

机构：

[1] Eindhoven Univ Technol, Sch Ind Engn, POB 513, NL-5600 MB Eindhoven, Netherlands

[2] Vanderlande Ind BV, POB 18, NL-5460 AA Veghel, Netherlands

来源：

COMPUTERS & INDUSTRIAL ENGINEERING | 2021年 / 156卷

关键词：

Deep reinforcement learning; Order batching; Sequential decision making; Machine learning; Warehousing; E-commerce; ORDER PICKING; MULTIPLE PICKERS; TARDINESS; TIME;

D O I：

10.1016/j.cie.2021.107221

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In e-commerce markets, on-time delivery is of great importance to customer satisfaction. In this paper, we present a Deep Reinforcement Learning (DRL) approach, together with a heuristic, for deciding how and when arrived orders should be batched and picked in a warehouse to minimize the number of tardy orders. In particular, the technique facilitates making decisions on whether an order should be picked individually (pick-by-order) or picked in a batch with other orders (pick-by-batch), and if so, with which other orders. We approach the problem by formulating it as a semi-Markov decision process and developing a vector-based state representation that includes the characteristics of the warehouse system. This allows us to create a deep reinforcement learning solution that learns a strategy by interacting with the environment and solve the problem with a proximal policy optimization algorithm. We evaluate the performance of the proposed DRL approach by comparing it with several batching and sequencing heuristics in different problem settings. The results show that the DRL approach can develop a strategy that produces consistent, good solutions and performs better than the proposed heuristics in most of the tested cases. We show that the strategy learned by DRL is different from the hand-crafted heuristics. In this paper, we demonstrate that the benefits from recent advancements of Deep Reinforcement Learning can be transferred to solve sequential decision-making problems in warehousing operations.

引用

页数：15

共 50 条

[31] RETRACTED ARTICLE: Solving the protein folding problem in hydrophobic-polar model using deep reinforcement learning
Reza Jafari
Mohammad Masoud Javidi
[J]. SN Applied Sciences, 2020, 2
[32] Reinforcement Learning for Solving the Vehicle Routing Problem
Nazari, Mohammadreza
Oroojlooy, Afshin
Takac, Martin
Snyder, Lawrence V.
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[33] Solving a Joint Pricing and Inventory Control Problem for Perishables via Deep Reinforcement Learning
Wang, Rui
Gan, Xianghua
Li, Qing
Yan, Xiao
[J]. COMPLEXITY, 2021, 2021
[34] Solving the RNA design problem with reinforcement learning
Eastman, Peter
Shi, Jade
Ramsundar, Bharath
Pande, Vijay S.
[J]. PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (06)
[35] Deep reinforcement learning for solving the joint scheduling problem of machines and AGVs in job shop
基于深度强化学习求解作业车间机器与AGV联合调度问题
[J]. Lei, Qi (leiqi@cqu.edu.cn), 1600, Northeast University (39):
[36] A reinforcement learning approach to cooperative problem solving
Yoshida, T
Hori, K
Nakasuka, S
[J]. INTERNATIONAL CONFERENCE ON MULTI-AGENT SYSTEMS, PROCEEDINGS, 1998, : 479 - 480
[37] Hierarchical reinforcement learning as creative problem solving
Colin, Thomas R.
Belpaeme, Tony
Cangelosi, Angelo
Hemion, Nikolas
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 86 : 196 - 206
[38] A maintenance planning framework using online and offline deep reinforcement learning
Bukhsh, Zaharah A.
Molegraaf, Hajo
Jansen, Nils
[J]. NEURAL COMPUTING & APPLICATIONS, 2023,
[39] Online Index Selection Using Deep Reinforcement Learning for a Cluster Database
Sadri, Zahra
Gruenwald, Le
Leal, Eleazar
[J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2020), 2020, : 158 - 161
[40] Online Energy Management in Commercial Buildings using Deep Reinforcement Learning
Naug, Avisek
Ahmed, Ibrahim
Biswas, Gautam
[J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2019), 2019, : 249 - 257

← 1 2 3 4 5 →