Solving the online batching problem using deep reinforcement learning

被引：18

作者：

Cals, Bram ^{[1
]}

Zhang, Yingqian ^{[1
]}

Dijkman, Remco ^{[1
]}

van Dorst, Claudy ^{[2
]}

机构：

[1] Eindhoven Univ Technol, Sch Ind Engn, POB 513, NL-5600 MB Eindhoven, Netherlands

[2] Vanderlande Ind BV, POB 18, NL-5460 AA Veghel, Netherlands

来源：

COMPUTERS & INDUSTRIAL ENGINEERING | 2021年 / 156卷

关键词：

Deep reinforcement learning; Order batching; Sequential decision making; Machine learning; Warehousing; E-commerce; ORDER PICKING; MULTIPLE PICKERS; TARDINESS; TIME;

D O I：

10.1016/j.cie.2021.107221

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In e-commerce markets, on-time delivery is of great importance to customer satisfaction. In this paper, we present a Deep Reinforcement Learning (DRL) approach, together with a heuristic, for deciding how and when arrived orders should be batched and picked in a warehouse to minimize the number of tardy orders. In particular, the technique facilitates making decisions on whether an order should be picked individually (pick-by-order) or picked in a batch with other orders (pick-by-batch), and if so, with which other orders. We approach the problem by formulating it as a semi-Markov decision process and developing a vector-based state representation that includes the characteristics of the warehouse system. This allows us to create a deep reinforcement learning solution that learns a strategy by interacting with the environment and solve the problem with a proximal policy optimization algorithm. We evaluate the performance of the proposed DRL approach by comparing it with several batching and sequencing heuristics in different problem settings. The results show that the DRL approach can develop a strategy that produces consistent, good solutions and performs better than the proposed heuristics in most of the tested cases. We show that the strategy learned by DRL is different from the hand-crafted heuristics. In this paper, we demonstrate that the benefits from recent advancements of Deep Reinforcement Learning can be transferred to solve sequential decision-making problems in warehousing operations.

引用

页数：15

共 50 条

[1] Deep reinforcement learning for a color-batching resequencing problem
Leng, Jinling
Jin, Chun
Vogl, Alexander
Liu, Huiyu
[J]. JOURNAL OF MANUFACTURING SYSTEMS, 2020, 56 : 175 - 187
[2] Deep Reinforcement Learning for Solving AGVs Routing Problem
Lu, Chengxuan
Long, Jinjun
Xing, Zichao
Wu, Weimin
Gu, Yong
Luo, Jiliang
Huang, Yisheng
[J]. VERIFICATION AND EVALUATION OF COMPUTER AND COMMUNICATION SYSTEMS, VECOS 2020, 2020, 12519 : 222 - 236
[3] Solving Permutation Flowshop Problem with Deep Reinforcement Learning
Pan, Ruyuan
Dong, Xingye
Han, Sheng
[J]. 2020 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-BESANCON 2020), 2020, : 349 - 353
[4] Deep Graph Reinforcement Learning for Solving Multicut Problem
Li, Zhenchen
Yang, Xu
Zhang, Yanchao
Zeng, Shaofeng
Yuan, Jingbin
Liu, Jiazheng
Liu, Zhiyong
Han, Hua
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[5] Using deep reinforcement learning approach for solving the multiple sequence alignment problem
Jafari, Reza
Javidi, Mohammad Masoud
Rafsanjani, Marjan Kuchaki
[J]. SN APPLIED SCIENCES, 2019, 1 (06):
[6] Cherrypick: Solving the Steiner Tree Problem in Graphs using Deep Reinforcement Learning
Yan, Zong
Du, Haizhou
Zhang, Jiahao
Li, Guoqing
[J]. PROCEEDINGS OF THE 2021 IEEE 16TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2021), 2021, : 35 - 40
[7] Using deep reinforcement learning approach for solving the multiple sequence alignment problem
Reza Jafari
Mohammad Masoud Javidi
Marjan Kuchaki Rafsanjani
[J]. SN Applied Sciences, 2019, 1
[8] Deep reinforcement learning for solving the single container loading problem
Hajlaoui, Yakin
Jaoua, Amel
Layeb, Safa Bhar
[J]. ENGINEERING OPTIMIZATION, 2023, 55 (04) : 668 - 684
[9] Solving the train dispatching problem via deep reinforcement learning
Agasucci, Valerio
Grani, Giorgio
Lamorgese, Leonardo
[J]. JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2023, 26
[10] Solving the Deadlock Problem with Deep Reinforcement Learning Using Information from Multiple Vehicles
Goto, Tsuyoshi
Itaya, Hidenori
Hirakawa, Tsubasa
Yamashita, Takayoshi
Fujiyoshi, Hironobu
[J]. 2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1026 - 1032

← 1 2 3 4 5 →