A deep reinforcement learning hyper-heuristic to solve order batching problem with mobile robots

被引:0
|
作者
Cheng, Bayi [1 ]
Wang, Lingjun [1 ]
Tan, Qi [1 ]
Zhou, Mi [1 ]
机构
[1] Hefei Univ Technol, Hefei, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
E-commerce logistics; Order batching; Mobile robots; Hyper-heuristic method; Deep reinforcement learning;
D O I
10.1007/s10489-024-05532-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In e-commerce logistics, it is critical to enhance the efficiency of the order-picking system. Motivated by applications of automatic logistics, we consider the mobile robot based order batching problem. In this problem, mobile robots carry shelves to the picking station for order picking and then return them. The objective is to reduce shelf movements while minimizing the number of delayed orders. We introduce a hyper-heuristic method based on deep reinforcement learning to optimize the order batching strategy in the system. The proposed method adaptively selects the order batching strategy, significantly improving the sequential decision-making process in order picking. Through extensive tests, we demonstrate the superiority of the proposed method over several existing heuristic methods in a range of test scenarios. The results show that the proposed method outperforms other existing heuristic methods in a range of test scenarios, offering more stable and effective solutions. This study is a pioneer in the application of deep reinforcement learning to the mobile robot based order batching problem, offering a novel perspective and methodology to overcome the challenges of sequential decision-making optimization in order picking systems.
引用
收藏
页码:6865 / 6887
页数:23
相关论文
共 50 条
  • [1] Hyper-heuristic for CVRP with reinforcement learning
    Zhang, Jingling
    Feng, Qinbing
    Zhao, Yanwei
    Liu, Jinlong
    Leng, Longlong
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2020, 26 (04): : 1118 - 1129
  • [2] Mobile robot sequential decision making using a deep reinforcement learning hyper-heuristic approach
    Cui, Tianxiang
    Yang, Xiaoying
    Jia, Fuhua
    Jin, Jiahuan
    Ye, Yujian
    Bai, Ruibin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 257
  • [3] A deep reinforcement learning based hyper-heuristic for modular production control
    Panzer, Marcel
    Bender, Benedict
    Gronau, Norbert
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024, 62 (08) : 2747 - 2768
  • [4] A deep reinforcement learning based hyper-heuristic for combinatorial optimisation with uncertainties
    Zhang, Yuchang
    Bai, Ruibin
    Qu, Rong
    Tu, Chaofan
    Jin, Jiahuan
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 300 (02) : 418 - 427
  • [5] A Study on Online Hyper-heuristic Learning for Swarm Robots
    Yu, Shuang
    Song, Andy
    Aleti, Aldeida
    [J]. 2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2721 - 2728
  • [6] A deep reinforcement learning hyper-heuristic with feature fusion for online packing problems
    Tu, Chaofan
    Bai, Ruibin
    Aickelin, Uwe
    Zhang, Yuchang
    Du, Heshan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
  • [7] A Reinforcement Learning Hyper-heuristic for the Optimisation of Flight Connections
    Pylyavskyy, Yaroslav
    Kheiri, Ahmed
    Ahmed, Leena
    [J]. 2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [8] Automatic design of hyper-heuristic based on reinforcement learning
    Choong, Shin Siang
    Wong, Li-Pei
    Lim, Chee Peng
    [J]. INFORMATION SCIENCES, 2018, 436 : 89 - 107
  • [9] A reinforcement learning hyper-heuristic algorithm for the distributed flowshops scheduling problem under consideration of emergency order insertion
    Zhao, Fuqing
    Liu, Yuebao
    Xu, Tianpeng
    Jonrinaldi
    [J]. Applied Soft Computing, 2024, 167
  • [10] A novel reinforcement learning-based hyper-heuristic for heterogeneous vehicle routing problem
    Qin, Wei
    Zhuang, Zilong
    Huang, Zizhao
    Huang, Haozhe
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 156