A deep reinforcement learning hyper-heuristic with feature fusion for online packing problems

被引:11
|
作者
Tu, Chaofan [1 ]
Bai, Ruibin [1 ]
Aickelin, Uwe [2 ]
Zhang, Yuchang [1 ]
Du, Heshan [1 ]
机构
[1] Univ Nottingham Ningbo China, China & Nottingham Ningbo China Beacons Excellence, Sch Comp Sci, Ningbo, Peoples R China
[2] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Australia
基金
中国国家自然科学基金;
关键词
Hyper; -heuristic; Deep reinforcement learning; Feature fusion; Knapsack problem; Strip packing problem; KNAPSACK; ALGORITHMS; APPROXIMATION; AUCTIONS; DESIGN;
D O I
10.1016/j.eswa.2023.120568
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep reinforcement learning has shown great potential in solving computer games with sequential decision-making scenarios. Hyper-heuristic is a generic search framework, capable of intelligently selecting or generating algorithms to solve a class of optimisation problems with stochastic or dynamic settings. This paper proposes a new general framework for solving online packing problems using deep reinforcement learning hyper-heuristics. Although analytical approaches can address most offline packing problems successfully, their online versions have proved much more challenging and the performance of the existing methods is often not satisfactory. In this paper, we extend a recent deep reinforcement learning hyper-heuristic framework by fusing the visual information of real-time packing with distributional information of random parameters of the problem. Computational experiments show that our method outperforms the state of the art online methods with reductions in optimality gap between 2%-19% for knapsack problem and 0.7% for the online strip packing problem. In addition, a new visual analysis presentation is also devised to better interpret the learned packing strategies, which can reveal more information than the widely used landscape analysis. As online packing problems are widely available in production environments, the proposed approach can serve as an important reference to solve other similar combinatorial optimisation problems for which visual layout inputs would aid learning.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Hyper-heuristic for CVRP with reinforcement learning
    Zhang, Jingling
    Feng, Qinbing
    Zhao, Yanwei
    Liu, Jinlong
    Leng, Longlong
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2020, 26 (04): : 1118 - 1129
  • [2] A Hyper-Heuristic Approach to Strip Packing Problems
    Burke, Edmund K.
    Guo, Qiang
    Kendall, Graham
    [J]. PARALLEL PROBLEMS SOLVING FROM NATURE - PPSN XI, PT I, 2010, 6238 : 465 - 474
  • [3] A deep reinforcement learning based hyper-heuristic for modular production control
    Panzer, Marcel
    Bender, Benedict
    Gronau, Norbert
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024, 62 (08) : 2747 - 2768
  • [4] A deep reinforcement learning based hyper-heuristic for combinatorial optimisation with uncertainties
    Zhang, Yuchang
    Bai, Ruibin
    Qu, Rong
    Tu, Chaofan
    Jin, Jiahuan
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 300 (02) : 418 - 427
  • [5] A Lifelong Learning Hyper-heuristic Method for Bin Packing
    Sim, Kevin
    Hart, Emma
    Paechter, Ben
    [J]. EVOLUTIONARY COMPUTATION, 2015, 23 (01) : 37 - 67
  • [6] A Reinforcement Learning Hyper-heuristic for the Optimisation of Flight Connections
    Pylyavskyy, Yaroslav
    Kheiri, Ahmed
    Ahmed, Leena
    [J]. 2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [7] Automatic design of hyper-heuristic based on reinforcement learning
    Choong, Shin Siang
    Wong, Li-Pei
    Lim, Chee Peng
    [J]. INFORMATION SCIENCES, 2018, 436 : 89 - 107
  • [8] A unified hyper-heuristic framework for solving bin packing problems
    Lopez-Camacho, Eunice
    Terashima-Marin, Hugo
    Ross, Peter
    Ochoa, Gabriela
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (15) : 6876 - 6889
  • [9] A deep reinforcement learning hyper-heuristic to solve order batching problem with mobile robots
    Cheng, Bayi
    Wang, Lingjun
    Tan, Qi
    Zhou, Mi
    [J]. APPLIED INTELLIGENCE, 2024, 54 (9-10) : 6865 - 6887
  • [10] A Study on Online Hyper-heuristic Learning for Swarm Robots
    Yu, Shuang
    Song, Andy
    Aleti, Aldeida
    [J]. 2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2721 - 2728