A deep reinforcement learning based hyper-heuristic for modular production control

被引:9
|
作者
Panzer, Marcel [1 ,2 ]
Bender, Benedict [1 ]
Gronau, Norbert [1 ]
机构
[1] Univ Potsdam, Chair Business Informat Proc & Syst, Potsdam, Germany
[2] Univ Potsdam, Chair Business Informat Proc & Syst, Karl Marx St 67, D-14482 Potsdam, Germany
关键词
Production control; modular production; multi-agent system; deep reinforcement learning; deep learning; multi-objective optimisation; DISPATCHING RULES; FRAMEWORK; SIMULATION; SYSTEMS;
D O I
10.1080/00207543.2023.2233641
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In nowadays production, fluctuations in demand, shortening product life-cycles, and highly configurable products require an adaptive and robust control approach to maintain competitiveness. This approach must not only optimise desired production objectives but also cope with unforeseen machine failures, rush orders, and changes in short-term demand. Previous control approaches were often implemented using a single operations layer and a standalone deep learning approach, which may not adequately address the complex organisational demands of modern manufacturing systems. To address this challenge, we propose a hyper-heuristics control model within a semi-heterarchical production system, in which multiple manufacturing and distribution agents are spread across pre-defined modules. The agents employ a deep reinforcement learning algorithm to learn a policy for selecting low-level heuristics in a situation-specific manner, thereby leveraging system performance and adaptability. We tested our approach in simulation and transferred it to a hybrid production environment. By that, we were able to demonstrate its multi-objective optimisation capabilities compared to conventional approaches in terms of mean throughput time, tardiness, and processing of prioritised orders in a multi-layered production system. The modular design is promising in reducing the overall system complexity and facilitates a quick and seamless integration into other scenarios.
引用
收藏
页码:2747 / 2768
页数:22
相关论文
共 50 条
  • [1] A deep reinforcement learning based hyper-heuristic for modular production control (Jul, 10.1080/00207543.2023.2233641, 2023)
    Panzer, M.
    Bender, B.
    Gronau, N.
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2023,
  • [2] A deep reinforcement learning based hyper-heuristic for combinatorial optimisation with uncertainties
    Zhang, Yuchang
    Bai, Ruibin
    Qu, Rong
    Tu, Chaofan
    Jin, Jiahuan
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 300 (02) : 418 - 427
  • [3] Hyper-heuristic for CVRP with reinforcement learning
    Zhang J.
    Feng Q.
    Zhao Y.
    Liu J.
    Leng L.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2020, 26 (04): : 1118 - 1129
  • [4] Automatic design of hyper-heuristic based on reinforcement learning
    Choong, Shin Siang
    Wong, Li-Pei
    Lim, Chee Peng
    INFORMATION SCIENCES, 2018, 436 : 89 - 107
  • [5] PHH: Policy-Based Hyper-Heuristic With Reinforcement Learning
    Udomkasemsub, Orachun
    Sirinaovakul, Booncharoen
    Achalakul, Tiranee
    IEEE ACCESS, 2023, 11 : 52026 - 52049
  • [6] A deep reinforcement learning hyper-heuristic with feature fusion for online packing problems
    Tu, Chaofan
    Bai, Ruibin
    Aickelin, Uwe
    Zhang, Yuchang
    Du, Heshan
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
  • [7] A Reinforcement Learning Hyper-heuristic for the Optimisation of Flight Connections
    Pylyavskyy, Yaroslav
    Kheiri, Ahmed
    Ahmed, Leena
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [8] A deep reinforcement learning hyper-heuristic to solve order batching problem with mobile robots
    Cheng, Bayi
    Wang, Lingjun
    Tan, Qi
    Zhou, Mi
    APPLIED INTELLIGENCE, 2024, 54 (9-10) : 6865 - 6887
  • [9] Hyper-Heuristic Task Scheduling Algorithm Based on Reinforcement Learning in Cloud Computing
    Yin, Lei
    Sun, Chang
    Gao, Ming
    Fang, Yadong
    Li, Ming
    Zhou, Fengyu
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (02): : 1587 - 1608
  • [10] Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach
    Cui, Tianxiang
    Du, Nanjiang
    Yang, Xiaoying
    Ding, Shusheng
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2024, 198