A deep reinforcement learning based hyper-heuristic for modular production control

被引:9
|
作者
Panzer, Marcel [1 ,2 ]
Bender, Benedict [1 ]
Gronau, Norbert [1 ]
机构
[1] Univ Potsdam, Chair Business Informat Proc & Syst, Potsdam, Germany
[2] Univ Potsdam, Chair Business Informat Proc & Syst, Karl Marx St 67, D-14482 Potsdam, Germany
关键词
Production control; modular production; multi-agent system; deep reinforcement learning; deep learning; multi-objective optimisation; DISPATCHING RULES; FRAMEWORK; SIMULATION; SYSTEMS;
D O I
10.1080/00207543.2023.2233641
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In nowadays production, fluctuations in demand, shortening product life-cycles, and highly configurable products require an adaptive and robust control approach to maintain competitiveness. This approach must not only optimise desired production objectives but also cope with unforeseen machine failures, rush orders, and changes in short-term demand. Previous control approaches were often implemented using a single operations layer and a standalone deep learning approach, which may not adequately address the complex organisational demands of modern manufacturing systems. To address this challenge, we propose a hyper-heuristics control model within a semi-heterarchical production system, in which multiple manufacturing and distribution agents are spread across pre-defined modules. The agents employ a deep reinforcement learning algorithm to learn a policy for selecting low-level heuristics in a situation-specific manner, thereby leveraging system performance and adaptability. We tested our approach in simulation and transferred it to a hybrid production environment. By that, we were able to demonstrate its multi-objective optimisation capabilities compared to conventional approaches in terms of mean throughput time, tardiness, and processing of prioritised orders in a multi-layered production system. The modular design is promising in reducing the overall system complexity and facilitates a quick and seamless integration into other scenarios.
引用
收藏
页码:2747 / 2768
页数:22
相关论文
共 50 条
  • [21] Learning a Hidden Markov Model-Based Hyper-heuristic
    Van Onsem, Willem
    Demoen, Bart
    De Causmaecker, Patrick
    LEARNING AND INTELLIGENT OPTIMIZATION, LION 9, 2015, 8994 : 74 - 88
  • [22] A hierarchical reinforcement learning-aware hyper-heuristic algorithm with fitness landscape analysis
    Zhu, Ningning
    Zhao, Fuqing
    Yu, Yang
    Wang, Ling
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 90
  • [23] Hyper-heuristic Image Enhancement (HHIE): A Reinforcement Learning Method for Image Contrast Enhancement
    Montazeri, Mitra
    ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 363 - 375
  • [24] Optimising Deep Learning by Hyper-heuristic Approach for Classifying Good Quality Images
    ul Hassan, Muneeb
    Sabar, Nasser R.
    Song, Andy
    COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 528 - 539
  • [25] Optimising Deep Belief Networks by Hyper-heuristic Approach
    Sabar, Nasser R.
    Turky, Ayad
    Song, Andy
    Sattar, Abdul
    2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 2738 - 2745
  • [26] A new Hyper-heuristic based on Adaptive Simulated Annealing and Reinforcement Learning for the Capacitated Electric Vehicle Routing Problem
    Rodríguez-Esparza E.
    Masegosa A.D.
    Oliva D.
    Onieva E.
    Expert Systems with Applications, 2024, 252
  • [27] Thermophotovoltaic emitter design with a hyper-heuristic custom optimizer enabled by deep learning surrogates
    Bohm, Preston
    Yang, Chiyu
    Menon, Akanksha K.
    Zhang, Zhuomin M.
    ENERGY, 2024, 291
  • [28] A Study on Online Hyper-heuristic Learning for Swarm Robots
    Yu, Shuang
    Song, Andy
    Aleti, Aldeida
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2721 - 2728
  • [29] A reinforcement learning hyper-heuristic in multi-objective optimization with application to structural damage identification
    Cao, Pei
    Zhang, Yang
    Zhou, Kai
    Tang, J.
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2023, 66 (01)
  • [30] A reinforcement learning hyper-heuristic in multi-objective optimization with application to structural damage identification
    Pei Cao
    Yang Zhang
    Kai Zhou
    J. Tang
    Structural and Multidisciplinary Optimization, 2023, 66