A deep reinforcement learning based hyper-heuristic for modular production control

被引:9
|
作者
Panzer, Marcel [1 ,2 ]
Bender, Benedict [1 ]
Gronau, Norbert [1 ]
机构
[1] Univ Potsdam, Chair Business Informat Proc & Syst, Potsdam, Germany
[2] Univ Potsdam, Chair Business Informat Proc & Syst, Karl Marx St 67, D-14482 Potsdam, Germany
关键词
Production control; modular production; multi-agent system; deep reinforcement learning; deep learning; multi-objective optimisation; DISPATCHING RULES; FRAMEWORK; SIMULATION; SYSTEMS;
D O I
10.1080/00207543.2023.2233641
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In nowadays production, fluctuations in demand, shortening product life-cycles, and highly configurable products require an adaptive and robust control approach to maintain competitiveness. This approach must not only optimise desired production objectives but also cope with unforeseen machine failures, rush orders, and changes in short-term demand. Previous control approaches were often implemented using a single operations layer and a standalone deep learning approach, which may not adequately address the complex organisational demands of modern manufacturing systems. To address this challenge, we propose a hyper-heuristics control model within a semi-heterarchical production system, in which multiple manufacturing and distribution agents are spread across pre-defined modules. The agents employ a deep reinforcement learning algorithm to learn a policy for selecting low-level heuristics in a situation-specific manner, thereby leveraging system performance and adaptability. We tested our approach in simulation and transferred it to a hybrid production environment. By that, we were able to demonstrate its multi-objective optimisation capabilities compared to conventional approaches in terms of mean throughput time, tardiness, and processing of prioritised orders in a multi-layered production system. The modular design is promising in reducing the overall system complexity and facilitates a quick and seamless integration into other scenarios.
引用
收藏
页码:2747 / 2768
页数:22
相关论文
共 50 条
  • [41] Modular production control using deep reinforcement learning: proximal policy optimization
    Sebastian Mayer
    Tobias Classen
    Christian Endisch
    Journal of Intelligent Manufacturing, 2021, 32 : 2335 - 2351
  • [42] Modular production control using deep reinforcement learning: proximal policy optimization
    Mayer, Sebastian
    Classen, Tobias
    Endisch, Christian
    JOURNAL OF INTELLIGENT MANUFACTURING, 2021, 32 (08) : 2335 - 2351
  • [43] Hyper-Heuristic Based Resource Scheduling in Grid Environment
    Aron, Rajni
    Chana, Inderveer
    Abraham, Ajith
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 1075 - 1080
  • [44] An ant based Hyper-heuristic for the travelling tournament problem
    Chen, Pai-Chun
    Kendall, Graham
    Vanden Berghe, Greet
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN SCHEDULING, 2007, : 19 - +
  • [45] A selection hyper-heuristic algorithm with Q-learning mechanism
    Zhao, Fuqing
    Liu, Yuebao
    Zhu, Ningning
    Xu, Tianpeng
    Jonrinaldi
    APPLIED SOFT COMPUTING, 2023, 147
  • [46] The Effect of Pheromone in Ant-based Hyper-heuristic
    Abd Aziz, Zalilah
    ADVANCED RESEARCH IN MATERIAL SCIENCE AND MECHANICAL ENGINEERING, PTS 1 AND 2, 2014, 446-447 : 1202 - 1206
  • [47] A Hyper-Heuristic Based on Random Gradient, Greedy and Dominance
    Ozcan, Ender
    Kheiri, Ahmed
    COMPUTER AND INFORMATION SCIENCES II, 2012, : 557 - 563
  • [48] A hyper-heuristic based framework for dynamic optimization problems
    Topcuoglu, Haluk Rahmi
    Ucar, Abdulvahid
    Altin, Lokman
    APPLIED SOFT COMPUTING, 2014, 19 : 236 - 251
  • [49] A Sequence-Based Hyper-Heuristic for Traveling Thieves
    Rodriguez, Daniel
    Cruz-Duarte, Jorge M.
    Carlos Ortiz-Bayliss, Jose
    Amaya, Ivan
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [50] A Hyper-heuristic for Dynamic Scheduling of Cyber-Physical Production Systems Using Incremental Learning
    Bouazza, Wassim
    Sallez, Yves
    Cardin, Olivier
    SERVICE ORIENTED, HOLONIC AND MULTI-AGENT MANUFACTURING SYSTEMS FOR INDUSTRY OF THE FUTURE, SOHOMA 2023, 2024, 1136 : 200 - 211