A deep reinforcement learning based hyper-heuristic for modular production control

被引:9
|
作者
Panzer, Marcel [1 ,2 ]
Bender, Benedict [1 ]
Gronau, Norbert [1 ]
机构
[1] Univ Potsdam, Chair Business Informat Proc & Syst, Potsdam, Germany
[2] Univ Potsdam, Chair Business Informat Proc & Syst, Karl Marx St 67, D-14482 Potsdam, Germany
关键词
Production control; modular production; multi-agent system; deep reinforcement learning; deep learning; multi-objective optimisation; DISPATCHING RULES; FRAMEWORK; SIMULATION; SYSTEMS;
D O I
10.1080/00207543.2023.2233641
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In nowadays production, fluctuations in demand, shortening product life-cycles, and highly configurable products require an adaptive and robust control approach to maintain competitiveness. This approach must not only optimise desired production objectives but also cope with unforeseen machine failures, rush orders, and changes in short-term demand. Previous control approaches were often implemented using a single operations layer and a standalone deep learning approach, which may not adequately address the complex organisational demands of modern manufacturing systems. To address this challenge, we propose a hyper-heuristics control model within a semi-heterarchical production system, in which multiple manufacturing and distribution agents are spread across pre-defined modules. The agents employ a deep reinforcement learning algorithm to learn a policy for selecting low-level heuristics in a situation-specific manner, thereby leveraging system performance and adaptability. We tested our approach in simulation and transferred it to a hybrid production environment. By that, we were able to demonstrate its multi-objective optimisation capabilities compared to conventional approaches in terms of mean throughput time, tardiness, and processing of prioritised orders in a multi-layered production system. The modular design is promising in reducing the overall system complexity and facilitates a quick and seamless integration into other scenarios.
引用
收藏
页码:2747 / 2768
页数:22
相关论文
共 50 条
  • [31] A Selection Hyper-Heuristic for Transfer Learning in Genetic Programming
    Russell, Jeffrey
    Pillay, Nelishia
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023, : 631 - 634
  • [32] A Lifelong Learning Hyper-heuristic Method for Bin Packing
    Sim, Kevin
    Hart, Emma
    Paechter, Ben
    EVOLUTIONARY COMPUTATION, 2015, 23 (01) : 37 - 67
  • [33] Active learning based hyper-heuristic for the integration of production and Transportation: A third-party logistics perspective
    Li, Zuocheng
    Bai, Ling
    Qian, Bin
    Chen, Yuning
    COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 194
  • [34] An Apprenticeship Learning Hyper-Heuristic for Vehicle Routing in HyFlex
    Asta, Shahriar
    Ozcan, Ender
    2014 IEEE SYMPOSIUM ON EVOLVING AND AUTONOMOUS LEARNING SYSTEMS (EALS), 2014, : 65 - 72
  • [35] A tensor based hyper-heuristic for nurse rostering
    Asta, Shahriar
    Ozcan, Ender
    Curtois, Tim
    KNOWLEDGE-BASED SYSTEMS, 2016, 98 : 185 - 199
  • [36] A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses
    Li, Kunpeng
    Liu, Tengbo
    Kumar, P. N. Ram
    Han, Xuefang
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2024, 185
  • [37] A Study of Transfer Learning in an Ant-Based Generation Construction Hyper-Heuristic
    Singh, Emilio
    Pillay, Nelishia
    2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
  • [38] SIMULATION-BASED DEEP REINFORCEMENT LEARNING FOR MODULAR PRODUCTION SYSTEMS
    Feldkamp, Niclas
    Bergmann, Soeren
    Strassburger, Steffen
    2020 WINTER SIMULATION CONFERENCE (WSC), 2020, : 1596 - 1607
  • [39] Q-learning and hyper-heuristic based algorithm recommendation for changing environments
    Golcuk, Ilker
    Ozsoydan, Fehmi Burcin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 102
  • [40] Dispatching Rules for Production Scheduling: a Hyper-heuristic Landscape Analysis
    Ochoa, Gabriela
    Vazquez-Rodriguez, Jose Antonio
    Petrovic, Sanja
    Burke, Edmund
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 1873 - 1880