Dynamic production scheduling towards self-organizing mass personalization: A multi-agent dueling deep reinforcement learning approach

被引:28
|
作者
Qin, Zhaojun [1 ]
Johnson, Dazzle [1 ]
Lu, Yuqian [1 ]
机构
[1] Univ Auckland, Dept Mech & Mechatron Engn, Auckland, New Zealand
关键词
Mass personalization; Self-organizing manufacturing network; Dynamic flexible job shop scheduling problem; Multi-agent production scheduling; Reinforcement learning; OF-THE-ART; MANUFACTURING SYSTEMS; MACHINE BREAKDOWNS; GENETIC ALGORITHMS; WORKLOAD CONTROL; BOND GRAPHS; SHOP; AGENT; ARCHITECTURE; OPTIMIZATION;
D O I
10.1016/j.jmsy.2023.03.003
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Mass personalization is rapidly approaching. In response, manufacturing systems should be capable of autono-mously changing production plans, configurations and schedules under dynamic manufacturing environments for producing personalized products. Self-organizing manufacturing network is a promising paradigm for mass personalization. The backbone of a self-organizing manufacturing network is an adaptive production scheduling method to dynamically allocate and sequence manufacturing jobs under dynamic settings, such as stochastic processing time or unplanned machine breakdown. However, existing production scheduling methods (i.e., heuristic rules, meta-heuristic algorithms, and existing reinforcement learning models) fail to automatically optimize production schedules while maintaining stable manufacturing performance, under dynamic settings. In this paper, we designed a reinforcement learning-based static-training-dynamic-execution approach for dynamic job shop scheduling problems. The scheduling policies are learned from static scheduling instances by a multi -agent dueling deep reinforcement learning approach. Under this approach, we proposed new representations of observation, action, reward, and cooperation mechanisms between agents. The learned scheduling policies are then deployed to a dynamic scheduling system where stochastic processing time and unplanned machine breakdown randomly occur. Extensive simulation experiments demonstrated that our approach outperforms heuristic rules on makespan under two dynamic manufacturing settings.
引用
收藏
页码:242 / 257
页数:16
相关论文
共 50 条
  • [41] HALFTONING WITH MULTI-AGENT DEEP REINFORCEMENT LEARNING
    Jiang, Haitian
    Xiong, Dongliang
    Jiang, Xiaowen
    Yin, Aiguo
    Ding, Li
    Huang, Kai
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 641 - 645
  • [42] Deep reinforcement learning for multi-agent interaction
    Ahmed, Ibrahim H.
    Brewitt, Cillian
    Carlucho, Ignacio
    Christianos, Filippos
    Dunion, Mhairi
    Fosong, Elliot
    Garcin, Samuel
    Guo, Shangmin
    Gyevnar, Balint
    McInroe, Trevor
    Papoudakis, Georgios
    Rahman, Arrasy
    Schafer, Lukas
    Tamborski, Massimiliano
    Vecchio, Giuseppe
    Wang, Cheng
    Albrecht, Stefano, V
    AI COMMUNICATIONS, 2022, 35 (04) : 357 - 368
  • [43] Application of Multi-agent Reinforcement Learning to the Dynamic Scheduling Problem in Manufacturing Systems
    Heik, David
    Bahrpeyma, Fouad
    Reichelt, Dirk
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT II, 2024, 14506 : 237 - 254
  • [44] Multi-agent deep reinforcement learning: a survey
    Sven Gronauer
    Klaus Diepold
    Artificial Intelligence Review, 2022, 55 : 895 - 943
  • [45] Deep Multi-Agent Reinforcement Learning: A Survey
    Liang X.-X.
    Feng Y.-H.
    Ma Y.
    Cheng G.-Q.
    Huang J.-C.
    Wang Q.
    Zhou Y.-Z.
    Liu Z.
    Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (12): : 2537 - 2557
  • [46] Lenient Multi-Agent Deep Reinforcement Learning
    Palmer, Gregory
    Tuyls, Karl
    Bloembergen, Daan
    Savani, Rahul
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 443 - 451
  • [47] Multi-agent deep reinforcement learning: a survey
    Gronauer, Sven
    Diepold, Klaus
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (02) : 895 - 943
  • [48] Learning to Communicate with Deep Multi-Agent Reinforcement Learning
    Foerster, Jakob N.
    Assael, Yannis M.
    de Freitas, Nando
    Whiteson, Shimon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [49] Self-organizing multi-agent systems by means of Scout Movement
    Paletta, Mauricio
    Recent Patents on Computer Science, 2012, 5 (03): : 197 - 210
  • [50] Characterizing complex behavior in (self-organizing) multi-agent systems
    Hu, BC
    Liu, JM
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 2, 2005, 3481 : 1274 - 1283