Dynamic production scheduling towards self-organizing mass personalization: A multi-agent dueling deep reinforcement learning approach

被引：28

作者：

Qin, Zhaojun ^{[1
]}

Johnson, Dazzle ^{[1
]}

Lu, Yuqian ^{[1
]}

机构：

[1] Univ Auckland, Dept Mech & Mechatron Engn, Auckland, New Zealand

来源：

JOURNAL OF MANUFACTURING SYSTEMS | 2023年 / 68卷

关键词：

Mass personalization; Self-organizing manufacturing network; Dynamic flexible job shop scheduling problem; Multi-agent production scheduling; Reinforcement learning; OF-THE-ART; MANUFACTURING SYSTEMS; MACHINE BREAKDOWNS; GENETIC ALGORITHMS; WORKLOAD CONTROL; BOND GRAPHS; SHOP; AGENT; ARCHITECTURE; OPTIMIZATION;

D O I：

10.1016/j.jmsy.2023.03.003

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Mass personalization is rapidly approaching. In response, manufacturing systems should be capable of autono-mously changing production plans, configurations and schedules under dynamic manufacturing environments for producing personalized products. Self-organizing manufacturing network is a promising paradigm for mass personalization. The backbone of a self-organizing manufacturing network is an adaptive production scheduling method to dynamically allocate and sequence manufacturing jobs under dynamic settings, such as stochastic processing time or unplanned machine breakdown. However, existing production scheduling methods (i.e., heuristic rules, meta-heuristic algorithms, and existing reinforcement learning models) fail to automatically optimize production schedules while maintaining stable manufacturing performance, under dynamic settings. In this paper, we designed a reinforcement learning-based static-training-dynamic-execution approach for dynamic job shop scheduling problems. The scheduling policies are learned from static scheduling instances by a multi -agent dueling deep reinforcement learning approach. Under this approach, we proposed new representations of observation, action, reward, and cooperation mechanisms between agents. The learned scheduling policies are then deployed to a dynamic scheduling system where stochastic processing time and unplanned machine breakdown randomly occur. Extensive simulation experiments demonstrated that our approach outperforms heuristic rules on makespan under two dynamic manufacturing settings.

引用

页码：242 / 257

页数：16

共 50 条

[41] HALFTONING WITH MULTI-AGENT DEEP REINFORCEMENT LEARNING
Jiang, Haitian
Xiong, Dongliang
Jiang, Xiaowen
Yin, Aiguo
Ding, Li
Huang, Kai
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 641 - 645
[42] Deep reinforcement learning for multi-agent interaction
Ahmed, Ibrahim H.
Brewitt, Cillian
Carlucho, Ignacio
Christianos, Filippos
Dunion, Mhairi
Fosong, Elliot
Garcin, Samuel
Guo, Shangmin
Gyevnar, Balint
McInroe, Trevor
Papoudakis, Georgios
Rahman, Arrasy
Schafer, Lukas
Tamborski, Massimiliano
Vecchio, Giuseppe
Wang, Cheng
Albrecht, Stefano, V
AI COMMUNICATIONS, 2022, 35 (04) : 357 - 368
[43] Application of Multi-agent Reinforcement Learning to the Dynamic Scheduling Problem in Manufacturing Systems
Heik, David
Bahrpeyma, Fouad
Reichelt, Dirk
MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT II, 2024, 14506 : 237 - 254
[44] Multi-agent deep reinforcement learning: a survey
Sven Gronauer
Klaus Diepold
Artificial Intelligence Review, 2022, 55 : 895 - 943
[45] Deep Multi-Agent Reinforcement Learning: A Survey
Liang X.-X.
Feng Y.-H.
Ma Y.
Cheng G.-Q.
Huang J.-C.
Wang Q.
Zhou Y.-Z.
Liu Z.
Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (12): : 2537 - 2557
[46] Lenient Multi-Agent Deep Reinforcement Learning
Palmer, Gregory
Tuyls, Karl
Bloembergen, Daan
Savani, Rahul
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 443 - 451
[47] Multi-agent deep reinforcement learning: a survey
Gronauer, Sven
Diepold, Klaus
ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (02) : 895 - 943
[48] Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Foerster, Jakob N.
Assael, Yannis M.
de Freitas, Nando
Whiteson, Shimon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[49] Self-organizing multi-agent systems by means of Scout Movement
Paletta, Mauricio
Recent Patents on Computer Science, 2012, 5 (03): : 197 - 210
[50] Characterizing complex behavior in (self-organizing) multi-agent systems
Hu, BC
Liu, JM
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 2, 2005, 3481 : 1274 - 1283

← 1 2 3 4 5 →