An efficient and adaptive design of reinforcement learning environment to solve job shop scheduling problem with soft actor-critic algorithm

被引:0
|
作者
Si, Jinghua [1 ]
Li, Xinyu [1 ,2 ]
Gao, Liang [1 ]
Li, Peigen [1 ]
机构
[1] Huazhong Univ Sci & Technol, State Key Lab Digital Mfg Equipment & Technol, Wuhan, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, 1037 Luoyu Rd, Wuhan 430074, Hubei, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Job shop scheduling; reinforcement learning; environment design; multi-agent architecture; soft actor critic; BENCHMARKS;
D O I
10.1080/00207543.2024.2335663
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Shop scheduling is deeply involved in manufacturing. In order to improve the efficiency of scheduling and fit dynamic scenarios, many Deep Reinforcement Learning (DRL) methods are studied to solve scheduling problems like job shop and flow shop. But most studies focus on using the latest algorithms while ignoring that the environment plays an important role in agent learning. In this paper, we design an effective, robust and size-agnostic environment for job shop scheduling. The proposed design of environment uses centralised training and decentralised execution (CTDE) to implement a multi-agent architecture. Together with the observation space we design, environmental information that is irrelevant to the current decision is eliminated as much as possible. The proposed action space enlarges the decision space of agents, which performs better than the traditional way. Finally, Soft Actor-Critic (SAC) algorithm is adapted to learning within this environment. By comparing with traditional scheduling rules, other reinforcement learning algorithms, and relevant literature, the superiority of the results obtained in this study is demonstrated.
引用
收藏
页码:8260 / 8275
页数:16
相关论文
共 50 条
  • [41] Hybrid actor-critic algorithm for quantum reinforcement learning at CERN beam lines
    Schenk, Michael
    Combarro, Elias F.
    Grossi, Michele
    Kain, Verena
    Li, Kevin Shing Bruce
    Popa, Mircea-Marian
    Vallecorsa, Sofia
    [J]. QUANTUM SCIENCE AND TECHNOLOGY, 2024, 9 (02)
  • [42] DAG-based workflows scheduling using Actor-Critic Deep Reinforcement Learning
    Koslovski, Guilherme Piegas
    Pereira, Kleiton
    Albuquerque, Paulo Roberto
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 150 : 354 - 363
  • [43] Optimal Scheduling of Regional Integrated Energy System Based on Advantage Learning Soft Actor-critic Algorithm and Transfer Learning
    Luo W.
    Zhang J.
    He Y.
    Gu T.
    Nie X.
    Fan L.
    Yuan X.
    Li B.
    [J]. Dianwang Jishu/Power System Technology, 2023, 47 (04): : 1601 - 1611
  • [44] Hardware-in-the-Loop Soft Robotic Testing Framework Using an Actor-Critic Deep Reinforcement Learning Algorithm
    Marquez, Jesus
    Sullivan, Charles
    Price, Ryan M.
    Roberts, Robert C.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (09) : 6076 - 6082
  • [45] Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
    Haarnoja, Tuomas
    Zhou, Aurick
    Abbeel, Pieter
    Levine, Sergey
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [46] Soft Actor-Critic Reinforcement Learning-Based Optimization for Analog Circuit Sizing
    Park, Sejin
    Choi, Youngchang
    Kang, Seokhyeong
    [J]. 2023 20TH INTERNATIONAL SOC DESIGN CONFERENCE, ISOCC, 2023, : 47 - 48
  • [47] Adaptive Memetic Algorithm for the Job Shop Scheduling Problem
    Nalepa, Jakub
    Cwiek, Marcin
    Kawulok, Michal
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [48] A model-based hybrid soft actor-critic deep reinforcement learning algorithm for optimal ventilator settings
    Chen, Shaotao
    Qiu, Xihe
    Tan, Xiaoyu
    Fang, Zhijun
    Jin, Yaochu
    [J]. INFORMATION SCIENCES, 2022, 611 : 47 - 64
  • [49] Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision
    Bjorck, Johan
    Chen, Xiangyu
    De Sa, Christopher
    Gomes, Carla P.
    Weinberger, Kilian Q.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [50] Soft Actor-Critic Deep Reinforcement Learning for Fault-Tolerant Flight Control
    Dally, Killian
    van Kampen, Erik-Jan
    [J]. arXiv, 2022,