An efficient and adaptive design of reinforcement learning environment to solve job shop scheduling problem with soft actor-critic algorithm

被引:0
|
作者
Si, Jinghua [1 ]
Li, Xinyu [1 ,2 ]
Gao, Liang [1 ]
Li, Peigen [1 ]
机构
[1] Huazhong Univ Sci & Technol, State Key Lab Digital Mfg Equipment & Technol, Wuhan, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, 1037 Luoyu Rd, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Job shop scheduling; reinforcement learning; environment design; multi-agent architecture; soft actor critic; BENCHMARKS;
D O I
10.1080/00207543.2024.2335663
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Shop scheduling is deeply involved in manufacturing. In order to improve the efficiency of scheduling and fit dynamic scenarios, many Deep Reinforcement Learning (DRL) methods are studied to solve scheduling problems like job shop and flow shop. But most studies focus on using the latest algorithms while ignoring that the environment plays an important role in agent learning. In this paper, we design an effective, robust and size-agnostic environment for job shop scheduling. The proposed design of environment uses centralised training and decentralised execution (CTDE) to implement a multi-agent architecture. Together with the observation space we design, environmental information that is irrelevant to the current decision is eliminated as much as possible. The proposed action space enlarges the decision space of agents, which performs better than the traditional way. Finally, Soft Actor-Critic (SAC) algorithm is adapted to learning within this environment. By comparing with traditional scheduling rules, other reinforcement learning algorithms, and relevant literature, the superiority of the results obtained in this study is demonstrated.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems
    Liu, Chien-Liang
    Chang, Chuan-Chin
    Tseng, Chun-Jan
    [J]. IEEE ACCESS, 2020, 8 : 71752 - 71762
  • [2] An effective deep actor-critic reinforcement learning method for solving the flexible job shop scheduling problem
    Wan, Lanjun
    Cui, Xueyan
    Zhao, Haoxin
    Li, Changyun
    Wang, Zhibing
    [J]. Neural Computing and Applications, 2024, 36 (20) : 11877 - 11899
  • [3] An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents
    Monaci, Marta
    Agasucci, Valerio
    Grani, Giorgio
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 312 (03) : 910 - 926
  • [4] An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems
    Zhao, Cong
    Deng, Na
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) : 1445 - 1471
  • [5] A soft actor-critic reinforcement learning algorithm for network intrusion detection
    Li, Zhengfa
    Huang, Chuanhe
    Deng, Shuhua
    Qiu, Wanyu
    Gao, Xieping
    [J]. COMPUTERS & SECURITY, 2023, 135
  • [6] A modified actor-critic reinforcement learning algorithm
    Mustapha, SM
    Lachiver, G
    [J]. 2000 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS 1 AND 2: NAVIGATING TO A NEW ERA, 2000, : 605 - 609
  • [7] AN ACTOR-CRITIC REINFORCEMENT LEARNING ALGORITHM BASED ON ADAPTIVE RBF NETWORK
    Li, Chun-Gui
    Wang, Meng
    Huang, Zhen-Jin
    Zhang, Zeng-Fang
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 984 - 988
  • [8] Averaged Soft Actor-Critic for Deep Reinforcement Learning
    Ding, Feng
    Ma, Guanfeng
    Chen, Zhikui
    Gao, Jing
    Li, Peng
    [J]. COMPLEXITY, 2021, 2021
  • [9] THE APPLICATION OF ACTOR-CRITIC REINFORCEMENT LEARNING FOR FAB DISPATCHING SCHEDULING
    Kim, Namyong
    Shin, IIayong
    [J]. 2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 4570 - 4571
  • [10] A deep residual reinforcement learning algorithm based on Soft Actor-Critic for autonomous navigation
    Wen, Shuhuan
    Shu, Yili
    Rad, Ahmad
    Wen, Zeteng
    Guo, Zhengzheng
    Gong, Simeng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259