An efficient and adaptive design of reinforcement learning environment to solve job shop scheduling problem with soft actor-critic algorithm

被引:0
|
作者
Si, Jinghua [1 ]
Li, Xinyu [1 ,2 ]
Gao, Liang [1 ]
Li, Peigen [1 ]
机构
[1] Huazhong Univ Sci & Technol, State Key Lab Digital Mfg Equipment & Technol, Wuhan, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, 1037 Luoyu Rd, Wuhan 430074, Hubei, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Job shop scheduling; reinforcement learning; environment design; multi-agent architecture; soft actor critic; BENCHMARKS;
D O I
10.1080/00207543.2024.2335663
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Shop scheduling is deeply involved in manufacturing. In order to improve the efficiency of scheduling and fit dynamic scenarios, many Deep Reinforcement Learning (DRL) methods are studied to solve scheduling problems like job shop and flow shop. But most studies focus on using the latest algorithms while ignoring that the environment plays an important role in agent learning. In this paper, we design an effective, robust and size-agnostic environment for job shop scheduling. The proposed design of environment uses centralised training and decentralised execution (CTDE) to implement a multi-agent architecture. Together with the observation space we design, environmental information that is irrelevant to the current decision is eliminated as much as possible. The proposed action space enlarges the decision space of agents, which performs better than the traditional way. Finally, Soft Actor-Critic (SAC) algorithm is adapted to learning within this environment. By comparing with traditional scheduling rules, other reinforcement learning algorithms, and relevant literature, the superiority of the results obtained in this study is demonstrated.
引用
收藏
页码:8260 / 8275
页数:16
相关论文
共 50 条
  • [31] Optimal scheduling strategy of electricity and thermal energy storage based on soft actor-critic reinforcement learning approach
    Zheng, Yingying
    Wang, Hui
    Wang, Jinglong
    Wang, Zichong
    Zhao, Yongning
    [J]. JOURNAL OF ENERGY STORAGE, 2024, 92
  • [32] Adaptive Assist-as-needed Control Based on Actor-Critic Reinforcement Learning
    Zhang, Yufeng
    Li, Shuai
    Nolan, Karen J.
    Zanotto, Damiano
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4066 - 4071
  • [33] Simple genetic algorithm to solve the Job Shop Scheduling Problem
    Jiménez-Carrión M.
    [J]. Jiménez-Carrión, Miguel (mjimenezc@gmail.com), 2018, Centro de Informacion Tecnologica (29): : 299 - 313
  • [34] A hybrid evolutionary algorithm to solve the job shop scheduling problem
    T. C. E. Cheng
    Bo Peng
    Zhipeng Lü
    [J]. Annals of Operations Research, 2016, 242 : 223 - 237
  • [35] A hybrid evolutionary algorithm to solve the job shop scheduling problem
    Cheng, T. C. E.
    Peng, Bo
    Lu, Zhipeng
    [J]. ANNALS OF OPERATIONS RESEARCH, 2016, 242 (02) : 223 - 237
  • [36] Exponential TD Learning: A Risk-Sensitive Actor-Critic Reinforcement Learning Algorithm
    Noorani, Erfaun
    Mavridis, Christos N.
    Baras, John S.
    [J]. 2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 4104 - 4109
  • [37] Application of actor-critic learning algorithm for optimal bidding problem of a Genco
    Gajjar, GR
    Khaparde, SA
    Nagaraju, P
    Soman, SA
    [J]. 2003 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS 1-4, CONFERENCE PROCEEDINGS, 2003, : 818 - 818
  • [38] Application of actor-critic learning algorithm for optimal bidding problem of a Genco
    Gajjar, GR
    Khaparde, SA
    Nagaraju, P
    Soman, SA
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2003, 18 (01) : 11 - 18
  • [39] Network Congestion Control Algorithm Based on Actor-Critic Reinforcement Learning Model
    Xu, Tao
    Gong, Lina
    Zhang, Wei
    Li, Xuhong
    Wang, Xia
    Pan, Wenwen
    [J]. ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [40] A bounded actor-critic reinforcement learning algorithm applied to airline revenue management
    Lawhead, Ryan J.
    Gosavi, Abhijit
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 82 : 252 - 262