Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems

被引：120

作者：

Liu, Chien-Liang ^{[1
]}

Chang, Chuan-Chin ^{[1
]}

Tseng, Chun-Jan ^{[1
]}

机构：

[1] Natl Chiao Tung Univ, Dept Ind Engn & Management, Hsinchu 30010, Taiwan

来源：

IEEE ACCESS | 2020年 / 8卷

关键词：

Job shop scheduling; Machine learning; Benchmark testing; Dynamic scheduling; Learning (artificial intelligence); Training; Optimization; Job shop scheduling problem ([!text type='JS']JS[!/text]SP); deep reinforcement learning; actor-critic network; parallel training; OPTIMIZATION; SEARCH; LEVEL; GAME; GO;

D O I：

10.1109/ACCESS.2020.2987820

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the past decades, many optimization methods have been devised and applied to job shop scheduling problem (JSSP) to find the optimal solution. Many methods assumed that the scheduling results were applied to static environments, but the whole environments in the real world are always dynamic. Moreover, many unexpected events such as machine breakdowns and material problems may be present to adversely affect the initial job scheduling. This work views JSSP as a sequential decision making problem and proposes to use deep reinforcement learning to cope with this problem. The combination of deep learning and reinforcement learning avoids handcraft features as used in traditional reinforcement learning, and it is expected that the combination will make the whole learning phase more efficient. Our proposed model comprises actor network and critic network, both including convolution layers and fully connected layer. Actor network agent learns how to behave in different situations, while critic network helps agent evaluate the value of statement then return to actor network. This work proposes a parallel training method, combining asynchronous update as well as deep deterministic policy gradient (DDPG), to train the model. The whole network is trained with parallel training on a multi-agent environment and different simple dispatching rules are considered as actions. We evaluate our proposed model on more than ten instances that are present in a famous benchmark problem library - OR library. The evaluation results indicate that our method is comparative in static JSSP benchmark problems, and achieves a good balance between makespan and execution time in dynamic environments. Scheduling score of our method is 91.12% in static JSSP benchmark problems, and 80.78% in dynamic environments.

引用

页码：71752 / 71762

页数：11

共 50 条

[1] An effective deep actor-critic reinforcement learning method for solving the flexible job shop scheduling problem
Lanjun Wan
Xueyan Cui
Haoxin Zhao
Changyun Li
Zhibing Wang
[J]. Neural Computing and Applications, 2024, 36 (20) : 11877 - 11899
[2] An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems
Zhao, Cong
Deng, Na
[J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) : 1445 - 1471
[3] Solving job shop scheduling problems via deep reinforcement learning
Yuan, Erdong
Cheng, Shuli
Wang, Liejun
Song, Shiji
Wu, Fang
[J]. APPLIED SOFT COMPUTING, 2023, 143
[4] Integrated Actor-Critic for Deep Reinforcement Learning
Zheng, Jiaohao
Kurt, Mehmet Necip
Wang, Xiaodong
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 505 - 518
[5] Solving flexible job shop scheduling problems via deep reinforcement learning
Yuan, Erdong
Wang, Liejun
Cheng, Shuli
Song, Shiji
Fan, Wei
Li, Yongming
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
[6] An efficient and adaptive design of reinforcement learning environment to solve job shop scheduling problem with soft actor-critic algorithm
Si, Jinghua
Li, Xinyu
Gao, Liang
Li, Peigen
[J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024,
[7] Visual Navigation with Actor-Critic Deep Reinforcement Learning
Shao, Kun
Zhao, Dongbin
Zhu, Yuanheng
Zhang, Qichao
[J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
[8] Deep Actor-Critic Reinforcement Learning for Anomaly Detection
Zhong, Chen
Gursoy, M. Cenk
Velipasalar, Senem
[J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[9] Averaged Soft Actor-Critic for Deep Reinforcement Learning
Ding, Feng
Ma, Guanfeng
Chen, Zhikui
Gao, Jing
Li, Peng
[J]. COMPLEXITY, 2021, 2021
[10] THE APPLICATION OF ACTOR-CRITIC REINFORCEMENT LEARNING FOR FAB DISPATCHING SCHEDULING
Kim, Namyong
Shin, IIayong
[J]. 2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 4570 - 4571

← 1 2 3 4 5 →