Deep reinforcement learning for solving the joint scheduling problem of machines and AGVs in job shop

Cited by: 0

Authors:

Sun A.-H. [1]

Lei Q. [1]

Song Y.-C. [1]

Yang Y.-F. [1]

Affiliations:

[1] State Key Laboratory of Mechanical Transmission, Chongqing University, Chongqing

Source:

Lei, Qi (leiqi@cqu.edu.cn) | Year: 1600 / Volume: Northeast University / Issue: 39

Keywords:

automated guided vehicle; deep reinforcement learning; job shop scheduling; joint scheduling; Markov decision process; proximal policy optimization

DOI:

10.13195/j.kzyjc.2022.1821

Chinese Library Classification number:

Subject classification number:

Abstract:

To address the joint scheduling of automated guided vehicles (AGVs) and machines in a job shop, an integrated algorithm framework based on a convolutional neural network and deep reinforcement learning is proposed, with the objective of minimizing the completion time. First, the job shop scheduling disjunctive graph that includes the AGV is analyzed, and the problem is transformed into a sequential decision problem formulated as a Markov decision process. Then, according to the characteristics of the problem, a spatial state and five direct state features based on the disjunctive graph are designed. The action space is two-dimensional, covering operation selection and AGV assignment. Because processing times and effective transportation times in the job shop are fixed values, a reward function is constructed on that basis to guide the agent's learning. Finally, a 2D-PPO algorithm for the two-dimensional action space is designed for training, so that joint scheduling decisions for the AGVs and machines can be made quickly. Case studies show that the scheduling algorithm based on 2D-PPO has good learning performance and scalability. © 2024 Northeast University. All rights reserved.
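
The abstract does not specify how the two-dimensional action is sampled or how 2D-PPO combines its two dimensions. As a minimal sketch only, assuming a policy with two independent categorical heads (one over operations, one over AGVs), feasibility masks, and the standard PPO clipped surrogate, the idea might look like the following. All function names, the masking scheme, and the independence assumption are hypothetical and not taken from the paper.

```python
import math
import random


def masked_softmax(logits, mask):
    """Softmax over entries where mask is True; masked entries get probability 0."""
    valid = [l for l, m in zip(logits, mask) if m]
    if not valid:
        raise ValueError("at least one action must be feasible")
    peak = max(valid)  # subtract the max for numerical stability
    exps = [math.exp(l - peak) if m else 0.0 for l, m in zip(logits, mask)]
    total = sum(exps)
    return [e / total for e in exps]


def sample_joint_action(op_logits, op_mask, agv_logits, agv_mask, rng):
    """Sample an (operation, AGV) pair from two independent heads (an assumption).

    Returns the pair and its joint log-probability, which is the sum of the
    two per-head log-probabilities under the independence assumption.
    """
    op_probs = masked_softmax(op_logits, op_mask)
    agv_probs = masked_softmax(agv_logits, agv_mask)
    op = rng.choices(range(len(op_probs)), weights=op_probs)[0]
    agv = rng.choices(range(len(agv_probs)), weights=agv_probs)[0]
    log_prob = math.log(op_probs[op]) + math.log(agv_probs[agv])
    return (op, agv), log_prob


def ppo_clip_term(new_log_prob, old_log_prob, advantage, eps=0.2):
    """Standard PPO clipped surrogate for one sample (to be maximized)."""
    ratio = math.exp(new_log_prob - old_log_prob)
    clipped = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped * advantage)


if __name__ == "__main__":
    rng = random.Random(0)
    # Three candidate operations (the second is infeasible) and two AGVs.
    action, log_prob = sample_joint_action(
        [0.1, 2.0, -1.0], [True, False, True], [0.0, 0.0], [True, True], rng
    )
    print(action, log_prob, ppo_clip_term(log_prob, log_prob, 1.0))
```

Under this assumption the joint probability ratio used by PPO is the product of the per-head ratios, so a single clipped objective covers both action dimensions; the paper's actual design may differ.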

References

Pages: 253-262

Page count: 9

18 entries in total

[1] Bilge U, Ulusoy G., A time window approach to simultaneous scheduling of machines and material handling system in an FMS, Operations Research, 43, 6, pp. 1058-1070, (1995)
[2] Xie C, Allen T T., Simulation and experimental design methods for job shop scheduling with material handling: A survey, The International Journal of Advanced Manufacturing Technology, 80, 1, pp. 233-243, (2015)
[3] Erol R, Sahin C, Baykasoglu A, et al., A multi-agent based approach to dynamic scheduling of machines and automated guided vehicles in manufacturing systems, Applied Soft Computing, 12, 6, pp. 1720-1732, (2012)
[4] Geng K F, Ye C M., Joint scheduling of machines and AGVs in green hybrid flow shop with missing operations, Control and Decision, 37, 10, pp. 2723-2732, (2022)
[5] Ren W, Yan Y, Hu Y, et al., Joint optimisation for dynamic flexible job-shop scheduling problem with transportation time and resource constraints, International Journal of Production Research, 60, 18, pp. 5675-5696, (2021)
[6] Zhang Y F, Guo Z G, Lv J X, et al., A framework for smart production-logistics systems based on CPS and industrial IoT, IEEE Transactions on Industrial Informatics, 14, 9, pp. 4019-4032, (2018)
[7] Guo Z G, Zhang Y F, Zhao X B, et al., CPS-based self-adaptive collaborative control for smart production-logistics systems, IEEE Transactions on Cybernetics, 51, 1, pp. 188-198, (2021)
[8] Ham A., Transfer-robot task scheduling in job shop, International Journal of Production Research, 59, 3, pp. 813-823, (2021)
[9] Wolpert D H, Macready W G., No free lunch theorems for optimization, IEEE Transactions on Evolutionary Computation, 1, 1, pp. 67-82, (1997)
[10] Wang L, Pan Z X., Scheduling optimization for flow-shop based on deep reinforcement learning and iterative greedy method, Control and Decision, 36, 11, pp. 2609-2617, (2021)