Deep Reinforcement Learning for Solving AGVs Routing Problem

被引：4

作者：

Lu, Chengxuan ^{[1
]}

Long, Jinjun ^{[2
]}

Xing, Zichao ^{[1
]}

Wu, Weimin ^{[1
]}

Gu, Yong ^{[1
]}

Luo, Jiliang ^{[3
]}

Huang, Yisheng ^{[4
]}

机构：

[1] Zhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Hangzhou 310027, Zhejiang, Peoples R China

[2] KENGIC Intelligent Equipment Co Ltd, Qingdao 266111, Shandong, Peoples R China

[3] Huaqiao Univ, Dept Control Sci & Engn, Xiamen 361021, Fujian, Peoples R China

[4] Ilan Univ, Dept Elect Engn, Yilan 26047, Taiwan

来源：

VERIFICATION AND EVALUATION OF COMPUTER AND COMMUNICATION SYSTEMS, VECOS 2020 | 2020年 / 12519卷

基金：

国家重点研发计划;

关键词：

AGVs routing problem; Real-time routing; Asynchronous deep Q-network; Embedding; SYSTEMS; GAME; GO;

D O I：

10.1007/978-3-030-65955-4_16

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The routing of automated guided vehicles (AGVs) is playing an increasingly important role in modern logistics. AGVs routing problem is a complex combinatorial optimization problem. It fails to get the desired results of solving this problem using meta-heuristic algorithms due to its high real-time demand. Large AGVs systems in engineering are usually simplified by adding regulations, which may lead to getting only sub-optimal solutions. In this paper, we present a deep reinforcement learning algorithm to solve the AGVs routing problem. Firstly, the AGVs routing problem is modeled by a Markov decision process (MDP), enabling real-time routing. Secondly, according to the properties of the working scene of AGVs, asynchronous DQN (deep Q-network) is exploited to serve as the base framework of reinforcement learning. More importantly, the map of the working scene is discretized and represented using the embedding technique. Compared with one-hot mode, the input size of the embedding mode is much smaller, greatly improving the training speed. The extracted embeddings are built into conflict vectors, which are finally processed by LSTM (long short-term memory). Experiments show that the proposed algorithm has effectiveness both in real-time responding speed and getting high-quality solutions.

引用

页码：222 / 236

页数：15

共 50 条

[1] Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem
Li, Jingwen
Ma, Yining
Gao, Ruize
Cao, Zhiguang
Lim, Andrew
Song, Wen
Zhang, Jie
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13572 - 13585
[2] Reinforcement Learning for Solving the Vehicle Routing Problem
Nazari, Mohammadreza
Oroojlooy, Afshin
Takac, Martin
Snyder, Lawrence V.
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[3] Solving the Vehicle Routing Problem with Stochastic Travel Cost Using Deep Reinforcement Learning
Cai, Hao
Xu, Peng
Tang, Xifeng
Lin, Gan
[J]. ELECTRONICS, 2024, 13 (16)
[4] Reinforcement Learning for Solving Stochastic Vehicle Routing Problem
Iklassov, Zangir
Sobirov, Ikboljon
Solozabal, Ruben
Takac, Martin
[J]. ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
[5] Deep Reinforcement Learning for Solving Vehicle Routing Problems With Backhauls
Wang, Conghui
Cao, Zhiguang
Wu, Yaoxin
Teng, Long
Wu, Guohua
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
[6] Solving Permutation Flowshop Problem with Deep Reinforcement Learning
Pan, Ruyuan
Dong, Xingye
Han, Sheng
[J]. 2020 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-BESANCON 2020), 2020, : 349 - 353
[7] Deep Graph Reinforcement Learning for Solving Multicut Problem
Li, Zhenchen
Yang, Xu
Zhang, Yanchao
Zeng, Shaofeng
Yuan, Jingbin
Liu, Jiazheng
Liu, Zhiyong
Han, Hua
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[8] Reinforcement Learning for Solving Multiple Vehicle Routing Problem with Time Window
Zong, Zefang
Tong, Xia
Zheng, Meng
Li, Yong
[J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2024, 15 (02)
[9] RL SolVeR Pro: Reinforcement Learning for Solving Vehicle Routing Problem
Kalakanti, Arun Kumar
Verma, Shivani
Paul, Topon
Yoshida, Takufumi
[J]. 2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA SCIENCES (AIDAS2019), 2019, : 94 - 99
[10] Deep reinforcement learning for the dynamic and uncertain vehicle routing problem
Pan, Weixu
Liu, Shi Qiang
[J]. APPLIED INTELLIGENCE, 2023, 53 (01) : 405 - 422

← 1 2 3 4 5 →