Deep Reinforcement Learning for Solving AGVs Routing Problem

被引:4
|
作者
Lu, Chengxuan [1 ]
Long, Jinjun [2 ]
Xing, Zichao [1 ]
Wu, Weimin [1 ]
Gu, Yong [1 ]
Luo, Jiliang [3 ]
Huang, Yisheng [4 ]
机构
[1] Zhejiang Univ, Inst Cyber Syst & Control, State Key Lab Ind Control Technol, Hangzhou 310027, Zhejiang, Peoples R China
[2] KENGIC Intelligent Equipment Co Ltd, Qingdao 266111, Shandong, Peoples R China
[3] Huaqiao Univ, Dept Control Sci & Engn, Xiamen 361021, Fujian, Peoples R China
[4] Ilan Univ, Dept Elect Engn, Yilan 26047, Taiwan
基金
国家重点研发计划;
关键词
AGVs routing problem; Real-time routing; Asynchronous deep Q-network; Embedding; SYSTEMS; GAME; GO;
D O I
10.1007/978-3-030-65955-4_16
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The routing of automated guided vehicles (AGVs) is playing an increasingly important role in modern logistics. AGVs routing problem is a complex combinatorial optimization problem. It fails to get the desired results of solving this problem using meta-heuristic algorithms due to its high real-time demand. Large AGVs systems in engineering are usually simplified by adding regulations, which may lead to getting only sub-optimal solutions. In this paper, we present a deep reinforcement learning algorithm to solve the AGVs routing problem. Firstly, the AGVs routing problem is modeled by a Markov decision process (MDP), enabling real-time routing. Secondly, according to the properties of the working scene of AGVs, asynchronous DQN (deep Q-network) is exploited to serve as the base framework of reinforcement learning. More importantly, the map of the working scene is discretized and represented using the embedding technique. Compared with one-hot mode, the input size of the embedding mode is much smaller, greatly improving the training speed. The extracted embeddings are built into conflict vectors, which are finally processed by LSTM (long short-term memory). Experiments show that the proposed algorithm has effectiveness both in real-time responding speed and getting high-quality solutions.
引用
收藏
页码:222 / 236
页数:15
相关论文
共 50 条
  • [1] Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem
    Li, Jingwen
    Ma, Yining
    Gao, Ruize
    Cao, Zhiguang
    Lim, Andrew
    Song, Wen
    Zhang, Jie
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13572 - 13585
  • [2] Reinforcement Learning for Solving the Vehicle Routing Problem
    Nazari, Mohammadreza
    Oroojlooy, Afshin
    Takac, Martin
    Snyder, Lawrence V.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] Solving the Vehicle Routing Problem with Stochastic Travel Cost Using Deep Reinforcement Learning
    Cai, Hao
    Xu, Peng
    Tang, Xifeng
    Lin, Gan
    [J]. ELECTRONICS, 2024, 13 (16)
  • [4] Reinforcement Learning for Solving Stochastic Vehicle Routing Problem
    Iklassov, Zangir
    Sobirov, Ikboljon
    Solozabal, Ruben
    Takac, Martin
    [J]. ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [5] Deep Reinforcement Learning for Solving Vehicle Routing Problems With Backhauls
    Wang, Conghui
    Cao, Zhiguang
    Wu, Yaoxin
    Teng, Long
    Wu, Guohua
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [6] Solving Permutation Flowshop Problem with Deep Reinforcement Learning
    Pan, Ruyuan
    Dong, Xingye
    Han, Sheng
    [J]. 2020 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-BESANCON 2020), 2020, : 349 - 353
  • [7] Deep Graph Reinforcement Learning for Solving Multicut Problem
    Li, Zhenchen
    Yang, Xu
    Zhang, Yanchao
    Zeng, Shaofeng
    Yuan, Jingbin
    Liu, Jiazheng
    Liu, Zhiyong
    Han, Hua
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [8] Reinforcement Learning for Solving Multiple Vehicle Routing Problem with Time Window
    Zong, Zefang
    Tong, Xia
    Zheng, Meng
    Li, Yong
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2024, 15 (02)
  • [9] RL SolVeR Pro: Reinforcement Learning for Solving Vehicle Routing Problem
    Kalakanti, Arun Kumar
    Verma, Shivani
    Paul, Topon
    Yoshida, Takufumi
    [J]. 2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA SCIENCES (AIDAS2019), 2019, : 94 - 99
  • [10] Deep reinforcement learning for the dynamic and uncertain vehicle routing problem
    Pan, Weixu
    Liu, Shi Qiang
    [J]. APPLIED INTELLIGENCE, 2023, 53 (01) : 405 - 422