A Multi-Agent Reinforcement Learning Method With Route Recorders for Vehicle Routing in Supply Chain Management

被引:18
|
作者
Ren, Lei [1 ,2 ]
Fan, Xiaoyang [1 ,2 ]
Cui, Jin [3 ,4 ]
Shen, Zhen [5 ]
Lv, Yisheng [5 ]
Xiong, Gang [6 ,7 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Beijing Adv Innovat Ctr Big Data Based Precis Med, Beijing 100191, Peoples R China
[3] Beihang Univ, Res Inst Frontier Sci, Beijing 100191, Peoples R China
[4] Beihang Univ, Ningbo Inst Technol, Ningbo 315800, Peoples R China
[5] Chinese Acad Sci, State Key Lab Management & Control Complex Syst, Inst Automat, Beijing 100190, Peoples R China
[6] Chinese Acad Sci, Inst Automat, Beijing Engn Res Ctr Intelligent Syst & Technol, Beijing 100190, Peoples R China
[7] Chinese Acad Sci, Cloud Comp Ctr, Dongguan 523808, Peoples R China
关键词
Reinforcement learning; Costs; Task analysis; Transportation; Optimization; Computational modeling; Vehicle routing; supply chain management; multi-agent reinforcement learning (MARL); route recorder; OPTIMIZATION; ALGORITHM; NUMBER;
D O I
10.1109/TITS.2022.3150151
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In the modern supply chain system, large-scale transportation tasks require the collaborative work of multiple vehicles to be completed on time. Over the past few decades, multi-vehicle route planning was mainly implemented by heuristic algorithms. However, these algorithms face the dilemma of long computation time. In recent years, some machine learning-based methods are also proposed for vehicle route planning, but the existing algorithms can hardly solve multi-vehicle time-sensitive problems. To overcome this problem, we propose a novel multi-agent reinforcement learning model, which optimizes the route length and the vehicle's arrival time simultaneously. The model is based on the encoder-decoder framework. The encoder mines the relationship between the customer nodes in the problem, and the decoder generates the route of each vehicle iteratively. Specially, we design multiple route recorders to extract the route history information of vehicles and realize the communication between them. In the inferring phase, the model could immediately generate routes for all vehicles in a new instance. To further improve the performance of the model, we devise a multi-sampling strategy and obtain the balance boundary between computation time and performance improvement. In addition, we propose a simulation-based vehicle configuration method to select the optimal number of vehicles in real applications. For validation, we conduct a series of experiments on problems with different customer amounts and various vehicle numbers. The results show that the proposed model outperforms other typical algorithms in both performance and calculation time.
引用
收藏
页码:16410 / 16420
页数:11
相关论文
共 50 条
  • [1] A Supply Chain Inventory Management Method for Civil Aircraft Manufacturing Based on Multi-Agent Reinforcement Learning
    Piao, Mingjie
    Zhang, Dongdong
    Lu, Hu
    Li, Rupeng
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [2] Multi-agent reinforcement learning for electric vehicle decarbonized routing and scheduling
    Wang, Yi
    Qiu, Dawei
    He, Yinglong
    Zhou, Quan
    Strbac, Goran
    [J]. ENERGY, 2023, 284
  • [3] Route planning model of multi-agent system for a supply chain management
    Zolfpour-Arokhlo, Mortaza
    Selamat, Ali
    Hashim, Siti Zaiton Mohd
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) : 1505 - 1518
  • [4] Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach
    Mak, Stephen
    Xu, Liming
    Pearce, Tim
    Ostroumov, Michael
    Brintrup, Alexandra
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 157
  • [5] Population Game-Assisted Multi-Agent Reinforcement Learning Method for Dynamic Multi-Vehicle Route Selection
    Yan, Liping
    Cai, Yu
    [J]. ELECTRONICS, 2024, 13 (08)
  • [6] An efficient routing access method based on multi-agent reinforcement learning in UWSNs
    Wei Su
    Keyu Chen
    Jiamin Lin
    Yating Lin
    [J]. Wireless Networks, 2022, 28 : 225 - 239
  • [7] An efficient routing access method based on multi-agent reinforcement learning in UWSNs
    Su, Wei
    Chen, Keyu
    Lin, Jiamin
    Lin, Yating
    [J]. WIRELESS NETWORKS, 2022, 28 (01) : 225 - 239
  • [8] Supply chain management with multi-agent paradigm
    Kaihara, T
    [J]. RO-MAN'99: 8TH IEEE INTERNATIONAL WORKSHOP ON ROBOT AND HUMAN INTERACTION, 1999, : 394 - 399
  • [9] Multi-vehicle routing problems with soft time windows: A multi-agent reinforcement learning approach
    Zhang, Ke
    He, Fang
    Zhang, Zhengchao
    Lin, Xi
    Li, Meng
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 121
  • [10] A multi-agent deep reinforcement learning approach for solving the multi-depot vehicle routing problem
    Arishi, Ali
    Krishnan, Krishna
    [J]. JOURNAL OF MANAGEMENT ANALYTICS, 2023, 10 (03) : 493 - 515