Coordinated multi-agent hierarchical deep reinforcement learning to solve multi-trip vehicle routing problems with soft time windows

被引:1
|
作者
Zhang, Zixian [1 ]
Qi, Geqi [1 ]
Guan, Wei [1 ,2 ]
机构
[1] Minist Transport, Key Lab Transport Ind Big Data Applicat Technol Co, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Key Lab Transport Ind Big Data Applicat Technol Co, Minist Transport, Beijing 100044, Peoples R China
关键词
goods distribution; hierarchical systems; multi-agent systems; neural nets; optimization; coordinated multi-agent; deep reinforcement learning; hierarchical layer; vehicle routing problem with time window; LOCAL SEARCH;
D O I
10.1049/itr2.12394
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Vehicle Routing Problem (VRP) is a widespread problem in the transportation field, which challenges the intelligent level of vehicle decisions. Multi-Trip Vehicle Routing Problem with Time Windows (MTVRPTW), as a further evolved problem of VRP considering multiple departures from one depot and temporal constraint of visiting nodes, has developed into one of the critical issues in the scheduling of logistics, bus transit, railway, and aviation. Traditionally, MTVRPTW is solved by the heuristic algorithm, which is generally time-consuming and of non-steady results. Reinforcement learning (RL) and multi-agent framework have become popular in solving VRP to get better performance. However, the lack of variant dimensions in searching space and knowledge exchange between agents inhibit the further improvement of algorithms. Therefore, a Coordinated Multi-agent Hierarchical Deep Reinforcement Learning (CMA-HDRL) method is proposed in this study to enhance the overall solution quality and convergence rate by constructing a three-layered structure (time, communication, and global layers), which is particularly designed to handle the state space explosion and improve the collaboration between agents. The results show that the proposed method can significantly outperform the general genetic algorithm (GA), RL, multi-agent algorithm, and hierarchical algorithm, not only from the effectiveness on the cost consisting of travel time and penalty time but also from the operation robustness.
引用
收藏
页码:2034 / 2051
页数:18
相关论文
共 50 条
  • [1] Multi-vehicle routing problems with soft time windows: A multi-agent reinforcement learning approach
    Zhang, Ke
    He, Fang
    Zhang, Zhengchao
    Lin, Xi
    Li, Meng
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 121
  • [2] Multi-Zone Multi-Trip Vehicle Routing Problem with Time Windows
    Crainic, Teodor Gabriel
    Gajpal, Yuvraj
    Gendreau, Michel
    [J]. INFOR, 2015, 53 (02) : 49 - 67
  • [3] The Multi-Trip Vehicle Routing Problem with Time Windows and Release Dates
    Cattaruzza, Diego
    Absi, Nabil
    Feillet, Dominique
    [J]. TRANSPORTATION SCIENCE, 2016, 50 (02) : 676 - 693
  • [4] A new exact algorithm to solve the multi-trip vehicle routing problem with time windows and limited duration
    F. Hernandez
    D. Feillet
    R. Giroudeau
    O. Naud
    [J]. 4OR, 2014, 12 : 235 - 259
  • [5] A new exact algorithm to solve the multi-trip vehicle routing problem with time windows and limited duration
    Hernandez, F.
    Feillet, D.
    Giroudeau, R.
    Naud, O.
    [J]. 4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2014, 12 (03): : 235 - 259
  • [6] Multi-Trip Time-Dependent Vehicle Routing Problem with Soft Time Windows and Overtime Constraints
    Karoonsoontawong, Ampol
    Punyim, Puntipa
    Nueangnitnaraporn, Wanvara
    Ratanavaraha, Vatanavongs
    [J]. NETWORKS & SPATIAL ECONOMICS, 2020, 20 (02): : 549 - 598
  • [7] Multi-Trip Time-Dependent Vehicle Routing Problem with Soft Time Windows and Overtime Constraints
    Ampol Karoonsoontawong
    Puntipa Punyim
    Wanvara Nueangnitnaraporn
    Vatanavongs Ratanavaraha
    [J]. Networks and Spatial Economics, 2020, 20 : 549 - 598
  • [8] Multi-trip time-dependent vehicle routing problem with time windows
    Pan, Binbin
    Zhang, Zhenzhen
    Lim, Andrew
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 291 (01) : 218 - 231
  • [9] The multi-trip vehicle routing problem with time windows and unloading queue at depot
    Huang, Nan
    Li, Jiliu
    Zhu, Wenbin
    Qin, Hu
    [J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2021, 152
  • [10] A Study of the Multi-Trip Vehicle Routing Problem with Time Windows and Heterogeneous Fleet
    Despaux, Francois
    Basterrech, Sebastian
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA 2014), 2014,