A Hybrid of Deep Reinforcement Learning and Local Search for the Vehicle Routing Problems

被引：74

作者：

Zhao, Jiuxia ^{[1
]}

Mao, Minjia ^{[2
]}

Zhao, Xi ^{[3
]}

Zou, Jianhua ^{[1
]}

机构：

[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Peoples R China

[2] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China

[3] Xi An Jiao Tong Univ, Sch Management, Xian 710049, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2021年 / 22卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Routing; Adaptation models; Heuristic algorithms; Search problems; Training; Optimization; VRP; VRPTW; routing simulator; deep reinforcement learning; adaptive critic; local search; LARGE NEIGHBORHOOD SEARCH; OPTIMIZATION; ALGORITHMS; DELIVERY;

D O I：

10.1109/TITS.2020.3003163

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Different variants of the Vehicle Routing Problem (VRP) have been studied for decades. State-of-the-art methods based on local search have been developed for VRPs, while still facing problems of slow running time and poor solution quality in the case of large problem size. To overcome these problems, we first propose a novel deep reinforcement learning (DRL) model, which is composed of an actor, an adaptive critic and a routing simulator. The actor, based on the attention mechanism, is designed to generate routing strategies. The adaptive critic is devised to change the network structure adaptively, in order to accelerate the convergence rate and improve the solution quality during training. The routing simulator is developed to provide graph information and reward with the actor and adaptive cirtic. Then, we combine this DRL model with a local search method to further improve the solution quality. The output of the DRL model can serve as the initial solution for the following local search method, from where the final solution of the VRP is obtained. Tested on three datasets with customer points of 20, 50 and 100 respectively, experimental results demonstrate that the DRL model alone finds better solutions compared to construction algorithms and previous DRL approaches, while enabling a 5- to 40-fold speedup. We also observe that combining the DRL model with various local search methods yields excellent solutions at a superior generation speed, comparing to that of other initial solutions.

引用

页码：7208 / 7218

页数：11

共 50 条

[41] VARL: a variational autoencoder-based reinforcement learning Framework for vehicle routing problems
Wang, Qi
APPLIED INTELLIGENCE, 2022, 52 (08) : 8910 - 8923
[42] VARL: a variational autoencoder-based reinforcement learning Framework for vehicle routing problems
Qi Wang
Applied Intelligence, 2022, 52 : 8910 - 8923
[43] A Hybrid Reinforcement Learning-Based Model for the Vehicle Routing Problem in Transportation Logistics
Phiboonbanakit, Thananut
Horanont, Teerayut
Huynh, Van-Nam
Supnithi, Thepchai
IEEE ACCESS, 2021, 9 : 163325 - 163347
[44] Population-Based Iterated Local Search Approach for Dynamic Vehicle Routing Problems
Sabar, Nasser R.
Goh, Say Leng
Turky, Ayad
Kendall, Graham
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (04) : 2933 - 2943
[45] A hybrid adaptive iterated local search with diversification control to the capacitated vehicle routing problem
Maximo, Vinicius R.
Nascimento, Maria C., V
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 294 (03) : 1108 - 1119
[46] Two-Stage Iterated Local Search for Solving Capacitated Vehicle Routing Problems
Yeh, Chun-Chao
Liu, Da-Yuan
Liao, Yan-Kai
2016 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C), 2016, : 45 - 48
[47] A hybrid guided local search for the vehicle-routing problem with intermediate replenishment facilities
Tarantilis, Christos D.
Zachariadis, Emmanouil E.
Kiranoudis, Chris T.
INFORMS JOURNAL ON COMPUTING, 2008, 20 (01) : 154 - 168
[48] Router: A Fast and Flexible Local Search Algorithm for a Class of Rich Vehicle Routing Problems
Derigs, Ulrich
Doehmer, Thomas
OPERATIONS RESEARCH PROCEEDINGS 2004, 2005, : 144 - 149
[49] A hybrid iterated local search algorithm for the multi-compartment vehicle routing problem
Hou, Yan-e
Wang, Chunxiao
Wang, Congran
Fan, Gaojuan
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (01) : 257 - 268
[50] A two-stage hybrid local search for the vehicle routing problem with time windows
Bent, R
Van Hentenryck, P
TRANSPORTATION SCIENCE, 2004, 38 (04) : 515 - 530

← 1 2 3 4 5 →