Multi-Armed Bandit On-Time Arrival Algorithms for Sequential Reliable Route Selection under Uncertainty

被引：10

作者：

Zhou, Jinkai ^{[1
]}

Lai, Xuebo ^{[2
]}

Chow, Joseph Y. J. ^{[1
]}

机构：

[1] NYU, C2SMART Univ Transportat Ctr, Tandon Sch Engn, Dept Civil & Urban Engn, Brooklyn, NY 11201 USA

[2] NYU, Courant Inst Math Sci, Dept Comp Sci, New York, NY 10012 USA

来源：

TRANSPORTATION RESEARCH RECORD | 2019年 / 2673卷 / 10期

基金：

美国国家科学基金会;

关键词：

SHORTEST-PATH PROBLEM; STOCHASTIC NETWORKS; GUIDANCE;

D O I：

10.1177/0361198119850457

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Traditionally vehicles act only as servers in transporting passengers and goods. With increasing sensor equipment in vehicles, including automated vehicles, there is a need to test algorithms that consider the dual role of vehicles as both servers and sensors. The paper formulates a sequential route selection problem as a shortest path problem with on-time arrival reliability under a multi-armed bandit setting, a type of reinforcement learning model. A decision-maker has to make a finite set of decisions sequentially on departure time and path between a fixed origin-destination pair such that on-time reliability is maximized while travel time is minimized. The upper confidence bound algorithm is extended to handle this problem. Several tests are conducted. First, simulated data successfully verifies the method, then a real-data scenario is constructed of a hotel shuttle service from midtown Manhattan in New York City providing hourly access to John F. Kennedy International Airport. Results suggest that route selection with multi-armed bandit learning algorithms can be effective but neglecting passenger scheduling constraints can have negative effects on on-time arrival reliability by as much as 4.8% and combined reliability and travel time by 66.1%.

引用

页码：673 / 682

页数：10

共 50 条

[21] muMAB: A Multi-Armed Bandit Model for Wireless Network Selection
Boldrini, Stefano
De Nardis, Luca
Caso, Giuseppe
Le, Mai T. P.
Fiorina, Jocelyn
Di Benedetto, Maria-Gabriella
ALGORITHMS, 2018, 11 (02)
[22] Multi-armed Bandit processes with optimal selection of the operating times
Pilar Ibarrola
Ricardo Vélez
Test, 2005, 14 : 239 - 255
[23] A Multi-Armed Bandit Selection Strategy for Hyper-heuristics
Ferreira, Alexandre Silvestre
Goncalves, Richard Aderbal
Pozo, Aurora
2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 525 - 532
[24] Operator Selection using Improved Dynamic Multi-Armed Bandit
Belluz, Jany
Gaudesi, Marco
Squillero, Giovanni
Tonda, Alberto
GECCO'15: PROCEEDINGS OF THE 2015 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2015, : 1311 - 1317
[25] Multi-armed bandit processes with optimal selection of the operating times
Ibarrola, P
Vélez, R
TEST, 2005, 14 (01) : 239 - 255
[26] Multi-armed Bandit Channel Selection for Power Line Communication
Nikfar, Bahak
Maghsudi, Setareh
Vinck, A. J. Han
2015 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2015, : 19 - 24
[27] AB Testing for Process Versions with Contextual Multi-armed Bandit Algorithms
Satyal, Suhrid
Weber, Ingo
Paik, Hye-Young
Di Ciccio, Claudio
Mendling, Jan
ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2018, 2018, 10816 : 19 - 34
[28] Distributed Competitive Decision Making Using Multi-Armed Bandit Algorithms
Almasri, Mahmoud
Mansour, Ali
Moy, Christophe
Assoum, Ammar
Le Jeune, Denis
Osswald, Christophe
WIRELESS PERSONAL COMMUNICATIONS, 2021, 118 (02) : 1165 - 1188
[29] Ship route optimization considering on-time arrival probability under environmental uncertainty
Yoo, Byunghyun
Kim, Jinwhan
2018 OCEANS - MTS/IEEE KOBE TECHNO-OCEANS (OTO), 2018,
[30] The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms
Bayati, Mohsen
Hamidi, Nima
Johari, Ramesh
Khosravi, Khashayar
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33

← 1 2 3 4 5 →