Multi-Armed Bandit On-Time Arrival Algorithms for Sequential Reliable Route Selection under Uncertainty

被引:10
|
作者
Zhou, Jinkai [1 ]
Lai, Xuebo [2 ]
Chow, Joseph Y. J. [1 ]
机构
[1] NYU, C2SMART Univ Transportat Ctr, Tandon Sch Engn, Dept Civil & Urban Engn, Brooklyn, NY 11201 USA
[2] NYU, Courant Inst Math Sci, Dept Comp Sci, New York, NY 10012 USA
基金
美国国家科学基金会;
关键词
SHORTEST-PATH PROBLEM; STOCHASTIC NETWORKS; GUIDANCE;
D O I
10.1177/0361198119850457
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Traditionally vehicles act only as servers in transporting passengers and goods. With increasing sensor equipment in vehicles, including automated vehicles, there is a need to test algorithms that consider the dual role of vehicles as both servers and sensors. The paper formulates a sequential route selection problem as a shortest path problem with on-time arrival reliability under a multi-armed bandit setting, a type of reinforcement learning model. A decision-maker has to make a finite set of decisions sequentially on departure time and path between a fixed origin-destination pair such that on-time reliability is maximized while travel time is minimized. The upper confidence bound algorithm is extended to handle this problem. Several tests are conducted. First, simulated data successfully verifies the method, then a real-data scenario is constructed of a hotel shuttle service from midtown Manhattan in New York City providing hourly access to John F. Kennedy International Airport. Results suggest that route selection with multi-armed bandit learning algorithms can be effective but neglecting passenger scheduling constraints can have negative effects on on-time arrival reliability by as much as 4.8% and combined reliability and travel time by 66.1%.
引用
收藏
页码:673 / 682
页数:10
相关论文
共 50 条
  • [21] muMAB: A Multi-Armed Bandit Model for Wireless Network Selection
    Boldrini, Stefano
    De Nardis, Luca
    Caso, Giuseppe
    Le, Mai T. P.
    Fiorina, Jocelyn
    Di Benedetto, Maria-Gabriella
    ALGORITHMS, 2018, 11 (02)
  • [22] Multi-armed Bandit processes with optimal selection of the operating times
    Pilar Ibarrola
    Ricardo Vélez
    Test, 2005, 14 : 239 - 255
  • [23] A Multi-Armed Bandit Selection Strategy for Hyper-heuristics
    Ferreira, Alexandre Silvestre
    Goncalves, Richard Aderbal
    Pozo, Aurora
    2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 525 - 532
  • [24] Operator Selection using Improved Dynamic Multi-Armed Bandit
    Belluz, Jany
    Gaudesi, Marco
    Squillero, Giovanni
    Tonda, Alberto
    GECCO'15: PROCEEDINGS OF THE 2015 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2015, : 1311 - 1317
  • [25] Multi-armed bandit processes with optimal selection of the operating times
    Ibarrola, P
    Vélez, R
    TEST, 2005, 14 (01) : 239 - 255
  • [26] Multi-armed Bandit Channel Selection for Power Line Communication
    Nikfar, Bahak
    Maghsudi, Setareh
    Vinck, A. J. Han
    2015 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2015, : 19 - 24
  • [27] AB Testing for Process Versions with Contextual Multi-armed Bandit Algorithms
    Satyal, Suhrid
    Weber, Ingo
    Paik, Hye-Young
    Di Ciccio, Claudio
    Mendling, Jan
    ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2018, 2018, 10816 : 19 - 34
  • [28] Distributed Competitive Decision Making Using Multi-Armed Bandit Algorithms
    Almasri, Mahmoud
    Mansour, Ali
    Moy, Christophe
    Assoum, Ammar
    Le Jeune, Denis
    Osswald, Christophe
    WIRELESS PERSONAL COMMUNICATIONS, 2021, 118 (02) : 1165 - 1188
  • [29] Ship route optimization considering on-time arrival probability under environmental uncertainty
    Yoo, Byunghyun
    Kim, Jinwhan
    2018 OCEANS - MTS/IEEE KOBE TECHNO-OCEANS (OTO), 2018,
  • [30] The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms
    Bayati, Mohsen
    Hamidi, Nima
    Johari, Ramesh
    Khosravi, Khashayar
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33