Multi-Armed Bandit On-Time Arrival Algorithms for Sequential Reliable Route Selection under Uncertainty

被引:10
|
作者
Zhou, Jinkai [1 ]
Lai, Xuebo [2 ]
Chow, Joseph Y. J. [1 ]
机构
[1] NYU, C2SMART Univ Transportat Ctr, Tandon Sch Engn, Dept Civil & Urban Engn, Brooklyn, NY 11201 USA
[2] NYU, Courant Inst Math Sci, Dept Comp Sci, New York, NY 10012 USA
基金
美国国家科学基金会;
关键词
SHORTEST-PATH PROBLEM; STOCHASTIC NETWORKS; GUIDANCE;
D O I
10.1177/0361198119850457
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Traditionally vehicles act only as servers in transporting passengers and goods. With increasing sensor equipment in vehicles, including automated vehicles, there is a need to test algorithms that consider the dual role of vehicles as both servers and sensors. The paper formulates a sequential route selection problem as a shortest path problem with on-time arrival reliability under a multi-armed bandit setting, a type of reinforcement learning model. A decision-maker has to make a finite set of decisions sequentially on departure time and path between a fixed origin-destination pair such that on-time reliability is maximized while travel time is minimized. The upper confidence bound algorithm is extended to handle this problem. Several tests are conducted. First, simulated data successfully verifies the method, then a real-data scenario is constructed of a hotel shuttle service from midtown Manhattan in New York City providing hourly access to John F. Kennedy International Airport. Results suggest that route selection with multi-armed bandit learning algorithms can be effective but neglecting passenger scheduling constraints can have negative effects on on-time arrival reliability by as much as 4.8% and combined reliability and travel time by 66.1%.
引用
收藏
页码:673 / 682
页数:10
相关论文
共 50 条
  • [31] Application of Multi-Armed Bandit Algorithms for Channel Sensing in Cognitive Radio
    Kato, Tomohiro
    Zaman, Nur Atiqah Farahin Kamarul
    Hasegawa, Mikio
    2012 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS), 2012, : 503 - 506
  • [32] Distributed Competitive Decision Making Using Multi-Armed Bandit Algorithms
    Mahmoud Almasri
    Ali Mansour
    Christophe Moy
    Ammar Assoum
    Denis Le Jeune
    Christophe Osswald
    Wireless Personal Communications, 2021, 118 : 1165 - 1188
  • [33] Optimizing Rescheduling Intervals through Using Multi-Armed Bandit Algorithms
    Lin, Fuhua
    Dewan, M. Ali Akber
    Nguyen, Matthew
    IEEE 2018 INTERNATIONAL CONGRESS ON CYBERMATICS / 2018 IEEE CONFERENCES ON INTERNET OF THINGS, GREEN COMPUTING AND COMMUNICATIONS, CYBER, PHYSICAL AND SOCIAL COMPUTING, SMART DATA, BLOCKCHAIN, COMPUTER AND INFORMATION TECHNOLOGY, 2018, : 746 - 753
  • [34] Personalized clinical trial based on multi-armed bandit algorithms with covariates
    Shao, Yifei
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ALGORITHMS, SOFTWARE ENGINEERING, AND NETWORK SECURITY, ASENS 2024, 2024, : 12 - 17
  • [35] Time-Varying Stochastic Multi-Armed Bandit Problems
    Vakili, Sattar
    Zhao, Qing
    Zhou, Yuan
    CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 2103 - 2107
  • [36] Synchronization and optimality for multi-armed bandit problems in continuous time
    ElKaroui, N
    Karatzas, I
    COMPUTATIONAL & APPLIED MATHEMATICS, 1997, 16 (02): : 117 - 151
  • [37] Multiagent Multi-Armed Bandit Schemes for Gateway Selection in UAV Networks
    Hashima, Sherief
    Hatano, Kohei
    Mohamed, Ehab Mahmoud
    2020 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2020,
  • [38] An Online Kernel Selection Wrapper via Multi-Armed Bandit Model
    Li, Junfan
    Liao, Shizhong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1307 - 1312
  • [39] Learning State Selection for Reconfigurable Antennas: A Multi-Armed Bandit Approach
    Gulati, Nikhil
    Dandekar, Kapil R.
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2014, 62 (03) : 1027 - 1038
  • [40] Research on Modelling Single Keyword Selection Based on Multi-armed Bandit
    Zhou, Baojian
    Qi, Wei
    Chen, Ligang
    2ND INTERNATIONAL CONFERENCE ON COMMUNICATION AND TECHNOLOGY (ICCT 2015), 2015, : 266 - 273