Multi-Armed Bandit On-Time Arrival Algorithms for Sequential Reliable Route Selection under Uncertainty

Cited by: 10
Authors
Zhou, Jinkai [1 ]
Lai, Xuebo [2 ]
Chow, Joseph Y. J. [1 ]
Affiliations
[1] NYU, C2SMART Univ Transportat Ctr, Tandon Sch Engn, Dept Civil & Urban Engn, Brooklyn, NY 11201 USA
[2] NYU, Courant Inst Math Sci, Dept Comp Sci, New York, NY 10012 USA
Funding
National Science Foundation (USA);
Keywords
SHORTEST-PATH PROBLEM; STOCHASTIC NETWORKS; GUIDANCE;
DOI
10.1177/0361198119850457
Chinese Library Classification
TU [Building Science];
Discipline classification code
0813 ;
Abstract
Traditionally, vehicles act only as servers that transport passengers and goods. As vehicles, including automated vehicles, carry more sensor equipment, there is a need to test algorithms that consider the dual role of vehicles as both servers and sensors. The paper formulates a sequential route selection problem as a shortest path problem with on-time arrival reliability under a multi-armed bandit setting, a type of reinforcement learning model. A decision-maker must make a finite set of sequential decisions on departure time and path between a fixed origin-destination pair such that on-time reliability is maximized while travel time is minimized. The upper confidence bound algorithm is extended to handle this problem. Several tests are conducted. First, the method is verified on simulated data. Then a real-data scenario is constructed of a hotel shuttle service from midtown Manhattan in New York City providing hourly access to John F. Kennedy International Airport. Results suggest that route selection with multi-armed bandit learning algorithms can be effective, but that neglecting passenger scheduling constraints can have negative effects, reducing on-time arrival reliability by as much as 4.8% and worsening combined reliability and travel time by 66.1%.
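For readers unfamiliar with the bandit framing, the following is a minimal sketch of the standard UCB1 rule applied to route choice. It is not the paper's extended algorithm: it ignores departure-time decisions and the travel-time objective, and it treats each route as an arm whose reward is 1 when the trip meets the deadline and 0 otherwise. The function name `ucb1_route_selection`, the `sample_travel_time` callback, and all parameters are hypothetical names chosen for illustration.

```python
import math
import random


def ucb1_route_selection(sample_travel_time, n_routes, deadline, n_trips, seed=0):
    """Standard UCB1 over candidate routes (illustrative assumption, not the paper's method).

    sample_travel_time(route, rng) -> observed travel time for one trip on `route`.
    Reward is 1 if the trip arrives within `deadline`, else 0.
    Returns (trips per route, on-time arrivals per route, sequence of chosen routes).
    """
    rng = random.Random(seed)
    counts = [0] * n_routes   # number of trips taken on each route
    on_time = [0] * n_routes  # number of on-time arrivals on each route
    choices = []

    for t in range(1, n_trips + 1):
        if t <= n_routes:
            # Initialization: try every route once.
            route = t - 1
        else:
            # Pick the route with the highest upper confidence bound on
            # its empirical on-time arrival rate.
            route = max(
                range(n_routes),
                key=lambda r: on_time[r] / counts[r]
                + math.sqrt(2.0 * math.log(t) / counts[r]),
            )
        travel_time = sample_travel_time(route, rng)
        counts[route] += 1
        if travel_time <= deadline:
            on_time[route] += 1
        choices.append(route)

    return counts, on_time, choices
```

In this sketch the exploration bonus shrinks as a route is sampled more often, so trips gradually concentrate on the route with the best observed on-time rate while the others are still revisited occasionally.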
Pages: 673-682
Number of pages: 10
Related papers
50 in total
  • [41] Multi-Armed Bandit-Based User Network Node Selection
    Gao, Qinyan
    Xie, Zhidong
    SENSORS, 2024, 24 (13)
  • [42] Robust Trajectory Selection for Rearrangement Planning as a Multi-Armed Bandit Problem
    Koval, Michael C.
    King, Jennifer E.
    Pollard, Nancy S.
    Srinivasa, Siddhartha S.
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 2678 - 2685
  • [43] LEARNING ALGORITHMS FOR ENERGY-EFFICIENT MIMO ANTENNA SUBSET SELECTION: MULTI-ARMED BANDIT FRAMEWORK
    Mukherjee, Amitav
    Hottinen, Ari
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 659 - 663
  • [44] Automated Collaborator Selection for Federated Learning with Multi-armed Bandit Agents
    Larsson, Hannes
    Riaz, Hassam
    Ickin, Selim
    PROCEEDINGS OF THE 4TH FLEXNETS WORKSHOP ON FLEXIBLE NETWORKS, ARTIFICIAL INTELLIGENCE SUPPORTED NETWORK FLEXIBILITY AND AGILITY (FLEXNETS'21), 2021, : 44 - 49
  • [45] A Distributed Algorithm for Sequential Decision Making in Multi-Armed Bandit with Homogeneous Rewards
    Zhu, Jingxuan
    Sandhu, Romeil
    Liu, Ji
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3078 - 3083
  • [46] Selecting multiple web adverts: A contextual multi-armed bandit with state uncertainty
    Edwards, James A.
    Leslie, David S.
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2020, 71 (01) : 100 - 116
  • [47] Multi-Armed Bandit Models for 2D Grasp Planning with Uncertainty
    Laskey, Michael
    Mahler, Jeff
    McCarthy, Zoe
    Pokorny, Florian T.
    Patil, Sachin
    van den Berg, Jur
    Kragic, Danica
    Abbeel, Pieter
    Goldberg, Ken
    2015 INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2015, : 572 - 579
  • [48] Multi-Armed Bandit Approaches for Location Planning with Dynamic Relief Supplies Allocation Under Disaster Uncertainty
    Liang, Jun
    Zhang, Zongjia
    Zhi, Yanpeng
    SMART CITIES, 2025, 8 (01):
  • [49] Multi-Armed Bandit Algorithms for Crowdsourcing Systems with Online Estimation of Workers' Ability
    Rangi, Anshuka
    Franceschetti, Massimo
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1345 - 1352
  • [50] Enhancing Evolutionary Conversion Rate Optimization via Multi-Armed Bandit Algorithms
    Qiu, Xin
    Miikkulainen, Risto
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9581 - 9588