Benchmarking Travel Time and Demand Prediction Methods Using Large-scale Metro Smart Card Data

被引:0
|
作者
Zimmo I. [1 ]
Hörcher D. [1 ]
Singh R. [1 ]
Graham D.J. [1 ]
机构
[1] Transport Strategy Centre, Department of Civil and Environmental Engineering, Faculty of Engineering, Imperial College London, South Kensington Campus, London
来源
关键词
demand prediction; machine learning; smart card data; urban public transport;
D O I
10.3311/PPtr.22252
中图分类号
学科分类号
摘要
Urban mass transit systems generate large volumes of data via automated systems established for ticketing, signalling, and other operational processes. This study is motivated by the observation that despite the availability of sophisticated quantitative methods, most public transport operators are constrained in exploiting the information their datasets contain. This paper intends to address this gap in the context of real-time demand and travel time prediction with smart card data. We comparatively benchmark the predictive performance of four quantitative prediction methods: multivariate linear regression (MVLR) and semiparametric regression (SPR) widely used in the econometric literature, and random forest regression (RFR) and support vector machine regression (SVMR) from machine learning. We find that the SVMR and RFR methods are the most accurate in travel flow and travel time prediction, respectively. However, we also find that the SPR technique offers lower computation time at the expense of minor inefficiency in predictive power in comparison with the two machine learning methods. © 2023 Budapest University of Technology and Economics. All rights reserved.
引用
收藏
页码:357 / 374
页数:17
相关论文
共 50 条
  • [31] A metro smart card data-based analysis of group travel behaviour in Shanghai, China
    Zhang, Yongping
    Manley, Ed
    Martens, Karel
    Batty, Michael
    JOURNAL OF TRANSPORT GEOGRAPHY, 2024, 114
  • [32] Spatiotemporal Evolution of Travel Pattern Using Smart Card Data
    Lin, Mu
    Huang, Zhengdong
    Zhao, Tianhong
    Zhang, Ying
    Wei, Heyi
    SUSTAINABILITY, 2022, 14 (15)
  • [33] Benchmarking large-scale data management for Internet of Things
    Hendawi, Abdeltawab
    Gupta, Jayant
    Liu, Jiayi
    Teredesai, Ankur
    Ramakrishnan, Naveen
    Shah, Mohak
    El-Sappagh, Shaker
    Kwak, Kyung-Sup
    Ali, Mohamed
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (12): : 8207 - 8230
  • [34] Inferring Travel Purposes for Transit Smart Card Data Using
    Liu, Zhenzhen
    Li, Qing-Quan
    Zhuang, Yan
    Xiong, Jiacheng
    Li, Shuiquan
    SMART COMPUTING AND COMMUNICATION, SMARTCOM 2017, 2018, 10699 : 11 - 18
  • [35] Benchmarking large-scale data management for Internet of Things
    Abdeltawab Hendawi
    Jayant Gupta
    Jiayi Liu
    Ankur Teredesai
    Naveen Ramakrishnan
    Mohak Shah
    Shaker El-Sappagh
    Kyung-Sup Kwak
    Mohamed Ali
    The Journal of Supercomputing, 2019, 75 : 8207 - 8230
  • [36] Large-scale loyalty card data in health research
    Nevalainen, Jaakko
    Erkkola, Maijaliisa
    Saarijarvi, Hannu
    Nappila, Turkka
    Fogelholm, Mikael
    DIGITAL HEALTH, 2018, 4
  • [37] Calibrating a transit assignment model using smart card data in a large-scale multi-modal transit network
    Ahmad Tavassoli
    Mahmoud Mesbah
    Mark Hickman
    Transportation, 2020, 47 : 2133 - 2156
  • [38] Calibrating a transit assignment model using smart card data in a large-scale multi-modal transit network
    Tavassoli, Ahmad
    Mesbah, Mahmoud
    Hickman, Mark
    TRANSPORTATION, 2020, 47 (05) : 2133 - 2156
  • [39] An experiential learning-based transit route choice model using large-scale smart-card data
    Arriagada, Jacqueline
    Guevara, C. Angelo
    Munizaga, Marcela
    Gao, Song
    TRANSPORTATION, 2024,
  • [40] OpenBioLink: a benchmarking framework for large-scale biomedical link prediction
    Breit, Anna
    Ott, Simon
    Agibetov, Asan
    Samwald, Matthias
    BIOINFORMATICS, 2020, 36 (13) : 4097 - 4098