Interpretable machine-learning models for estimating trip purpose in smart card data

被引:15
|
作者
Kim, Eui-Jin [1 ]
Kim, Youngseo [1 ]
Kim, Dong-Kyu [1 ]
机构
[1] Seoul Natl Univ, Dept Civil & Environm Engn, Seoul, South Korea
关键词
statistical analysis; sustainability; transport management;
D O I
10.1680/jmuen.20.00003
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Investigating trip purposes of transit passengers is crucial in assessing current urban transportation systems and prioritising investments in the public transportation infrastructure. Smart card data provide day-to-day information on passengers' boardings and alightings, but the lack of information on trip purposes leads to restrictions on the use of these data. This paper focuses on estimating trip purposes of transit passengers in smart card data, using a machine-learning model that is trained by household travel survey data. To accomplish this objective, a random forest model coupled with interpretable machine-learning methods - that is, feature importance, feature interactions and accumulated local effects plot is proposed. This approach can be used to estimate trip purposes and to explain the decision-making process of the models. The models include the spatiotemporal features that can be extracted from both the smart card data and the geographic information data, which can be collected sustainably and cost-effectively. The proposed model achieves an 83% overall accuracy in its estimation of the validation data. The interpretation methods show that temporal features are the dominant factors in estimating the purposes of trips, and the spatial features influence the estimates mainly through cross-effects with the temporal features.
引用
收藏
页码:108 / 117
页数:10
相关论文
共 50 条
  • [1] Enriching smart card data with the trip purpose attribute
    Faroqi, Hamed
    Saadatmand, Alireza
    Mesbah, Mahmoud
    Khodaii, Ali
    JOURNAL OF PUBLIC TRANSPORTATION, 2023, 25
  • [2] Public transport trip purpose inference using smart card fare data
    Alsger, Azalden
    Tavassoli, Ahmad
    Mesbah, Mahmoud
    Ferreira, Luis
    Hickman, Mark
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 87 : 123 - 137
  • [3] Interpretable machine-learning models for predicting creep recovery of concrete
    Mei, Shengqi
    Liu, Xiaodong
    Wang, Xingju
    Li, Xufeng
    STRUCTURAL CONCRETE, 2024,
  • [4] Inferring trip purpose by clustering sequences of smart card records
    Faroqi, Hamed
    Mesbah, Mahmoud
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 127
  • [5] Interpretable Machine-Learning Approach in Estimating FDI Inflow: Visualization of ML Models with LIME and H2O
    Singh, Devesh
    TALTECH JOURNAL OF EUROPEAN STUDIES, 2021, 11 (01): : 133 - 152
  • [6] Estimating the water quality index based on interpretable machine learning models
    Yang, Shiwei
    Liang, Ruifeng
    Chen, Junguang
    Wang, Yuanming
    Li, Kefeng
    WATER SCIENCE AND TECHNOLOGY, 2024, 89 (05) : 1340 - 1356
  • [7] Potential of kernel and tree-based machine-learning models for estimating missing data of rainfall
    Sattari, Mohammad Taghi
    Falsafian, Kambiz
    Irvem, Ahmet
    Shahab, S.
    Qasem, Sultan Noman
    ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2020, 14 (01) : 1078 - 1094
  • [8] An Interpretable Machine-learning Framework for Modeling High-resolution Spectroscopic Data*
    Gully-Santiago, Michael
    Morley, Caroline V.
    ASTROPHYSICAL JOURNAL, 2022, 941 (02):
  • [9] Fairness in the Eyes of the Data: Certifying Machine-Learning Models
    Segal, Shahar
    Adi, Yossi
    Pinkas, Benny
    Baum, Carsten
    Ganesh, Chaya
    Keshet, Joseph
    AIES '21: PROCEEDINGS OF THE 2021 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, 2021, : 926 - 935
  • [10] Synthetic Generation of Trip Data: The Case of Smart Card
    Minh Kieu
    Iris Brighid Meredith
    Andrea Raith
    Data Science for Transportation, 2023, 5 (2):