Interpretable machine-learning models for estimating trip purpose in smart card data

Cited by: 15
Authors
Kim, Eui-Jin [1]
Kim, Youngseo [1]
Kim, Dong-Kyu [1]
Affiliations
[1] Seoul Natl Univ, Dept Civil & Environm Engn, Seoul, South Korea
Keywords
statistical analysis; sustainability; transport management
DOI
10.1680/jmuen.20.00003
Chinese Library Classification
TU [Building Science]
Discipline classification code
0813
Abstract
Investigating the trip purposes of transit passengers is crucial for assessing current urban transportation systems and prioritising investments in public transportation infrastructure. Smart card data provide day-to-day information on passengers' boardings and alightings, but the lack of information on trip purposes restricts the use of these data. This paper focuses on estimating the trip purposes of transit passengers in smart card data, using a machine-learning model trained on household travel survey data. To accomplish this objective, a random forest model coupled with interpretable machine-learning methods (feature importance, feature interactions and accumulated local effects plots) is proposed. This approach can be used both to estimate trip purposes and to explain the decision-making process of the models. The models include spatiotemporal features that can be extracted from the smart card data and from geographic information data, both of which can be collected sustainably and cost-effectively. The proposed model achieves an overall accuracy of 83% on the validation data. The interpretation methods show that temporal features are the dominant factors in estimating trip purposes, and that spatial features influence the estimates mainly through cross-effects with the temporal features.
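As an illustration of one interpretation method named in the abstract, the sketch below computes a first-order accumulated local effects (ALE) curve for a single feature. It is not the authors' implementation: the feature names `hour` and `density`, the synthetic rows, and the hand-written `toy_predict` function (standing in for a trained random forest) are all assumptions made for the example.

```python
import random

def ale_1d(predict, rows, feature, n_bins=4):
    """First-order ALE curve of `feature`, centred to a weighted mean of zero.

    Assumes the feature takes at least two distinct values in `rows`.
    """
    values = sorted(row[feature] for row in rows)
    n = len(values)
    # Quantile-based bin edges; duplicates collapse so every bin has positive width.
    edges = sorted({values[(n - 1) * k // n_bins] for k in range(n_bins + 1)})
    effects, counts = [], []
    for i, (lo, hi) in enumerate(zip(edges[:-1], edges[1:])):
        # Rows whose feature value falls in this bin (first bin includes its lower edge).
        members = [r for r in rows
                   if (lo <= r[feature] <= hi if i == 0 else lo < r[feature] <= hi)]
        # Local effect: average prediction change when moving the feature
        # from the lower to the upper edge, other features held at observed values.
        diffs = [predict({**r, feature: hi}) - predict({**r, feature: lo})
                 for r in members]
        effects.append(sum(diffs) / len(diffs))
        counts.append(len(members))
    # Accumulate the local effects across bins.
    curve = [0.0]
    for effect in effects:
        curve.append(curve[-1] + effect)
    # Centre the curve using the count-weighted mean of the bin midpoints.
    mean = sum(c * (curve[i] + curve[i + 1]) / 2
               for i, c in enumerate(counts)) / sum(counts)
    return edges, [v - mean for v in curve]

def toy_predict(row):
    # Hypothetical additive score, NOT a random forest: stands in for a model output.
    return 3.0 * row["hour"] + 5.0 * row["density"]

random.seed(0)
# Synthetic rows with hypothetical temporal and spatial features.
rows = [{"hour": random.uniform(5, 23), "density": random.uniform(0, 1)}
        for _ in range(200)]
edges, curve = ale_1d(toy_predict, rows, "hour")
```

Because the toy score is additive and linear in `hour`, the resulting curve rises by 3 per unit of `hour`. With a real model, `predict` would wrap the trained classifier's probability for one trip purpose.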
Pages: 108 - 117
Page count: 10
Related papers
50 in total
  • [31] An interpretable machine-learning framework for dark matter halo formation
    Lucie-Smith, Luisa
    Peiris, Hiranya, V
    Pontzen, Andrew
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2019, 490 (01) : 331 - 342
  • [32] Exploring interpretable and non-interpretable machine learning models for estimating winter wheat evapotranspiration using particle swarm optimization with limited climatic data
    Zhao, Xin
    Zhang, Lei
    Zhu, Ge
    Cheng, Chenguang
    He, Jun
    Traore, Seydou
    Singh, Vijay P.
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 212
  • [33] Machine-Learning Studies on Spin Models
    Shiina, Kenta
    Mori, Hiroyuki
    Okabe, Yutaka
    Lee, Hwee Kuan
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [34] Machine-Learning Studies on Spin Models
    Kenta Shiina
    Hiroyuki Mori
    Yutaka Okabe
    Hwee Kuan Lee
    Scientific Reports, 10
  • [35] Comprehensive review of hydrothermal liquefaction data for use in machine-learning models
    Haarlemmer, Geert
    Matricon, Lucie
    Roubaud, Anne
    BIOFUELS BIOPRODUCTS & BIOREFINING-BIOFPR, 2024, 18 (05) : 1782 - 1798
  • [36] Biases in machine-learning models of human single-cell data
    Willem, Theresa
    Shitov, Vladimir A.
    Luecken, Malte D.
    Kilbertus, Niki
    Bauer, Stefan
    Piraud, Marie
    Buyx, Alena
    Theis, Fabian J.
    NATURE CELL BIOLOGY, 2025, 27 (03) : 384 - 392
  • [37] Data Quality Considerations for Petrophysical Machine-Learning Models
    McDonald, Andrew
    PETROPHYSICS, 2021, 62 (06) : 585 - 613
  • [38] Closure to "Estimating Particle Froude Number of Sewer Pipes by Boosting Machine-Learning Models"
    Shakya, Deepti
    Agarwal, Mayank
    Deshpande, Vishal
    Kumar, Bimlesh
    JOURNAL OF PIPELINE SYSTEMS ENGINEERING AND PRACTICE, 2024, 15 (01)
  • [39] Discussion of "Estimating Particle Froude Number of Sewer Pipes by Boosting Machine-Learning Models"
    Ebtehaj, Isa
    Montes, Carlos
    Bonakdari, Hossein
    JOURNAL OF PIPELINE SYSTEMS ENGINEERING AND PRACTICE, 2024, 15 (01)
  • [40] Correction to: Trip purpose inference for tourists by machine learning approaches based on mobile signaling data
    Haodong Sun
    Yanyan Chen
    Yang Wang
    Xiaoming Liu
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 14375 - 14375