Interpretable Machine Learning for Mode Choice Modeling on Tracking-Based Revealed Preference Data

被引:4
|
作者
Dahmen, Victoria [1 ]
Weikl, Simone [2 ]
Bogenberger, Klaus [1 ]
机构
[1] Tech Univ Munich, Traff Engn & Control, Munich, Germany
[2] Regensburg Univ Appl Sci, Artificial Intelligence Infrastruct & Urban Dev, Regensburg, Germany
关键词
mode choice; interpretable machine learning; revealed preference; smartphone tracking; sensitivity analysis; travel behavior;
D O I
10.1177/03611981241246973
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Mode choice modeling is imperative for predicting and understanding travel behavior. For this purpose, machine learning (ML) models have increasingly been applied to stated preference and traditional self-recorded revealed preference data with promising results, particularly for extreme gradient boosting (XGBoost) and random forest (RF) models. Because of the rise in the use of tracking-based smartphone applications for recording travel behavior, we address the important and unprecedented task of testing these ML models for mode choice modeling on such data. Furthermore, as ML approaches are still criticized for leading to results that are hard to understand, we consider it essential to provide an in-depth interpretability analysis of the best-performing model. Our results show that the XGBoost and RF models far outperform a conventional multinomial logit model, both overall and for each mode. The interpretability analysis using the Shapley additive explanations approach reveals that the XGBoost model can be explained well at the overall and mode level. In addition, we demonstrate how to analyze individual predictions. Lastly, a sensitivity analysis gives insight into the relative importance of different data sources, sample size, and user involvement. We conclude that the XGBoost model performs best, while also being explainable. Insights generated by such models can be used, for instance, to predict mode choice decisions for arbitrary origin-destination pairs to see which impacts infrastructural changes would have on the mode share.
引用
收藏
页码:2075 / 2091
页数:17
相关论文
共 50 条
  • [31] Interpretable Machine Learning Modeling for Ischemic Stroke Outcome Prediction
    Jabal, Mohamed Sobhi
    Joly, Olivier
    Kallmes, David
    Harston, George
    Rabinstein, Alejandro
    Huynh, Thien
    Brinjikji, Waleed
    FRONTIERS IN NEUROLOGY, 2022, 13
  • [32] DIFFERENTIABLE TRACKING-BASED TRAINING OF DEEP LEARNING SOUND SOURCE LOCALIZERS
    Adavanne, Sharath
    Politis, Archontis
    Virtanen, Tuomas
    2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 211 - 215
  • [33] A machine learning based data modeling for medical diagnosis
    Mahoto, Naeem Ahmed
    Shaikh, Asadullah
    Sulaiman, Adel
    Reshan, Mana Saleh Al
    Rajab, Adel
    Rajab, Khairan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81
  • [34] Analysis and Modeling of Geodetic Data Based on Machine Learning
    Wu, Tong
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [35] Panel estimators to combine revealed and stated preference dichotomous choice data
    Loomis, JB
    JOURNAL OF AGRICULTURAL AND RESOURCE ECONOMICS, 1997, 22 (02): : 233 - 245
  • [36] Passenger travel behaviour on Chinese high-speed railways using machine learning based on revealed-preference data
    Jing, Yun
    Liu, Yingke
    Zhang, Zhenhua
    Su, Yunhan
    EXPERT SYSTEMS, 2019, 36 (04)
  • [37] Development of a tracking-based system for automated traffic data collection for roundabouts
    Hai Dinh
    Hua Tang
    Journal of Modern Transportation, 2017, (01) : 12 - 23
  • [38] Modeling freight mode choice using machine learning classifiers: a comparative study using Commodity Flow Survey (CFS) data
    Uddin, Majbah
    Anowar, Sabreena
    Eluru, Naveen
    TRANSPORTATION PLANNING AND TECHNOLOGY, 2021, 44 (05) : 543 - 559
  • [39] Development of a tracking-based system for automated traffic data collection for roundabouts
    Dinh H.
    Tang H.
    Journal of Modern Transportation, 2017, 25 (1): : 12 - 23
  • [40] Large-Scale Application of a Combined Destination and Mode Choice Model Estimated with Mixed Stated and Revealed Preference Data
    Heilig, Michael
    Mallig, Nicolai
    Hilgert, Tim
    Kagerbauer, Martin
    Vortisch, Peter
    TRANSPORTATION RESEARCH RECORD, 2017, (2669) : 31 - 40