Performance of the Gold Standard and Machine Learning in Predicting Vehicle Transactions

被引:0
|
作者
Lazar, Alina [1 ]
Jin, Ling [2 ]
Brown, Caitlin [2 ]
Spurlock, C. Anna [2 ]
Sim, Alexander [3 ]
Wu, Kesheng [3 ]
机构
[1] Youngstown State Univ, Dept Comp Sci & Informat Syst, Youngstown, OH 44555 USA
[2] Lawrence Berkeley Natl Lab, Energy Technol Area, Berkeley, CA USA
[3] Lawrence Berkeley Natl Lab, Computat Res Div, Berkeley, CA USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2021年
关键词
household vehicle transaction; multinomial logit; gradient boosting; SHAP values; treeExplainer;
D O I
10.1109/BigData52589.2021.9671286
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Logistic regression has long been the gold standard for choice modeling in the transportation field. Despite the rising popularity of machine learning (ML), few is applied to predicting the household vehicle transactions. To address the research gap, this paper presents a first use case of ML application to predicting household vehicle transaction decisions by leveraging a newly processed national panel data set. Model performances are reported for four ML models and the traditional multinomial logit model (MNL). Instead of treating the gold standard and ML models as competitors, this paper tries to use ML tools to inform the MNL model building process. We find the two gradient boosting based methods, CatBoost and LightGBM, are the best performing ML models; and improving logistic models with SHAP interpretation tools can achieve similar performance levels to the best performing ML methods.
引用
收藏
页码:3700 / 3704
页数:5
相关论文
共 50 条
  • [41] Predicting Collaborative Performance at Assessment Level using Machine Learning
    Raza, Syed Arshad
    2019 2ND INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS), 2019,
  • [42] Predicting Traffic Performance During a Wildfire Using Machine Learning
    Hou, Zenghao
    Darr, Justin
    Zhang, Michael
    TRANSPORTATION RESEARCH RECORD, 2023, 2677 (03) : 1625 - 1636
  • [43] NETPerfTrace - Predicting Internet Path Dynamics and Performance with Machine Learning
    Wassermann, Sarah
    Casas, Pedro
    Cuvelier, Thibaut
    Donnet, Benoit
    BIG-DAMA '17: PROCEEDINGS OF THE 2017 WORKSHOP ON BIG DATA ANALYTICS AND MACHINE LEARNING FOR DATA COMMUNICATION NETWORKS, 2017, : 31 - 36
  • [44] Predicting financial performance with intellectual capital using machine learning
    Lim, SangGon
    JOURNAL OF HOSPITALITY AND TOURISM TECHNOLOGY, 2025, 16 (02) : 369 - 388
  • [45] Performance of Machine Learning Compared With Regression Analysis in Predicting Albuminuria
    Arsiwala, Ali Haider
    Mathew, Marcella
    Scheppach, Johannes B.
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2022, 33 (11): : 301 - 301
  • [46] Predicting Perovskite Performance with Multiple Machine-Learning Algorithms
    Li, Ruoyu
    Deng, Qin
    Tian, Dong
    Zhu, Daoye
    Lin, Bin
    CRYSTALS, 2021, 11 (07)
  • [47] Predicting Student Performance Using Machine Learning in fNIRS Data
    Oku, Amanda Yumi Ambriola
    Sato, Joao Ricardo
    FRONTIERS IN HUMAN NEUROSCIENCE, 2021, 15
  • [48] Predicting the financial performance of microfinance institutions with machine learning techniques
    Ting, Tang
    Mia, Md Aslam
    Hossain, Md Imran
    Wah, Khaw Khai
    JOURNAL OF MODELLING IN MANAGEMENT, 2025, 20 (02) : 322 - 347
  • [49] Predicting Fitness and Performance of Diving using Machine Learning Algorithms
    Mahajan, Uma
    Krishnan, Anup
    Malhotra, Vineet
    Sharma, Deep
    Gore, Sharad
    2019 IEEE PUNE SECTION INTERNATIONAL CONFERENCE (PUNECON), 2019,
  • [50] Predicting MPI Collective Communication Performance Using Machine Learning
    Hunold, Sascha
    Bhatele, Abhinav
    Bosilca, George
    Knees, Peter
    2020 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2020), 2020, : 259 - 269