Truck industry classification from anonymous mobile sensor data using machine learning

被引:5
|
作者
Akter, Taslima [1 ]
Hernandez, Sarah [2 ]
机构
[1] CPCS Transcom Inc, Infrastruct Analyt Unit, Washington, DC 20036 USA
[2] Univ Arkansas, Dept Civil Engn, Fayetteville, AR USA
关键词
Freight transportation; Global Positioning System (GPS); Machine learning; Pattern recognition systems; GPS DATA; MODELS;
D O I
10.1016/j.ijtst.2021.07.001
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Freight demand forecasting models used by federal, state, and local transportation agencies to predict future freight flows are often based on economic forecasts of industry growth and/or commodity production/consumption rates which are then used to estimate expected freight movement, e.g., truck volumes. Unfortunately, there is a lack of data connecting industry served and commodity carried to freight movements which limits the accuracy and usability of such models. While the private sector collects robust data on freight movements including commodity carried, when shared with the public sector, this data is anonymized to protect privacy. Thus, there is a critical need to re-identify industry served and commodity carried from anonymous freight movement data in ways that maintain privacy standards. To address this research gap, we developed a classification model using data mining and machine learning methods to predict industry served by a truck from daily activity patterns of trip and stop sequences extracted from truck Global Positioning System (GPS) data. A Weighted Random Forest (WRF) supervised machine learning model is used to predict five industry groups: farm products, mining materials, chemicals, manufactured goods, and miscellaneous mixed goods. The WRF model achieves 88% prediction accuracy and explicitly accounts for class imbalance. The model does not reveal fleet, driver, company, etc. and thus, we are able to provide necessary insight into the relationship between truck movement and economic (industry) forecasts without violating data privacy standards. Ultimately, our model allows large streams of truck movement data to be leveraged for freight travel demand forecasting.& COPY; 2021 Tongji University and Tongji University Press. Publishing Services by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).
引用
收藏
页码:522 / 535
页数:14
相关论文
共 50 条
  • [41] Classification of Breast Cancer Data Using Machine Learning Algorithms
    Akbugday, Burak
    [J]. 2019 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2019, : 429 - 432
  • [42] Perspectives on Machine Learning for Classification of Schizotypy Using fMRI Data
    Madsen, Kristoffer H.
    Krohne, Laerke G.
    Cai, Xin-lu
    Wang, Yi
    Chan, Raymond C. K.
    [J]. SCHIZOPHRENIA BULLETIN, 2018, 44 : S480 - S490
  • [43] Using machine learning techniques for exploration and classification of laboratory data
    Trulson, Inga
    Holdenrieder, Stefan
    Hoffmann, Georg
    [J]. JOURNAL OF LABORATORY MEDICINE, 2024,
  • [44] Classification of Electrical Impedance Tomography Data Using Machine Learning
    Pessoa, Diogo
    Rocha, Bruno Machado
    Cheimariotis, Grigorios-Aris
    Haris, Kostas
    Strodthoff, Claas
    Kaimakamis, Evangelos
    Maglaveras, Nicos
    Frerichs, Inez
    de Carvalho, Paulo
    Paiva, Rui Pedro
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 349 - 353
  • [45] Classification of rocks radionuclide data using machine learning techniques
    Khan, Abdul Razzaq
    Mir, Adil Aslam
    Saeed, Sharjil
    Rafique, Muhammad
    Asim, Khawaja M.
    Iqbal, Talat
    Jabbar, Abdul
    Rahman, Saeed Ur
    [J]. ACTA GEOPHYSICA, 2018, 66 (05) : 1073 - 1079
  • [46] Classification of rocks radionuclide data using machine learning techniques
    Abdul Razzaq Khan
    Adil Aslam Mir
    Sharjil Saeed
    Muhammad Rafique
    Khawaja M. Asim
    Talat Iqbal
    Abdul Jabbar
    Saeed Ur Rahman
    [J]. Acta Geophysica, 2018, 66 : 1073 - 1079
  • [47] Classification of SDSS photometric data using machine learning on a cloud
    Acharya, Vishwanath
    Bora, Piyush Singh
    Navin, Karri
    Nazareth, Anisha
    Anusha, P. S.
    Rao, Shrisha
    [J]. CURRENT SCIENCE, 2018, 115 (02): : 249 - 257
  • [48] CLASSIFICATION OF RAIL SWITCH DATA USING MACHINE LEARNING TECHNIQUES
    Bryan, Kaylen J.
    Solomon, Mitchell
    Jensen, Emily
    Coley, Christina
    Rajan, Kailas
    Tian, Charlie
    Mijatovic, Nenad
    Kiss, James M.
    Lamoureux, Benjamin
    Dersin, Pierre
    Smith, Anthony O.
    Peter, Adrian M.
    [J]. PROCEEDINGS OF THE ASME JOINT RAIL CONFERENCE, 2018, 2018,
  • [49] Classification of Diabetic Patient Data Using Machine Learning Techniques
    Singh, Pankaj Pratap
    Prasad, Shitala
    Das, Bhaskarjyoti
    Poddar, Upasana
    Choudhury, Dibarun Roy
    [J]. AMBIENT COMMUNICATIONS AND COMPUTER SYSTEMS, RACCCS 2017, 2018, 696 : 427 - 436
  • [50] Comparison of machine learning algorithms on data from the nuclear industry
    Remy, E.
    Dautreme, E.
    Talon, C.
    Dirat, Y.
    Le Strat, C. Dinse
    [J]. SAFETY AND RELIABILITY - SAFE SOCIETIES IN A CHANGING WORLD, 2018, : 825 - 832