Feature Engineering Algorithms for Traffic Dataset

被引:0
|
作者
Abdullah, Akibu Mahmoud [1 ]
Usmani, Raja Sher Afgun [1 ]
Pillai, Thulasyammal Ramiah [1 ]
Hashem, Ibrahim Abaker Targio [2 ]
Marjani, Mohsen [1 ]
机构
[1] Taylors Univ, Sch Comp Sci & Engn, Subang Jaya, Selangor, Malaysia
[2] Univ Sharjah, Dept Comp Sci, Coll Comp & Informat, Sharjah 27272, U Arab Emirates
关键词
Feature engineering algorithm; queuing theory; Road Traffic Volume Malaysia (RTVM); machine learning algorithms; RURAL MOUNTAINOUS HIGHWAYS; VEHICLE; CRASHES;
D O I
10.14569/IJACSA.2021.0120435
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As a result of an increase in the human population globally, traffic congestion in the urban area is becoming worse, which leads to time-consuming, waste of fuel, and, most importantly, the emission of pollutants. Therefore, there is a need to monitor and estimate traffic density. The emergence of an automatic traffic management system allows us to record and monitor motor vehicles' movement in a road segment. One of the challenges researchers face is when the historical traffic data is given as an annual average that contains incomplete data. The annual average daily traffic (AADT) is an average number of traffic volumes at the roadway segment in a specific location over a year. An example of AADT data is the one given by Road Traffic Volume Malaysia (RTVM), and this data is incomplete. The RTVM provides an average of daily traffic data and one peak hour. The recorded traffic data is for sixteen hours, and the only hourly data given is one hour, from 8.00 am to 9.00 am. Hence there is a need to estimate hourly traffic volume for the remaining hours. Feature engineering can be used to overcome the issue of incomplete data. This paper proposed feature engineering algorithms that can efficiently estimate hourly traffic volume and generate features from the existing dataset for all traffic census stations in Malaysia using queuing theory. The proposed feature engineering algorithms were able to estimate the hourly traffic volume and generate features for three years in Jalan Kepong census station, Kuala Lumpur, Malaysia. The algorithms were evaluated using the Random Forest model and Decision Tree Models. The result shows that our feature engineering algorithms improve machine learning algorithms' performance except for the prediction of NO2 using Random Forest, which shows the highest MAE, MSE, and RMSE when traffic data was included for prediction. The algorithm is applied in one of the traffic census stations in Kuala Lumpur, and it can be used for the other stations in Malaysia. Additionally, the algorithm can also be used for any annual average daily traffic data if it includes average hourly data.
引用
收藏
页码:261 / 268
页数:8
相关论文
共 50 条
  • [1] Employing feature engineering strategies to improve the performance of machine learning algorithms on echocardiogram dataset
    Huang, Huang-Nan
    Chen, Hong-Ming
    Lin, Wei-Wen
    Huang, Chau-Jian
    Chen, Yung-Cheng
    Wang, Yu-Huei
    Yang, Chao-Tung
    [J]. DIGITAL HEALTH, 2023, 9
  • [2] Dataset Evolver: An Interactive Feature Engineering Notebook
    Nargesian, Fatemeh
    Khurana, Udayan
    Pedapati, Tejaswini
    Samulowitz, Horst
    Turaga, Deepak
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8212 - 8213
  • [3] Analysis of Feature Selection Techniques for Network Traffic Dataset
    Singh, Raman
    Kumar, Harish
    Singla, R. K.
    [J]. 2013 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND RESEARCH ADVANCEMENT (ICMIRA 2013), 2013, : 42 - 46
  • [4] Flow allocation algorithms for traffic engineering
    Kato, M
    Hida, H
    Kawahara, K
    Oie, Y
    [J]. INFORMATION NETWORKING: NETWORKING TECHNOLOGIES FOR ENHANCED INTERNET SERVICES, 2003, 2662 : 978 - 988
  • [5] Traffic Engineering in SDN with Cultural Algorithms
    Monteiro, Thyago de Amorim
    de Albuquerque, Edson Queiroz
    Balieiro, Andson M.
    [J]. 2018 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2018,
  • [6] A Comprehensive Feature Engineering Approach for Breast Cancer Dataset
    Sharma, Shambhvi
    Sahni, Monica
    [J]. EAI Endorsed Transactions on Pervasive Health and Technology, 2024, 10
  • [7] Automatic recommendation of feature selection algorithms based on dataset characteristics
    Sabino Parmezan, Antonio Rafael
    Lee, Huei Diana
    Spolaor, Newton
    Wu, Feng Chung
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 185
  • [8] Load Balancing Algorithms in MPLS Traffic Engineering
    Long, KP
    Zhang, ZS
    Cheng, SD
    [J]. 2001 IEEE WORKSHOP ON HIGH PERFORMANCE SWITCHING AND ROUTING, 2001, : 175 - 179
  • [9] Revealing the Optimality Gap for Traffic Engineering Algorithms
    Liu, Huan
    [J]. 2008 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (HPSR), 2008, : 272 - 279
  • [10] Feature Selection and Interpretable Feature Transformation: A Preliminary Study on Feature Engineering for Classification Algorithms
    Tallon-Ballesteros, Antonio J.
    Tuba, Milan
    Xue, Bing
    Hashimoto, Takako
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING (IDEAL 2018), PT II, 2018, 11315 : 280 - 287