Using Scalable Data Mining for Predicting Flight Delays

被引:62
|
作者
Belcastro, Loris [1 ]
Marozzo, Fabrizio [1 ]
Talia, Domenico [1 ]
Trunfio, Paolo [1 ]
机构
[1] Univ Calabria, DIMES, Arcavacata Di Rende, CS, Italy
关键词
Design; Algorithms; Performance; Cloud computing; big data; flight delay; scalability; open data; PROPAGATION;
D O I
10.1145/2888402
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Flight delays are frequent all over the world (about 20% of airline flights arrive more than 15min late) and they are estimated to have an annual cost of billions of dollars. This scenario makes the prediction of flight delays a primary issue for airlines and travelers. The main goal of this work is to implement a predictor of the arrival delay of a scheduled flight due to weather conditions. The predicted arrival delay takes into consideration both flight information (origin airport, destination airport, scheduled departure and arrival time) and weather conditions at origin airport and destination airport according to the flight timetable. Airline flight and weather observation datasets have been analyzed and mined using parallel algorithms implemented as MapReduce programs executed on a Cloud platform. The results show a high accuracy in predicting delays above a given threshold. For instance, with a delay threshold of 15min, we achieve an accuracy of 74.2% and 71.8% recall on delayed flights, while with a threshold of 60min, the accuracy is 85.8% and the delay recall is 86.9%. Furthermore, the experimental results demonstrate the predictor scalability that can be achieved performing data preparation and mining tasks as MapReduce applications on the Cloud.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Predicting the the operational acceptance of airborne flight reroute requests using data mining
    Evans, Antony D.
    Lee, Paul
    Sridhar, Banavar
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 96 : 270 - 289
  • [2] A system for effectively predicting flight delays based on IoT data
    Abdulwahab Aljubairy
    Wei Emma Zhang
    Ali Shemshadi
    Adnan Mahmood
    Quan Z. Sheng
    [J]. Computing, 2020, 102 : 2025 - 2048
  • [3] A system for effectively predicting flight delays based on IoT data
    Aljubairy, Abdulwahab
    Zhang, Wei Emma
    Shemshadi, Ali
    Mahmood, Adnan
    Sheng, Quan Z.
    [J]. COMPUTING, 2020, 102 (09) : 2025 - 2048
  • [4] Scalable management and data mining using astrolabe
    van Renesse, R
    Birman, K
    Dumitriu, D
    Vogels, W
    [J]. PEER-TO-PEER SYSTEMS, 2002, 2429 : 280 - 294
  • [5] Predicting Career Using Data Mining
    Arafath, Md. Yeasin
    Saifuzzaman, Mohd.
    Ahmed, Sumaiya
    Hossain, Syed Akhter
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON), 2018, : 889 - 894
  • [6] Predicting IT Employability Using Data Mining Techniques
    Piad, Keno C.
    Dumlao, Menchita
    Ballera, Melvin A.
    Ambat, Shaneth C.
    [J]. 2016 THIRD INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING, DATA MINING, AND WIRELESS COMMUNICATIONS (DIPDMWC), 2016, : 26 - 30
  • [7] Flight Crash Investigation Using Data Mining Techniques
    Sharma, Shagun
    Sabitha, A. Sai
    [J]. 2016 1ST INDIA INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (IICIP), 2016,
  • [8] Using causal machine learning for predicting the risk of flight delays in air transportation
    Truong, Dothang
    [J]. JOURNAL OF AIR TRANSPORT MANAGEMENT, 2021, 91
  • [9] Scalable Mining of Big Data
    Leung, Carson K.
    Pazdor, Adam G. M.
    Zheng, Hao
    [J]. 2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 240 - 247
  • [10] Automatic Processing Techniques of Rotorcraft Flight Data Using Data Mining
    Oh, Hyeju
    Jo, Sungbeom
    Choi, Keeyoung
    Roh, Eun-Jung
    Kang, Byung-Ryong
    [J]. JOURNAL OF THE KOREAN SOCIETY FOR AERONAUTICAL AND SPACE SCIENCES, 2018, 46 (10) : 823 - 832