A MapReduce-Based Nearest Neighbor Approach for Big-Data-Driven Traffic Flow Prediction

被引:35
|
作者
Xia, Dawen [1 ,2 ]
Li, Huaqing [3 ]
Wang, Binfeng [1 ]
Li, Yantao [1 ]
Zhang, Zili [1 ,4 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing 400715, Peoples R China
[2] Guizhou Minzu Univ, Sch Informat Engn, Guiyang 550025, Peoples R China
[3] Southwest Univ, Sch Elect & Informat Engn, Chongqing 400715, Peoples R China
[4] Deakin Univ, Sch Informat Technol, Geelong, Vic 3220, Australia
来源
IEEE ACCESS | 2016年 / 4卷
基金
中国国家自然科学基金;
关键词
Big data analytics; traffic flow prediction; correlation analysis; parallel classifier; Hadoop MapReduce; TRAVEL-TIME PREDICTION; TRANSPORTATION; NETWORK; FREEWAY; SYSTEMS;
D O I
10.1109/ACCESS.2016.2570021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In big-data-driven traffic flow prediction systems, the robustness of prediction performance depends on accuracy and timeliness. This paper presents a new MapReduce-based nearest neighbor (NN) approach for traffic flow prediction using correlation analysis (TFPC) on a Hadoop platform. In particular, we develop a real-time prediction system including two key modules, i.e., offline distributed training (ODT) and online parallel prediction (OPP). Moreover, we build a parallel k-nearest neighbor optimization classifier, which incorporates correlation information among traffic flows into the classification process. Finally, we propose a novel prediction calculation method, combining the current data observed in OPP and the classification results obtained from large-scale historical data in ODT, to generate traffic flow prediction in real time. The empirical study on real-world traffic flow big data using the leave-one-out cross validation method shows that TFPC significantly outperforms four state-of-the-art prediction approaches, i.e., autoregressive integrated moving average, Naive Bayes, multilayer perceptron neural networks, and NN regression, in terms of accuracy, which can be improved 90.07% in the best case, with an average mean absolute percent error of 5.53%. In addition, it displays excellent speedup, scaleup, and sizeup.
引用
收藏
页码:2920 / 2934
页数:15
相关论文
共 50 条
  • [41] Scaling up MapReduce-based Big Data Processing on Multi-GPU systems
    Hai Jiang
    Yi Chen
    Zhi Qiao
    Tien-Hsiung Weng
    Kuan-Ching Li
    Cluster Computing, 2015, 18 : 369 - 383
  • [42] LandQυ2: A MapReduce-Based System for Processing Arable Land Quality Big Data
    Yao, Xiaochuang
    Mokbel, Mohamed E.
    Ye, Sijing
    Li, Guoqing
    Alarabi, Louai
    Eldawy, Ahmed
    Zhao, Zuliang
    Zhao, Long
    Zhu, Dehai
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2018, 7 (07)
  • [43] K-Nearest Neighbor Model based Short-Term Traffic Flow Prediction Method
    Yang, Lijin
    Yang, Qing
    Li, Yonghua
    Feng, Yuqing
    2019 18TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES 2019), 2019, : 27 - 30
  • [44] Traffic Flow Prediction With Big Data: A Learning Approach Based on SIS-Complex Networks
    Li, Yiming
    Zhao, Luming
    Yu, Zhouyu
    Wang, Songjing
    PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 550 - 554
  • [45] Simulation and modeling traffic flow based on Division K Nearest Neighbor
    Huang, Kun
    Zheng, Jianhu
    MODERN PHYSICS LETTERS B, 2019, 33 (32):
  • [46] Urban Traffic Flow Prediction: A MapReduce Based Parallel Multivariate Linear Regression Approach
    Dai, Liang
    Qin, Wen
    Xu, Hongke
    Chen, Ting
    Qian, Chao
    2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 2823 - 2827
  • [47] Big-data-driven Based Intelligent Prognostics Scheme in Industry 4.0 Environment
    Yan, Jihong
    Meng, Yue
    Lu, Lei
    Guo, Chaozhong
    2017 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-HARBIN), 2017, : 1234 - 1238
  • [48] WKNN-FDCNN method for big data driven traffic flow prediction in ITS
    Ravikant Soni
    Partha Roy
    Kapil Kumar Nagwanshi
    Multimedia Tools and Applications, 2024, 83 : 25261 - 25286
  • [49] WKNN-FDCNN method for big data driven traffic flow prediction in ITS
    Soni, Ravikant
    Roy, Partha
    Nagwanshi, Kapil Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 25261 - 25286
  • [50] Big data-driven machine learning-enabled traffic flow prediction
    Kong, Fanhui
    Li, Jian
    Jiang, Bin
    Zhang, Tianyuan
    Song, Houbing
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2019, 30 (09)