A Distributed Framework for Predictive Analytics Using Big Data and MapReduce Parallel Programming

被引:0
|
作者
Natesan, P. [1 ]
Sathishkumar, V.E. [2 ]
Mathivanan, Sandeep Kumar [3 ]
Venkatasen, Maheshwari [3 ]
Jayagopal, Prabhu [3 ]
Allayear, Shaikh Muhammad [4 ]
机构
[1] Department of Computer Science and Engineering, Kongu Engineering College, Perundurai, Tamilnadu, Erode,638060, India
[2] Department of Industrial Engineering, Hanyang University, Seoul, Korea, Republic of
[3] School of Information Technology and Engineering, Vellore Institute of Technology, TamilNadu, Vellore,632014, India
[4] Department of Multimedia and Creative Technology, Daffodil International University, Daffodil Smart City, Khagan, Ashulia, Dhaka, Bangladesh
关键词
Fault tolerance - Large dataset - Linear regression - Open source software - Predictive analytics;
D O I
10.1155/2023/6048891
中图分类号
学科分类号
摘要
With the advancement of Internet technologies and the rapid increase of World Wide Web applications, there has been tremendous growth in the volume of digital data. This takes the digital world into a new era of big data. Various existing data processing technologies are not consistent and scalable in handling the complexity as well as the large-size datasets. Recently, there are many distributed data processing, and programming models have been proposed and implemented to handle big data applications. The open-source-implemented MapReduce programming model in Apache Hadoop is the foremost model for data exhaustive and also computational-intensive applications due to its inherent characteristics of scalability, fault tolerance, and simplicity. In this research article, a new approach for the prediction of target labels in big data applications is developed using a multiple linear regression algorithm and MapReduce programming model, named as MR-MLR. This approach promises optimum values for MAE, RMSE, and determination coefficient (R2) and thus shows its effectiveness in predictions in big data applications. © 2023 P. Natesan et al.
引用
收藏
相关论文
共 50 条
  • [1] Big Data Analytics Framework for Predictive Analytics using Public Data with Privacy Preserving
    Ho, Duy H.
    Lee, Yugyung
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5395 - 5405
  • [2] Big data analytics for retail industry using MapReduce-Apriori framework
    Verma, Neha
    Malhotra, Dheeraj
    Singh, Jatinder
    [J]. JOURNAL OF MANAGEMENT ANALYTICS, 2020, 7 (03) : 424 - 442
  • [3] Protagonist of Big Data and Predictive Analytics using data analytics
    Subbalakshmi, Sakineti
    Prabhu, C. S. R.
    [J]. PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES, ELECTRONICS AND MECHANICAL SYSTEMS (CTEMS), 2018, : 276 - 279
  • [4] Time series decomposition and predictive analytics using MapReduce framework
    Bendre, Mininath
    Manthalkar, Ramchandra
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 116 : 108 - 120
  • [5] Weather forecasting using parallel and distributed analytics approaches on big data clouds
    Alam, Mahboob
    Amjad, Mohd
    [J]. JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2019, 22 (04): : 791 - 799
  • [6] Rainfall forecasting using parallel and distributed analytics approaches on big data clouds
    Alam, Mahboob
    Amjad, Mohd
    [J]. JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2019, 22 (04): : 687 - 695
  • [7] Privacy Preserving Parallel Clustering Based Anonymization for Big Data Using MapReduce Framework
    Lawrance, Josephine Usha
    Jesudhasan, Jesu Vedha Nayahi
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2021, 35 (15) : 1587 - 1620
  • [8] Using Semantics in Predictive Big Data Analytics
    Nural, Mustafa V.
    Cotterell, Michael E.
    Miller, John A.
    [J]. 2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 254 - 261
  • [9] Big data mining with parallel computing: A comparison of distributed and MapReduce methodologies
    Tsai, Chih-Fong
    Lin, Wei-Chao
    Ke, Shih-Wen
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 122 : 83 - 92
  • [10] An intelligent approach to Big Data analytics for sustainable retail environment using Apriori-MapReduce framework
    Verma, Neha
    Singh, Jatinder
    [J]. INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2017, 117 (07) : 1503 - 1520