An Efficient Distributed Algorithm for Big Data Processing

被引:8
|
作者
Al-kahtani, Mohammed S. [1 ]
Karim, Lutful [2 ]
机构
[1] Prince Sattam Bin Abdulaziz Univ, Dept Comp Engn, Al Kharj, Saudi Arabia
[2] Seneca Coll Appl Arts & Technol, Sch ICT, Toronto, ON, Canada
关键词
Big data; Distributed algorithms; MapReduce; DBMS; Sensor; Commodity hardware; CONVEX;
D O I
10.1007/s13369-016-2405-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper introduces an efficient distributed data analysis framework for big data which comprises data processing at the data collecting nodes and the central server end as opposed to the existing framework that only comprises data processing at the central server end. As data are being processed at the data collecting end in the proposed framework, the amount of data is reduced to be processed at the server side by the commodity computers. The proposed distributed algorithm works both in low-powered nodes such as sensors and high-speed commodity computers and also performs sequential and parallel processing based on the amount of data received at the central server. Simulation results demonstrate that the proposed distributed algorithm outperforms traditional distributed algorithms in terms of the size of data to be processed at the central server and data processing time.
引用
收藏
页码:3149 / 3157
页数:9
相关论文
共 50 条
  • [1] An Efficient Distributed Algorithm for Big Data Processing
    Mohammed S. Al-kahtani
    Lutful Karim
    [J]. Arabian Journal for Science and Engineering, 2017, 42 : 3149 - 3157
  • [2] An Efficient Distributed Database Clustering Algorithm for Big Data Processing
    Sun, Qiao
    Fu, Lan-mei
    Deng, Bu-qiao
    Pei, Xu-bin
    Sun, Jia-song
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND COMMUNICATIONS (ICCSC 2017), 2017, : 70 - 74
  • [3] Efficient Distributed Database Clustering Algorithm for Big Data Processing
    Li, Liantian
    [J]. 2021 6TH INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2021), 2021, : 495 - 498
  • [4] An efficient distributed range query processing algorithm on LiDAR data
    Chung, Yu Chi
    Su, I-Fang
    Lee, Chiang
    Gu, Gary
    [J]. 2017 10TH INTERNATIONAL CONFERENCE ON UBI-MEDIA COMPUTING AND WORKSHOPS (UBI-MEDIA), 2017, : 68 - 73
  • [5] Efficient and Customizable Data Partitioning Framework for Distributed Big RDF Data Processing in the Cloud
    Lee, Kisung
    Liu, Ling
    Tang, Yuzhe
    Zhang, Qi
    Zhou, Yang
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 327 - 334
  • [6] An Efficient Approach for Storage of Big Data Streams in Distributed Stream Processing Systems
    Alshamrani, Sultan
    Waseem, Quadri
    Alharbi, Abdullah
    Alosaimi, Wael
    Turabieh, Hamza
    Alyami, Hashem
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 91 - 98
  • [7] A Distributed Rough Set Theory based Algorithm for an Efficient Big Data Pre-processing under the Spark Framework
    Dagdia, Zaineb Chelly
    Zarges, Christine
    Beck, Gael
    Lebbah, Mustapha
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 911 - 916
  • [8] A Distributed Rough Set Theory Algorithm based on Locality Sensitive Hashing for an Efficient Big Data Pre-processing
    Dagdia, Zaineb Chelly
    Zarges, Christine
    Beck, Gael
    Azzag, Hanene
    Lebbah, Mustapha
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2597 - 2606
  • [9] An efficient algorithm for distributed density-based outlier detection on big data
    Bai, Mei
    Wang, Xite
    Xin, Junchang
    Wang, Guoren
    [J]. NEUROCOMPUTING, 2016, 181 : 19 - 28
  • [10] An Efficient and Scalable Algorithm to Mine Functional Dependencies from Distributed Big Data
    Wu, Wanqing
    Mao, Wenyu
    [J]. SENSORS, 2022, 22 (10)