Survey of Distributed Computing Frameworks for Supporting Big Data Analysis

被引:9
|
作者
Sun, Xudong [1 ]
He, Yulin [1 ,2 ]
Wu, Dingming [1 ]
Huang, Joshua Zhexue [1 ,2 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
基金
中国国家自然科学基金;
关键词
Analytical models; Costs; Computational modeling; Clustering algorithms; Distributed databases; Big Data; Programming; distributed computing frameworks; big data analysis; approximate computing; MapReduce computing model; MAP-REDUCE; MAPREDUCE; PERFORMANCE; MANAGEMENT; HADOOP; TAXONOMY; SYSTEMS;
D O I
10.26599/BDMA.2022.9020014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distributed computing frameworks are the fundamental component of distributed computing systems. They provide an essential way to support the efficient processing of big data on clusters or cloud. The size of big data increases at a pace that is faster than the increase in the big data processing capacity of clusters. Thus, distributed computing frameworks based on the MapReduce computing model are not adequate to support big data analysis tasks which often require running complex analytical algorithms on extremely big data sets in terabytes. In performing such tasks, these frameworks face three challenges: computational inefficiency due to high I/O and communication costs, non-scalability to big data due to memory limit, and limited analytical algorithms because many serial algorithms cannot be implemented in the MapReduce programming model. New distributed computing frameworks need to be developed to conquer these challenges. In this paper, we review MapReduce-type distributed computing frameworks that are currently used in handling big data and discuss their problems when conducting big data analysis. In addition, we present a non-MapReduce distributed computing framework that has the potential to overcome big data analysis challenges.
引用
收藏
页码:154 / 169
页数:16
相关论文
共 50 条
  • [41] Big Data for Internet of Things: A Survey on IoT Frameworks and Platforms
    Atmani, Amine
    Kandrouch, Ibtissame
    Hmina, Nabil
    Chaoui, Habiba
    [J]. ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT, AI2SD'2019, VOL 6: ADVANCED INTELLIGENT SYSTEMS FOR NETWORKS AND SYSTEMS, 2020, 92 : 59 - 67
  • [42] A Distributed Mobile Cloud Computing Model for Secure Big Data
    Sung, Soonhwa
    Youn, Cheong
    Kong, Eunbae
    Ryou, Jaecheol
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2016, : 312 - 316
  • [43] COMPARISON OF DISTRIBUTED GPU COMPUTING FRAMEWORKS FOR SAR RAW DATA SIMULATION
    Yao, Xiaojie
    Zhang, Fan
    Sun, Xiong
    Yin, Qiang
    Li, Wei
    [J]. 2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 5225 - 5228
  • [44] An Analysis on Task Migration Strategy of Big Data Streaming Storm Computing Framework for Distributed Processing
    Hu, Xiling
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2020, 11 (04) : 18 - 35
  • [45] OPC:A Distributed Computing and Memory Computing-based Effective Solution of Big Data
    Yang, Zhi
    Zhang, Chunping
    Hu, Mu
    Lin, Feng
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 50 - 53
  • [46] Public Auditing for Big Data Storage in Cloud Computing -- A Survey
    Liu, Chang
    Ranjan, Rajiv
    Zhang, Xuyun
    Yang, Chi
    Georgakopoulos, Dimitrios
    Chen, Jinjun
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 1128 - 1135
  • [47] Experiences of Converging Big Data Analytics Frameworks with High Performance Computing Systems
    Cheng, Peng
    Lu, Yutong
    Du, Yunfei
    Chen, Zhiguang
    [J]. SUPERCOMPUTING FRONTIERS, SCFA 2018, 2018, 10776 : 90 - 106
  • [48] A SURVEY ON BIG DATA ANALYSIS AND CHALLENGES
    Parikh, Harshil
    Liu, Jiangjiang
    [J]. ICERI2015: 8TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION, 2015, : 4451 - 4460
  • [49] Collective Computing for Scientific Big Data Analysis
    Liu, Jialin
    Chen, Yong
    Byna, Surendra
    [J]. 2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, 2015, : 129 - 137
  • [50] A survey on correlation analysis of big data
    Liang J.-Y.
    Feng C.-J.
    Song P.
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2016, 39 (01): : 1 - 18