MapReduce Parallel Programming Model: A State-of-the-Art Survey

被引:41
|
作者
Li, Ren [1 ]
Hu, Haibo [2 ]
Li, Heng [2 ]
Wu, Yunsong [3 ]
Yang, Jianxi [1 ]
机构
[1] Chongqing Jiaotong Univ, Coll Informat Sci & Engn, Chongqing, Peoples R China
[2] Chongqing Univ, Sch Software Engn, Chongqing 630044, Peoples R China
[3] Chongqing Univ, Coll Comp Sci, Chongqing 630044, Peoples R China
关键词
MapReduce; Hadoop; Cloud computing; Big data; Scalability; FRAMEWORK; WEB; SIMULATOR; JOBS;
D O I
10.1007/s10766-015-0395-0
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the development of information technologies, we have entered the era of Big Data. Google's MapReduce programming model and its open-source implementation in Apache Hadoop have become the dominant model for data-intensive processing because of its simplicity, scalability, and fault tolerance. However, several inherent limitations, such as lack of efficient scheduling and iteration computing mechanisms, seriously affect the efficiency and flexibility of MapReduce. To date, various approaches have been proposed to extend MapReduce model and improve runtime efficiency for different scenarios. In this review, we assess MapReduce to help researchers better understand these novel optimizations that have been taken to address its limitations. We first present the basic idea underlying MapReduce paradigm and describe several widely used open-source runtime systems. And then we discuss the main shortcomings of original MapReduce. We also review these MapReduce optimization approaches that have recently been put forward, and categorize them according to the characteristics and capabilities. Finally, we conclude the paper and suggest several research works that should be carried out in the future.
引用
收藏
页码:832 / 866
页数:35
相关论文
共 50 条
  • [11] PARALLEL MANIPULATORS - STATE-OF-THE-ART AND PERSPECTIVES
    MERLET, JP
    [J]. ADVANCED ROBOTICS, 1994, 8 (06) : 589 - 596
  • [12] Calculational Parallel Programming (Parallel Programming with Homomorphism and MapReduce)
    Hu, Zhenjiang
    [J]. HLPP 2010: PROCEEDINGS OF THE FOURTH INTERNATIONAL WORKSHOP ON HIGH-LEVEL PARALLEL PROGRAMMING AND APPLICATIONS, 2010, : 1 - 1
  • [13] Survey on Parallel Programming Model
    Kasim, Henry
    March, Verdi
    Zhang, Rita
    See, Simon
    [J]. Network and Parallel Computing, 2008, 5245 : 266 - 275
  • [14] Safety culture: a survey of the state-of-the-art
    Sorensen, JN
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2002, 76 (02) : 189 - 204
  • [15] ELECTRONICS RELIABILITY - A STATE-OF-THE-ART SURVEY
    BLANKS, HS
    [J]. MICROELECTRONICS RELIABILITY, 1980, 20 (03) : 219 - 245
  • [16] INFLUENCE ANALYSIS: A SURVEY OF THE STATE-OF-THE-ART
    Han, Meng
    Li, Yingshu
    [J]. MATHEMATICAL FOUNDATIONS OF COMPUTING, 2018, 1 (03): : 201 - 253
  • [17] Virtual reality: A state-of-the-art survey
    Zhou N.-N.
    Deng Y.-L.
    [J]. International Journal of Automation and Computing, 2009, 6 (04) : 319 - 325
  • [18] Differential Evolution: A Survey of the State-of-the-Art
    Das, Swagatam
    Suganthan, Ponnuthurai Nagaratnam
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2011, 15 (01) : 4 - 31
  • [19] A Brief Survey of State-of-the-Art BioSAXS
    Bizien, Thomas
    Durand, Dominique
    Roblin, Pierre
    Thureau, Aurelien
    Vachette, Patrice
    Perez, Javier
    [J]. PROTEIN AND PEPTIDE LETTERS, 2016, 23 (03): : 217 - 231
  • [20] State-of-the-Art Survey on Edge Intelligence
    Zhang X.
    Zhang C.
    Zhao J.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (12): : 2749 - 2769