MapReduce Parallel Programming Model: A State-of-the-Art Survey

被引:0
|
作者
Ren Li
Haibo Hu
Heng Li
Yunsong Wu
Jianxi Yang
机构
[1] Chongqing Jiaotong University,College of Information Science and Engineering
[2] Chongqing University,School of Software Engineering
[3] Chongqing University,College of Computer Science
关键词
MapReduce; Hadoop; Cloud computing; Big data; Scalability;
D O I
暂无
中图分类号
学科分类号
摘要
With the development of information technologies, we have entered the era of Big Data. Google’s MapReduce programming model and its open-source implementation in Apache Hadoop have become the dominant model for data-intensive processing because of its simplicity, scalability, and fault tolerance. However, several inherent limitations, such as lack of efficient scheduling and iteration computing mechanisms, seriously affect the efficiency and flexibility of MapReduce. To date, various approaches have been proposed to extend MapReduce model and improve runtime efficiency for different scenarios. In this review, we assess MapReduce to help researchers better understand these novel optimizations that have been taken to address its limitations. We first present the basic idea underlying MapReduce paradigm and describe several widely used open-source runtime systems. And then we discuss the main shortcomings of original MapReduce. We also review these MapReduce optimization approaches that have recently been put forward, and categorize them according to the characteristics and capabilities. Finally, we conclude the paper and suggest several research works that should be carried out in the future.
引用
收藏
页码:832 / 866
页数:34
相关论文
共 50 条
  • [1] MapReduce Parallel Programming Model: A State-of-the-Art Survey
    Li, Ren
    Hu, Haibo
    Li, Heng
    Wu, Yunsong
    Yang, Jianxi
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2016, 44 (04) : 832 - 866
  • [2] REDUNDANCY IN MATHEMATICAL-PROGRAMMING - A STATE-OF-THE-ART SURVEY
    KARWAN, MH
    LOTFI, V
    TELGEN, J
    ZIONTS, S
    [J]. LECTURE NOTES IN ECONOMICS AND MATHEMATICAL SYSTEMS, 1983, 206 : 1 - 285
  • [3] Model Transformation Generation A Survey of the State-of-the-Art
    Berramla, Karima
    Deba, El Abbassia
    Benhamamouch, Djilali
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR ORGANIZATIONS DEVELOPMENT (IT4OD), 2016,
  • [4] Verification of Model Transformations A Survey of the State-of-the-Art
    Calegari, Daniel
    Szasz, Nora
    [J]. ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2013, 292 : 5 - 25
  • [5] How Heterogeneity Affects the Design of Hadoop MapReduce Schedulers: A State-of-the-Art Survey and Challenges
    Pandey, Vaibhav
    Saini, Poonam
    [J]. BIG DATA, 2018, 6 (02) : 72 - 95
  • [6] A Semantic++ MapReduce Parallel Programming Model
    Zhang, Guigang
    Li, Chao
    Zhang, Yong
    Xing, Chunxiao
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2014, 8 (03) : 279 - 299
  • [7] AutoML: A survey of the state-of-the-art
    He, Xin
    Zhao, Kaiyong
    Chu, Xiaowen
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 212
  • [8] Evolution and state-of-the-art in integer programming
    Sherali, HD
    Driscoll, PJ
    [J]. JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2000, 124 (1-2) : 319 - 340
  • [9] THE STATE-OF-THE-ART IN PARALLEL PRODUCTION SYSTEMS
    KUO, S
    MOLDOVAN, D
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1992, 15 (01) : 1 - 26
  • [10] STATE-OF-THE-ART IN PARALLEL NONLINEAR OPTIMIZATION
    LOOTSMA, FA
    RAGSDELL, KM
    [J]. PARALLEL COMPUTING, 1988, 6 (02) : 133 - 155