Challenges for MapReduce in Big Data

被引:78
|
作者
Grolinger, Katarina [1 ]
Hayes, Michael [1 ]
Higashino, Wilson A. [1 ,2 ]
L'Heureux, Alexandra [1 ]
Allison, David S. [1 ,3 ,4 ,5 ]
Capretz, Miriam A. M. [1 ]
机构
[1] Univ Western Ontario, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada
[2] Univ Estadual Campinas, Inst Comp, Campinas, SP, Brazil
[3] CNRS, LAAS, F-31400 Toulouse, France
[4] Univ Toulouse, LAAS, F-31400 Toulouse, France
[5] Univ Toulouse, LAAS, UT1 Capitole, F-31000 Toulouse, France
关键词
Big Data; Big Data Analytics; MapReduce; NoSQL; Machine Learning; Interactive Analytics; Online Processing; Privacy; Security;
D O I
10.1109/SERVICES.2014.41
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the Big Data community, MapReduce has been seen as one of the key enabling approaches for meeting continuously increasing demands on computing resources imposed by massive data sets. The reason for this is the high scalability of the MapReduce paradigm which allows for massively parallel and distributed execution over a large number of computing nodes. This paper identifies MapReduce issues and challenges in handling Big Data with the objective of providing an overview of the field, facilitating better planning and management of Big Data projects, and identifying opportunities for future research in this field. The identified challenges are grouped into four main categories corresponding to Big Data tasks types: data storage (relational databases and NoSQL stores), Big Data analytics (machine learning and interactive analytics), online processing, and security and privacy. Moreover, current efforts aimed at improving and extending MapReduce to address identified challenges are presented. Consequently, by identifying issues and challenges MapReduce faces when handling Big Data, this study encourages future Big Data research.
引用
收藏
页码:182 / 189
页数:8
相关论文
共 50 条
  • [31] Big data clustering with varied density based on MapReduce
    Safanaz Heidari
    Mahmood Alborzi
    Reza Radfar
    Mohammad Ali Afsharkazemi
    Ali Rajabzadeh Ghatari
    Journal of Big Data, 6
  • [32] Big Data-Big Challenges
    White, Martin
    ECONTENT, 2011, 34 (09) : 21 - 21
  • [33] Improving Network Traffic in MapReduce for Big Data Applications
    Gawande, Priya
    Shaikh, Nuzhaft
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 2979 - 2983
  • [34] MapReduce based Method for Big Data Semantic Clustering
    Yang, Jie
    Li, Xiaoping
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 2814 - 2819
  • [35] Big data classification with optimization driven MapReduce framework
    Mohammed, Mujeeb Shaik
    Rachapudy, Praveen Sam
    Kasa, Madhavi
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2021, 25 (02) : 173 - 183
  • [36] A MapReduce-Based ELM for Regression in Big Data
    Wu, B.
    Yan, T. H.
    Xu, X. S.
    He, B.
    Li, W. H.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2016, 2016, 9937 : 164 - 173
  • [37] Big Data Challenges in Big Science
    Andreas Heiss
    Computing and Software for Big Science, 2019, 3 (1)
  • [38] Toward Conceptual MapReduce Algorithm for Big Data Platfrom
    Sohn, Seungdae
    Kim, Jinhong
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 985 - 988
  • [39] Secure Scalar Product for Big-Data in MapReduce
    Liu, Fang
    Ng, Wee Keong
    Zhang, Wei
    2015 IEEE FIRST INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2015), 2015, : 120 - 129
  • [40] Investigation and Characterization of MapReduce Applications for Big Data Analytics
    Li, Y.
    Lam, T. B. V.
    Do, T. V. Van
    Chakka, R.
    Rotter, C.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2018, 77 (09): : 493 - 498