Challenges for MapReduce in Big Data

被引:78
|
作者
Grolinger, Katarina [1 ]
Hayes, Michael [1 ]
Higashino, Wilson A. [1 ,2 ]
L'Heureux, Alexandra [1 ]
Allison, David S. [1 ,3 ,4 ,5 ]
Capretz, Miriam A. M. [1 ]
机构
[1] Univ Western Ontario, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada
[2] Univ Estadual Campinas, Inst Comp, Campinas, SP, Brazil
[3] CNRS, LAAS, F-31400 Toulouse, France
[4] Univ Toulouse, LAAS, F-31400 Toulouse, France
[5] Univ Toulouse, LAAS, UT1 Capitole, F-31000 Toulouse, France
关键词
Big Data; Big Data Analytics; MapReduce; NoSQL; Machine Learning; Interactive Analytics; Online Processing; Privacy; Security;
D O I
10.1109/SERVICES.2014.41
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the Big Data community, MapReduce has been seen as one of the key enabling approaches for meeting continuously increasing demands on computing resources imposed by massive data sets. The reason for this is the high scalability of the MapReduce paradigm which allows for massively parallel and distributed execution over a large number of computing nodes. This paper identifies MapReduce issues and challenges in handling Big Data with the objective of providing an overview of the field, facilitating better planning and management of Big Data projects, and identifying opportunities for future research in this field. The identified challenges are grouped into four main categories corresponding to Big Data tasks types: data storage (relational databases and NoSQL stores), Big Data analytics (machine learning and interactive analytics), online processing, and security and privacy. Moreover, current efforts aimed at improving and extending MapReduce to address identified challenges are presented. Consequently, by identifying issues and challenges MapReduce faces when handling Big Data, this study encourages future Big Data research.
引用
收藏
页码:182 / 189
页数:8
相关论文
共 50 条
  • [1] Issues and Challenges of Heterogeneous Datasets in MapReduce Framework of Big Data Environment
    Gupta, Saraswati
    Bhatnagar, Vishal
    Chadha, Ramneet Singh
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ICT FOR SUSTAINABLE DEVELOPMENT ICT4SD 2015, VOL 2, 2016, 409 : 751 - 759
  • [2] MapReduce Clustering for Big Data
    Ghattas, Badih
    Pinto, Antoine
    Diao, Sambou
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5116 - 5124
  • [3] MapReduce: Simplified Data Analysis of Big Data
    Maitrey, Seema
    Jha, C. K.
    [J]. 3RD INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTING 2015 (ICRTC-2015), 2015, 57 : 563 - 571
  • [4] Analysis of the Big Data based on MapReduce
    Tian, Zi-de
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 224 - 228
  • [5] MapReduce Algorithms for Big Data Analysis
    Shim, Kyuseok
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (12): : 2016 - 2017
  • [6] Prominence of MapReduce in BIG DATA Processing
    Pandey, Shweta
    Tokekar, Vrinda
    [J]. 2014 FOURTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT), 2014, : 555 - 560
  • [7] MapReduce Research on Warehousing of Big Data
    Pticek, M.
    Vrdoljak, B.
    [J]. 2017 40TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2017, : 1361 - 1366
  • [8] MapReduce Algorithms for Big Data Analysis
    Shim, Kyuseok
    [J]. DATABASES THEORY AND APPLICATIONS, ADC 2018, 2018, 10837 : XV - XV
  • [9] MapReduce-Based D_ELT Framework to Address the Challenges of Geospatial Big Data
    Jo, Junghee
    Lee, Kang-Woo
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (11)
  • [10] A Mapreduce Fuzzy Techniques of Big Data Classification
    El Bakry, Malak
    Safwat, Soha
    Hegazy, Osman
    [J]. PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 118 - 128