Challenges for MapReduce in Big Data

被引:78
|
作者
Grolinger, Katarina [1 ]
Hayes, Michael [1 ]
Higashino, Wilson A. [1 ,2 ]
L'Heureux, Alexandra [1 ]
Allison, David S. [1 ,3 ,4 ,5 ]
Capretz, Miriam A. M. [1 ]
机构
[1] Univ Western Ontario, Dept Elect & Comp Engn, London, ON N6A 5B9, Canada
[2] Univ Estadual Campinas, Inst Comp, Campinas, SP, Brazil
[3] CNRS, LAAS, F-31400 Toulouse, France
[4] Univ Toulouse, LAAS, F-31400 Toulouse, France
[5] Univ Toulouse, LAAS, UT1 Capitole, F-31000 Toulouse, France
关键词
Big Data; Big Data Analytics; MapReduce; NoSQL; Machine Learning; Interactive Analytics; Online Processing; Privacy; Security;
D O I
10.1109/SERVICES.2014.41
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the Big Data community, MapReduce has been seen as one of the key enabling approaches for meeting continuously increasing demands on computing resources imposed by massive data sets. The reason for this is the high scalability of the MapReduce paradigm which allows for massively parallel and distributed execution over a large number of computing nodes. This paper identifies MapReduce issues and challenges in handling Big Data with the objective of providing an overview of the field, facilitating better planning and management of Big Data projects, and identifying opportunities for future research in this field. The identified challenges are grouped into four main categories corresponding to Big Data tasks types: data storage (relational databases and NoSQL stores), Big Data analytics (machine learning and interactive analytics), online processing, and security and privacy. Moreover, current efforts aimed at improving and extending MapReduce to address identified challenges are presented. Consequently, by identifying issues and challenges MapReduce faces when handling Big Data, this study encourages future Big Data research.
引用
下载
收藏
页码:182 / 189
页数:8
相关论文
共 50 条
  • [21] THE BIG CHALLENGES OF BIG DATA
    Marx, Vivien
    NATURE, 2013, 498 (7453) : 255 - 260
  • [22] The big challenges of big data
    Vivien Marx
    Nature, 2013, 498 : 255 - 260
  • [23] i2MapReduce: Incremental MapReduce for Mining Evolving Big Data
    Zhang, Yanfeng
    Chen, Shimin
    Wang, Qiang
    Yu, Ge
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1482 - 1483
  • [24] i2MapReduce: Incremental MapReduce for Mining Evolving Big Data
    Zhang, Yanfeng
    Chen, Shimin
    Wang, Qiang
    Yu, Ge
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (07) : 1906 - 1919
  • [25] Atrak: a MapReduce-based data warehouse for big data
    Barkhordari, Mohammadhossein
    Niamanesh, Mahdi
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (10): : 4596 - 4610
  • [26] Atrak: a MapReduce-based data warehouse for big data
    Mohammadhossein Barkhordari
    Mahdi Niamanesh
    The Journal of Supercomputing, 2017, 73 : 4596 - 4610
  • [27] The challenges of big data
    Mardis, Elaine R.
    DISEASE MODELS & MECHANISMS, 2016, 9 (05) : 483 - 485
  • [28] Big Data Analysis Solutions using MapReduce Framework
    Elagib, Sara B.
    Najeeb, Atahur Rahman
    Hashim, Aisha H.
    Olanrewaju, Rashidah F.
    2014 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE), 2014, : 127 - 130
  • [29] Matrix Multiplication of Big Data Using MapReduce: A Review
    Qasem, Mais Haj
    Abu Sarhan, Alaa
    Qaddoura, Raneem
    Mahafzah, Basel A.
    PROCEEDINGS OF 2017 2ND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF INFORMATION TECHNOLOGY IN DEVELOPING RENEWABLE ENERGY PROCESSES & SYSTEMS (IT-DREPS 2017), 2017,
  • [30] The challenges of big data
    Taylor P.
    ITNOW, 2020, 62 (03) : 56 - 57