MREv: an Automatic MapReduce Evaluation Tool for Big Data Workloads

被引:6
|
作者
Veiga, Jorge [1 ]
Exposito, Roberto R. [1 ]
Taboada, Guillermo L. [1 ]
Tourino, Juan [1 ]
机构
[1] Univ A Coruna, Comp Architecture Grp, La Coruna, Spain
关键词
High Performance Computing (HPC); Big Data; MapReduce; Performance Evaluation; Resource Efficiency; InfiniBand;
D O I
10.1016/j.procs.2015.05.202
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The popularity of Big Data computing models like MapReduce has caused the emergence of many frameworks oriented to High Performance Computing (HPC) systems. The suitability of each one to a particular use case depends on its design and implementation, the underlying system resources and the type of application to be run. Therefore, the appropriate selection of one of these frameworks generally involves the execution of multiple experiments in order to assess their performance, scalability and resource efficiency. This work studies the main issues of this evaluation, proposing a new MapReduce Evaluator (MREv) tool which unifies the configuration of the frameworks, eases the task of collecting results and generates resource utilization statistics. Moreover, a practical use case is described, including examples of the experimental results provided by this tool. MREv is available to download at http://mrev.des.udc.es.
引用
收藏
页码:80 / 89
页数:10
相关论文
共 50 条
  • [41] Proxy Benchmarks for Emerging Big-data Workloads
    Panda, Reena
    John, Lizy Kurian
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2017, : 139 - 140
  • [42] Characterizing OS Behaviors of Datacenter and Big Data Workloads
    Zheng, Chen
    Zhan, Jianfeng
    Jia, Zhen
    Zhang, Lixin
    [J]. PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 1079 - 1086
  • [43] Atrak: a MapReduce-based data warehouse for big data
    Mohammadhossein Barkhordari
    Mahdi Niamanesh
    [J]. The Journal of Supercomputing, 2017, 73 : 4596 - 4610
  • [44] Atrak: a MapReduce-based data warehouse for big data
    Barkhordari, Mohammadhossein
    Niamanesh, Mahdi
    [J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (10): : 4596 - 4610
  • [45] COMPARING IMPLEMENTATIONS OF NEAR-DATA COMPUTING WITH IN-MEMORY MAPREDUCE WORKLOADS
    Pugsley, Seth H.
    Jestes, Jeffrey
    Balasubramonian, Rajeev
    Srinivasan, Vijayalakshmi
    Buyuktosunoglu, Alper
    Davis, Al
    Li, Feifei
    [J]. IEEE MICRO, 2014, 34 (04) : 44 - 52
  • [46] Biscuit: A Framework for Near-Data Processing of Big Data Workloads
    Gu, Boncheol
    Yoon, Andre S.
    Bae, Duck-Ho
    Jo, Insoon
    Lee, Jinyoung
    Yoon, Jonghyun
    Kang, Jeong-Uk
    Kwon, Moonsang
    Yoon, Chanho
    Cho, Sangyeun
    Jeong, Jaeheon
    Chang, Duckhyun
    [J]. 2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2016, : 153 - 165
  • [47] Big Data Analysis Solutions using MapReduce Framework
    Elagib, Sara B.
    Najeeb, Atahur Rahman
    Hashim, Aisha H.
    Olanrewaju, Rashidah F.
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE), 2014, : 127 - 130
  • [48] Matrix Multiplication of Big Data Using MapReduce: A Review
    Qasem, Mais Haj
    Abu Sarhan, Alaa
    Qaddoura, Raneem
    Mahafzah, Basel A.
    [J]. PROCEEDINGS OF 2017 2ND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF INFORMATION TECHNOLOGY IN DEVELOPING RENEWABLE ENERGY PROCESSES & SYSTEMS (IT-DREPS 2017), 2017,
  • [49] MapReduce based Method for Big Data Semantic Clustering
    Yang, Jie
    Li, Xiaoping
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 2814 - 2819
  • [50] Improving Network Traffic in MapReduce for Big Data Applications
    Gawande, Priya
    Shaikh, Nuzhaft
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 2979 - 2983