Enhancing in-memory efficiency for MapReduce-based data processing

Cited by: 5
Authors
Veiga, Jorge [1]
Exposito, Roberto R. [1]
Taboada, Guillermo L. [1]
Tourino, Juan [1]
Affiliations
[1] Univ A Coruna, Comp Architecture Grp, Campus A Coruna, La Coruna 15071, Spain
Keywords
Big data; MapReduce; In-memory computing; Garbage collector (GC); Performance evaluation; PERFORMANCE
DOI
10.1016/j.jpdc.2018.04.001
Chinese Library Classification number
TP301 [Theory and methods]
Discipline classification code
081202
Abstract
As the memory capacity of computational systems increases, the in-memory data management of Big Data processing frameworks becomes more crucial for performance. This paper analyzes and improves the memory efficiency of Flame-MR, a framework that accelerates Hadoop applications, providing valuable insight into the impact of memory management on performance. By optimizing memory allocation, the garbage collection overheads and execution times have been reduced by up to 85% and 44%, respectively, on a multi-core cluster. Moreover, different data buffer implementations are evaluated, showing that off-heap buffers achieve better results overall. Memory resources are also leveraged by caching intermediate results, improving iterative applications by up to 26%. The memory-enhanced version of Flame-MR has been compared with Hadoop and Spark on the Amazon EC2 cloud platform. The experimental results show significant performance benefits, reducing Hadoop execution times by up to 65% while providing very competitive results compared to Spark. (C) 2018 Elsevier Inc. All rights reserved.
Pages: 323-338
Page count: 16