Spark Deployment and Performance Evaluation on the MareNostrum Supercomputer

被引:0
|
作者
Tous, Ruben [1 ,2 ]
Gounaris, Anastasios [3 ]
Tripiana, Carlos [1 ]
Torres, Jordi [1 ,2 ]
Girona, Sergi [1 ]
Ayguade, Eduard [1 ,2 ]
Labarta, Jesus [1 ,2 ]
Becerra, Yolanda [1 ,2 ]
Carrera, David [1 ,2 ]
Valero, Mateo [1 ,2 ]
机构
[1] Barcelona Supercomp Ctr, Barcelona, Spain
[2] Univ Politecn Cataluna, Barcelona, Spain
[3] Aristotle Univ Thessaloniki, Thessaloniki, Greece
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a petascale supercomputer designed mainly for compute-intensive applications. As far as we know, this is the first attempt to investigate optimized deployment configurations of Spark on a petascale HPC setup. We detail the design of the framework and present some benchmark data to provide insights into the scalability of the system. We examine the impact of different configurations including parallelism, storage and networking alternatives, and we discuss several aspects in executing Big Data workloads on a computing system that is based on the compute-centric paradigm. Further, we derive conclusions aiming to pave the way towards systematic and optimized methodologies for fine-tuning data-intensive application on large clusters emphasizing on parallelism configurations.
引用
下载
收藏
页码:299 / 306
页数:8
相关论文
共 50 条
  • [41] Performance Evaluation of Partial Deployment of Breadcrumbs in Content Oriented Networks
    Tsutsui, Tatsuhiro
    Urabayashi, Hiroyuki
    Yamamoto, Miki
    Rosensweig, Elisha
    Kurose, James F.
    2012 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2012, : 5828 - 5832
  • [42] Performance Evaluation, System Design and Network Deployment of IEEE 802.11
    Anand R. Prasad
    Neeli R. Prasad
    Ad Kamerman
    Henri Moelard
    Albert Eikelenboom
    Wireless Personal Communications, 2001, 19 : 57 - 79
  • [43] Performance Evaluation of Sensor Deployment Strategies in WSNs Towards IoT
    Alablani, Ibtihal
    Alenazi, Mohammed
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [44] Proteome-scale Deployment of Protein Structure Prediction Workflows on the Summit Supercomputer
    Gao, Mu
    Coletti, Mark
    Davidson, Russell B.
    Prout, Ryan
    Abraham, Subil
    Hernandez, Benjamin
    Sedova, Ada
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 206 - 215
  • [45] Evaluation of Mobility Performance and Deployment Scenarios in UMTS Heterogeneous Networks
    Min, Wang
    Ramos, Edgar
    Wang, Y. -P. Eric
    Lidian, Namir
    Nammi, Sairamesh
    Curran, Mark
    2014 IEEE 79TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-SPRING), 2014,
  • [46] Performance evaluation, system design and network deployment of IEEE 802.11
    Prasad, AR
    Prasad, NR
    Kamerman, A
    Moelard, H
    Eikelenboom, A
    WIRELESS PERSONAL COMMUNICATIONS, 2001, 19 (01) : 57 - 79
  • [47] Memory or Time: Performance Evaluation for Iterative Operation on Hadoop and Spark
    Gu, Lei
    Li, Huan
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 721 - 727
  • [48] Performance Evaluation of Distributed Computing Environments with Hadoop and Spark Frameworks
    Taran, Vladyslav
    Alienin, Oleg
    Stirenko, Sergii
    Gordienko, Yuri
    Rojbi, A.
    2017 IEEE INTERNATIONAL YOUNG SCIENTISTS FORUM ON APPLIED PHYSICS AND ENGINEERING (YSF), 2017, : 80 - 83
  • [49] Apache Spark usage and deployment models for scientific computing
    Castro, Diogo
    Kothuri, Prasanth
    Mrowczynski, Piotr
    Piparo, Danilo
    Tejedor, Enric
    23RD INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2018), 2019, 214
  • [50] Performance Evaluation of a Hybrid Programming Model for RSDFT on T2K Open Supercomputer
    Tsuji, Miwako
    Sato, Mitsuhisa
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2011, 5 (02) : 199 - 219