Spark Deployment and Performance Evaluation on the MareNostrum Supercomputer

被引:0
|
作者
Tous, Ruben [1 ,2 ]
Gounaris, Anastasios [3 ]
Tripiana, Carlos [1 ]
Torres, Jordi [1 ,2 ]
Girona, Sergi [1 ]
Ayguade, Eduard [1 ,2 ]
Labarta, Jesus [1 ,2 ]
Becerra, Yolanda [1 ,2 ]
Carrera, David [1 ,2 ]
Valero, Mateo [1 ,2 ]
机构
[1] Barcelona Supercomp Ctr, Barcelona, Spain
[2] Univ Politecn Cataluna, Barcelona, Spain
[3] Aristotle Univ Thessaloniki, Thessaloniki, Greece
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a petascale supercomputer designed mainly for compute-intensive applications. As far as we know, this is the first attempt to investigate optimized deployment configurations of Spark on a petascale HPC setup. We detail the design of the framework and present some benchmark data to provide insights into the scalability of the system. We examine the impact of different configurations including parallelism, storage and networking alternatives, and we discuss several aspects in executing Big Data workloads on a computing system that is based on the compute-centric paradigm. Further, we derive conclusions aiming to pave the way towards systematic and optimized methodologies for fine-tuning data-intensive application on large clusters emphasizing on parallelism configurations.
引用
下载
收藏
页码:299 / 306
页数:8
相关论文
共 50 条
  • [21] Performance Analysis and Evaluation of Deployment in Small Cell Networks
    Zheng, Kan
    Li, Yue
    Zhang, Yingkai
    Jiang, Zheng
    Long, Hang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2015, 9 (03): : 886 - 900
  • [22] Performance Evaluation of Virtualization Methodologies to Facilitate NFV Deployment
    Zahoor, Sumbal
    Ahmad, Ishtiaq
    Rehman, Ateeq Ur
    Eldin, Elsayed Tag
    Ghamry, Nivin A.
    Shafiq, Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (01): : 311 - 329
  • [23] Performance and deployment evaluation of a parallel application on a private Cloud
    Mc Evoy, Giacomo V.
    Schulze, Bruno
    Garcia, Eduardo L. M.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2011, 23 (17): : 2048 - 2062
  • [24] Deployment and Performance Evaluation of Virtual Network based on OpenStack
    Zhao, Shaoka
    Li, Liyao
    Yang, Jiahai
    Xu, Cong
    Ling, Xiao
    Huang, Shuxiao
    PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON CLOUD COMPUTING AND INFORMATION SECURITY (CCIS 2013), 2013, 52 : 18 - 22
  • [25] Performance Evaluation of Big Data Frameworks: MapReduce and Spark
    Singh, Jaspreet
    Panda, S. N.
    Kaushal, Rajesh
    INTELLIGENT COMMUNICATION, CONTROL AND DEVICES, ICICCD 2017, 2018, 624 : 1611 - 1619
  • [26] Performance analysis and optimization for scalable deployment of deep learning models for country-scale settlement mapping on Titan supercomputer
    Kurte, Kuldeep
    Sanyal, Jibonananda
    Berres, Anne
    Lunga, Dalton
    Coletti, Mark
    Yang, Hsiuhan Lexie
    Graves, Daniel
    Liebersohn, Benjamin
    Rose, Amy
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (20):
  • [27] PERSONAL SUPERCOMPUTER PUSHES PRICE PERFORMANCE
    MYRVAAGNES, R
    ELECTRONIC PRODUCTS MAGAZINE, 1986, 29 (01): : 35 - 35
  • [28] Potential of a modern vector supercomputer for practical applications: performance evaluation of SX-ACE
    Ryusuke Egawa
    Kazuhiko Komatsu
    Shintaro Momose
    Yoko Isobe
    Akihiro Musa
    Hiroyuki Takizawa
    Hiroaki Kobayashi
    The Journal of Supercomputing, 2017, 73 : 3948 - 3976
  • [29] Potential of a modern vector supercomputer for practical applications: performance evaluation of SX-ACE
    Egawa, Ryusuke
    Komatsu, Kazuhiko
    Momose, Shintaro
    Isobe, Yoko
    Musa, Akihiro
    Takizawa, Hiroyuki
    Kobayashi, Hiroaki
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (09): : 3948 - 3976
  • [30] PERFORMANCE EVALUATION OF DOMAIN DECOMPOSITION METHOD WITH SPARSE MATRIX STORAGE SCHEMES IN MODERN SUPERCOMPUTER
    Mukaddes, Abul Mukid Mohammad
    Ogino, Masao
    Shioya, Ryuji
    INTERNATIONAL JOURNAL OF COMPUTATIONAL METHODS, 2014, 11