Spark Deployment and Performance Evaluation on the MareNostrum Supercomputer

Authors
Tous, Ruben [1, 2]
Gounaris, Anastasios [3]
Tripiana, Carlos [1]
Torres, Jordi [1, 2]
Girona, Sergi [1]
Ayguade, Eduard [1, 2]
Labarta, Jesus [1, 2]
Becerra, Yolanda [1, 2]
Carrera, David [1, 2]
Valero, Mateo [1, 2]
Affiliations
[1] Barcelona Supercomp Ctr, Barcelona, Spain
[2] Univ Politecn Cataluna, Barcelona, Spain
[3] Aristotle Univ Thessaloniki, Thessaloniki, Greece
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a petascale supercomputer designed mainly for compute-intensive applications. As far as we know, this is the first attempt to investigate optimized deployment configurations of Spark on a petascale HPC setup. We detail the design of the framework and present benchmark data that provide insights into the scalability of the system. We examine the impact of different configurations, including parallelism, storage and networking alternatives, and we discuss several aspects of executing Big Data workloads on a computing system that is based on the compute-centric paradigm. Further, we derive conclusions that aim to pave the way towards systematic and optimized methodologies for fine-tuning data-intensive applications on large clusters, with an emphasis on parallelism configurations.
Pages: 299-306
Number of pages: 8