MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems

被引:19
|
作者
Czarnul, Pawel [1 ]
Kuchta, Jaroslaw [1 ]
Matuszek, Mariusz [1 ]
Proficz, Jerzy [2 ]
Rosciszewski, Pawel [1 ]
Wojcik, Michal [1 ]
Szymanski, Julian [1 ]
机构
[1] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Dept Comp Architecture, Narutowicza 11-12, PL-80233 Gdansk, Poland
[2] Acad Comp Ctr, Narutowicza 11-12, PL-80233 Gdansk, Poland
关键词
Parallel computing; Performance simulation; Simulation environment; Cluster systems; CLUSTERS; TOOLKIT;
D O I
10.1016/j.simpat.2017.05.009
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper we present a new environment called MERPSYS that allows simulation of parallel application execution time on cluster-based systems. The environment offers a modeling application using the Java language extended with methods representing message passing type communication routines. It also offers a graphical interface for building a system model that incorporates various hardware components such as CPUs, GPUs, interconnects and easily allows various formulas to model execution and communication times of particular blocks of code. A simulator engine within the MERPSYS environment simulates execution of the application that consists of processes with various codes, to which distinct labels are assigned. The simulator runs one Java thread per label and scales computations and communication times adequately. This approach allows fast coarse-grained simulation of large applications on large-scale systems. We have performed tests and verification of results from the simulator for three real parallel applications implemented with C/MPI and run on real HPC clusters: a master-slave code computing similarity measures of points in a multidimensional space, a geometric single program multiple data parallel application with heat distribution and a divide-and-conquer application performing merge sort. In all cases the simulator gave results very similar to the real ones on configurations tested up to 1000 processes. Furthermore, it allowed us to make predictions of execution times on configurations beyond the hardware resources available to us. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:124 / 140
页数:17
相关论文
共 50 条
  • [1] Enabling Parallel Simulation of Large-Scale HPC Network Systems
    Mubarak, Misbah
    Carothers, Christopher D.
    Ross, Robert B.
    Carns, Philip
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (01) : 87 - 100
  • [2] SIMULATION OF HPC JOB SCHEDULING AND LARGE-SCALE PARALLEL WORKLOADS
    Abu Obaida, Mohammad
    Liu, Jason
    2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 920 - 931
  • [3] Modeling Application Resilience in Large-scale Parallel Execution
    Wu, Kai
    Dong, Wenqian
    Guan, Qiang
    DeBardeleben, Nathan
    Li, Dong
    PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [4] EDISON: A Web-based HPC Simulation Execution Framework for Large-scale Scientific Computing Software
    Suh, Young-Kyoon
    Ryu, Hoon
    Kim, Hangi
    Cho, Kum Won
    2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2016, : 608 - 612
  • [5] APPLICATION OF A JAVA']JAVA-BASED FRAMEWORK TO PARALLEL SIMULATION OF LARGE-SCALE SYSTEMS
    Niewiadomska-Szynkiewicz, Ewa
    Zmuda, Maciej
    Malinowski, Krzysztof
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2003, 13 (04) : 537 - 547
  • [6] Parallel simulation of a large-scale aerospace system in a multicomputer environment
    Wells, BE
    Ricks, KG
    Weir, JM
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1997, 33 (02) : 507 - 522
  • [7] A Flexible Strategy for Distributed and Parallel Execution of a Monolithic Large-Scale Sequential Application
    Navarro, Felipe
    Gonzalez, Carlos
    Peredo, Oscar
    Morales, Gerson
    Egana, Alvaro
    Ortiz, Julian M.
    HIGH PERFORMANCE COMPUTING, CARLA 2014, 2014, 485 : 54 - 67
  • [8] Improving LAMMPS performance for molecular dynamic simulation on large-scale HPC systems
    Qi, Du
    Wang, Feng
    Huang, Hui
    Chen, Jinlin
    COMPUTER JOURNAL, 2025,
  • [9] Large-scale parallel execution of urban-scale traffic simulation and its performance on K computer
    Daigo Umemoto
    Nobuyasu Ito
    Journal of Computational Social Science, 2019, 2 : 97 - 101
  • [10] Large-scale parallel execution of urban-scale traffic simulation and its performance on K computer
    Umemoto, Daigo
    Ito, Nobuyasu
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2019, 2 (01): : 97 - 101