MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems

被引:19
|
作者
Czarnul, Pawel [1 ]
Kuchta, Jaroslaw [1 ]
Matuszek, Mariusz [1 ]
Proficz, Jerzy [2 ]
Rosciszewski, Pawel [1 ]
Wojcik, Michal [1 ]
Szymanski, Julian [1 ]
机构
[1] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Dept Comp Architecture, Narutowicza 11-12, PL-80233 Gdansk, Poland
[2] Acad Comp Ctr, Narutowicza 11-12, PL-80233 Gdansk, Poland
关键词
Parallel computing; Performance simulation; Simulation environment; Cluster systems; CLUSTERS; TOOLKIT;
D O I
10.1016/j.simpat.2017.05.009
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper we present a new environment called MERPSYS that allows simulation of parallel application execution time on cluster-based systems. The environment offers a modeling application using the Java language extended with methods representing message passing type communication routines. It also offers a graphical interface for building a system model that incorporates various hardware components such as CPUs, GPUs, interconnects and easily allows various formulas to model execution and communication times of particular blocks of code. A simulator engine within the MERPSYS environment simulates execution of the application that consists of processes with various codes, to which distinct labels are assigned. The simulator runs one Java thread per label and scales computations and communication times adequately. This approach allows fast coarse-grained simulation of large applications on large-scale systems. We have performed tests and verification of results from the simulator for three real parallel applications implemented with C/MPI and run on real HPC clusters: a master-slave code computing similarity measures of points in a multidimensional space, a geometric single program multiple data parallel application with heat distribution and a divide-and-conquer application performing merge sort. In all cases the simulator gave results very similar to the real ones on configurations tested up to 1000 processes. Furthermore, it allowed us to make predictions of execution times on configurations beyond the hardware resources available to us. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:124 / 140
页数:17
相关论文
共 50 条
  • [11] NEXUS - A SIMULATION ENVIRONMENT FOR LARGE-SCALE NEURAL SYSTEMS
    SAJDA, P
    FINKEL, LH
    SIMULATION, 1992, 59 (06) : 358 - 364
  • [12] Large-scale parallel reservoir simulation on distributed memory systems
    Cao, JW
    Pan, F
    Sun, JC
    Liu, W
    DCABES 2001 PROCEEDINGS, 2001, : 98 - 103
  • [13] Large electromagnetic simulation by hybrid approach on large-scale parallel computing systems
    Alexandru, Mihai
    Monteil, Thierry
    Lorenz, Petr
    Coccetti, Fabio
    Aubert, Herve
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (13): : 3184 - 3204
  • [14] Parallel simulation of large-scale parallel applications
    Bagrodia, R
    Deelman, E
    Phan, T
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2001, 15 (01): : 3 - 12
  • [15] On the application of parallel database technology for large scale document management systems
    Clausnitzer, A
    Jaedicke, M
    Mitschang, B
    Nippl, C
    Reiser, A
    Zimmermann, S
    IDEAS '97 - INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 1997, : 388 - 396
  • [16] A distributed execution environment for large-scale workflow management systems with subnets and server migration
    Bauer, T
    Dadam, P
    PROCEEDINGS OF THE SECOND IFCIS INTERNATIONAL CONFERENCE ON COOPERATIVE INFORMATION SYSTEMS - COOPIS'97, 1997, : 99 - 108
  • [17] Leveraging a large scale HPC environment for computational studies in catalyst design
    Sherer, Edward
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [18] Automated Execution of Large-Scale Daylighting and Glare Simulations in a Cloud-Based Parallel Computing Environment
    Labib, Rania
    Baltazar, Juan-Carlos
    PROCEEDINGS OF BUILDING SIMULATION 2019: 16TH CONFERENCE OF IBPSA, 2020, : 1545 - 1551
  • [19] Large scale execution of a bioinformatic application on a volunteer grid
    Bertis, Viktors
    Bolze, Raphael
    Desprez, Frederic
    Reed, Kevin
    2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 3095 - +
  • [20] Unifying simulation and execution in a design environment for FPGA systems
    Hutchings, BL
    Nelson, BE
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2001, 9 (01) : 201 - 205