Execution-driven Parallel Simulation of PGAS Applications on Heterogeneous Tiled Architectures

被引:6
|
作者
Roloff, Sascha [1 ]
Schafhauser, David [1 ]
Hannig, Frank [1 ]
Teich, Juergen [1 ]
机构
[1] Univ Erlangen Nurnberg, Dept Comp Sci, Hardware Software Codesign, Nurnberg, Germany
关键词
D O I
10.1145/2744769.2744840
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a parallel execution-driven simulator for the efficient simulation of heterogeneous tile-based multi-core architectures. Here, the architecture is composed of several tiles connected via a network-on-chip and each tile contains local memory as well as several possibly different types of compute resources. Partitioned Global Address Space (PGAS) is a programming model matching very well the needs for programming of such modern multi-core architectures. In order to provide performance estimations for parallel software and enable architecture design space exploration, fast functional and timing simulation techniques are required. Thus, we present a simulator that meets this requirement by combining a fast direct-execution simulation approach with different parallelization strategies. Here, we propose four novel parallel discrete-event simulation techniques, which map thread level parallelism within the applications to core-level parallelism on the target architecture and back to thread-level parallelism on the host machine. In order to achieve this, the correct synchronization and activation of the host threads is necessary being the main focus of this paper. Experiments with parallel real world applications are used to compare the different techniques against each other and demonstrate that 10.4 times faster simulations than a sequential simulation can be achieved on a 12-core Intel Xeon processor.
引用
收藏
页数:6
相关论文
共 33 条
  • [21] Execution cost minimization scheduling algorithms for deadline-constrained parallel applications on heterogeneous clouds
    Chen, Weihong
    Xie, Guoqi
    Li, Renfa
    Li, Keqin
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (02): : 701 - 715
  • [22] Execution cost minimization scheduling algorithms for deadline-constrained parallel applications on heterogeneous clouds
    Weihong Chen
    Guoqi Xie
    Renfa Li
    Keqin Li
    Cluster Computing, 2021, 24 : 701 - 715
  • [23] Evaluation of HCM: A New Model to Predict the Execution Time of Regular Parallel Applications on a Heterogeneous Cluster
    Soares, Thiago Marques
    dos Santos, Rodrigo Weber
    Lobosco, Marcelo
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2017), PT II, 2018, 10778 : 58 - 67
  • [24] Numerical simulation of heterogeneous detonation in polydisperse gas suspensions using modern parallel computational architectures
    Kratova, Yu. V.
    Kashkovsky, A. V.
    Shershnev, A. A.
    INTERNATIONAL CONFERENCE ON THE METHODS OF AEROPHYSICAL RESEARCH (ICMAR 2018), 2018, 2027
  • [25] A hybrid parallel cellular automata model for urban growth simulation over GPU/CPU heterogeneous architectures
    Guan, Qingfeng
    Shi, Xuan
    Huang, Miaoqing
    Lai, Chenggang
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2016, 30 (03) : 494 - 514
  • [26] Improving the scalability of parallel N-body applications with an event-driven constraint-based execution model
    Dekate, Chirag
    Anderson, Matthew
    Brodowicz, Maciej
    Kaiser, Hartmut
    Adelstein-Lelbach, Bryce
    Sterling, Thomas
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2012, 26 (03): : 319 - 332
  • [27] Domain-specific virtual processors as a portable programming and execution model for parallel computational workloads on modern heterogeneous high-performance computing architectures
    Lyakh, Dmitry, I
    INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 2019, 119 (12)
  • [28] Simulation-based evaluation of design options for high performance parallel architectures for space-borne applications
    Leonardi, A
    Schottdorf, B
    Bretschneider, T
    PDPTA '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-3, 2004, : 114 - 118
  • [29] PERFORMANCE-MEASUREMENT AND TRACE DRIVEN SIMULATION OF PARALLEL CAD AND NUMERIC APPLICATIONS ON A HYPERCUBE MULTICOMPUTER
    HSU, JM
    BANERJEE, P
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1992, 3 (04) : 451 - 464
  • [30] Improving the design flow for parallel and heterogeneous architectures running real-time applications: The PHARAON FP7 project
    Posadas, Hector
    Nicolas, Alejandro
    Penil, Pablo
    Villar, Eugenio
    Broekaert, Florian
    Bourdelles, Michel
    Cohen, Albert
    Lazarescu, Mihai T.
    Lavagno, Luciano
    Terechko, Andrei
    Glassee, Miguel
    Prieto, Manuel
    MICROPROCESSORS AND MICROSYSTEMS, 2014, 38 (08) : 960 - 975