PERFORMANCE ANALYSIS AND OPTIMIZATION OF PARALLEL SCIENTIFIC APPLICATIONS ON CMP CLUSTERS

被引:0
|
作者
Wu, Xingfu [1 ]
Taylor, Valerie [1 ]
Lively, Charles [1 ]
Sharkawi, Sameh [1 ]
机构
[1] Texas A&M Univ, Dept Comp Sci, College Stn, TX 77843 USA
来源
基金
美国国家科学基金会;
关键词
performance analysis; performance optimization; chip multiprocessors (CMP); clusters; parallel scientific applications;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Chip multiprocessors (CMP) are widely used for high performance computing. Further, these CMPs are being configured in a hierarchical manner to compose a node in a cluster system. A major challenge to be addressed is efficient use of such cluster systems for large-scale scientific applications. In this paper, we quantify the performance gap resulting from using different number of processors per node; this information is used to provide a baseline for the amount of optimization needed when using all processors per node on CMP clusters. We conduct detailed performance analysis to identify how applications can be modified to efficiently utilize all processors per node using three scientific applications: a 3D particle-in-cell, magnetic fusion application Gyrokinetic Toroidal Code (GTC), a Lattice Boltzmann Method for simulating fluid dynamics (LBM), and an advanced Eulerian gyrokinetic-Maxwell equation solver for simulating microturbulent transport in plasma (GYRO). In terms of refinements, we use conventional techniques such as loop blocking, loop unrolling and loop fusion, and develop hybrid methods for optimizing MPI-Allreduce and MPI Reduce. Using these optimizations, the application performance for utilizing all processors per node was improved by up to 18.97% for GTC, 15.77% for LBM and 12.29% for GYRO on up to 2048 total processors on the CMP clusters.
引用
收藏
页码:61 / 74
页数:14
相关论文
共 50 条
  • [1] Performance analysis and optimization of parallel scientific applications on CMP clusters
    Department of Computer Science, Texas A and M University, College Station
    TX
    77843, United States
    [J]. Scalable Comput. Pract. Exp., 2009, 1 (61-74):
  • [2] Digital's clusters and scientific parallel applications
    Kaufmann, R
    Reddin, T
    [J]. DIGEST OF PAPERS: COMPCON SPRING 96, FORTY-FIRST IEEE COMPUTER SOCIETY INTERNATIONAL CONFERENCE - INTELLECTUAL LEVERAGE, 1996, : 250 - 253
  • [3] Improving the performance of speculatively parallel applications on the Hydra CMP
    Olukotun, Kunle
    Hammond, Lance
    Willey, Mark
    [J]. Proceedings of the International Conference on Supercomputing, 1999, : 21 - 30
  • [4] Performance Analysis of Parallel Visualization Applications and Scientific Applications on an Optical Grid
    Wu, Xingfu
    Taylor, Valerie
    [J]. PROCEEDINGS OF THE 2008 INTERNATIONAL CONFERENCE ON CYBERWORLDS, 2008, : 447 - 454
  • [5] Instrumentation database system for performance analysis of parallel scientific applications
    Nesheiwat, J
    Szymanski, BK
    [J]. PARALLEL COMPUTING, 2002, 28 (10) : 1409 - 1449
  • [6] Online remote trace analysis of parallel applications on high-performance clusters
    Brunst, H
    Malony, AD
    Shende, SS
    Bell, R
    [J]. HIGH PERFORMANCE COMPUTING, 2003, 2858 : 440 - 449
  • [7] Introduction to the special section on "Optimization of parallel scientific applications with accelerated high-performance computers"
    Carretero, Jesus
    Garcia-Bias, Javier
    Neytcheva, Maya G.
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2015, 46 : 78 - 80
  • [8] Differences in the Performance of Parallel Applications in Physical and Virtual Clusters
    Carneiro, L. S.
    Duarte, A. A.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2018, 16 (02) : 604 - 612
  • [9] Improving the performance of scientific parallel applications in a cluster of workstations
    Flores, A
    García, JM
    [J]. APPLIED PARALLEL COMPUTING: LARGE SCALE SCIENTIFIC AND INDUSTRIAL PROBLEMS, 1998, 1541 : 134 - 141
  • [10] Analysis and Optimization of Performance Characteristics for MPI Parallel Scientific Applications on the Grid (A Case Study for the OPATM-BFM Simulation Application)
    Cheptsov, A.
    Koller, B.
    Salon, S.
    Lazzari, P.
    Gracia, J.
    [J]. REMOTE INSTRUMENTATION SERVICES ON THE E-INFRASTRUCTURE: APPLICATIONS AND TOOLS, 2011, : 241 - 253