A Scalable InfiniBand Network Topology-Aware Performance Analysis Tool for MPI

被引:0
|
作者
Subramoni, Hari [1 ]
Vienne, Jerome [1 ]
Panda, Dhabaleswar K. [1 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Over the last decade, InfiniBand (IB) has become an increasingly popular interconnect for deploying modern supercomputing systems. As supercomputing systems grow in size and scale, the impact of IB network topology on the performance of high performance computing (HPC) applications also increase. Depending on the kind of network (FAT Tree, Tori, or Mesh), the number of network hops involved in data transfer varies. No tool currently exists that allows users of such large-scale clusters to analyze and visualize the communication pattern of HPC applications in a network topology-aware manner. In this paper, we take up this challenge and design a scalable, low-overhead InfiniBand Network Topology-Aware Performance Analysis Tool for MPI - INTAP-MPI. INTAP-MPI allows users to analyze and visualize the communication pattern of HPC applications on any IB network (FAT Tree, Tori, or Mesh). We integrate INTAP-MPI into the MVAPICH2 MPI library, allowing users of HPC clusters to seamlessly use it for analyzing their applications. Our experimental analysis shows that the INTAP-MPI is able to profile and visualize the communication pattern of applications with very low memory and performance overhead at scale.
引用
收藏
页码:439 / 450
页数:12
相关论文
共 50 条
  • [1] Topology-Aware Rank Reordering for MPI Collectives
    Mirsadeghi, Seyed H.
    Afsahi, Ahmad
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 1759 - 1768
  • [2] Portable Topology-Aware MPI-I/O
    Latham, Rob
    Bautista-Gomez, Leonardo
    Balaji, Pavan
    [J]. 2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 710 - 719
  • [3] Design of a Scalable InfiniBand Topology Service to Enable Network-Topology-Aware Placement of Processes
    Subramoni, H.
    Potluri, S.
    Kandalla, K.
    Barth, B.
    Vienne, J.
    Keasler, J.
    Tomko, K.
    Schulz, K.
    Moody, A.
    Panda, D. K.
    [J]. 2012 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2012,
  • [4] Visualization Tool for Development of Topology-Aware Network Communication Algorithm
    Suzuki, Ryohei
    Ishihata, Hiroaki
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1365 - +
  • [5] Visualization Tool for Development of Topology-Aware Network Communication Algorithm
    Suzuki, Ryohei
    Ishihata, Hiroaki
    [J]. 2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1363 - +
  • [6] INAM - A Scalable InfiniBand Network Analysis and Monitoring Tool
    Dandapanthula, N.
    Subramoni, H.
    Vienne, J.
    Kandalla, K.
    Sur, S.
    Panda, Dhabaleswar K.
    Brightwell, Ron
    [J]. EURO-PAR 2011: PARALLEL PROCESSING WORKSHOPS, PT II, 2012, 7156 : 166 - 177
  • [7] Topology-aware network fault influence domain analysis
    Wu, Zhenwei
    Lu, Kai
    Wang, Xiaoping
    Chi, Wanqing
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2017, 57 : 266 - 280
  • [8] RAQNet: A topology-aware overlay network
    Mirrezaei, Seyed Iman
    Shahparian, Javad
    Ghodsi, Mohammad
    [J]. INTER-DOMAIN MANAGEMENT, PROCEEDINGS, 2007, 4543 : 13 - +
  • [9] Topology-Aware Strategy for MPI-IO Operations in Clusters
    Liu, Weifeng
    Zhou, Jie
    Guo, Meng
    [J]. JOURNAL OF OPTIMIZATION, 2018, 2018
  • [10] Efficient topology-aware overlay network
    Waldvogel, M
    Rinaldi, R
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2003, 33 (01) : 101 - 106