A Scalable InfiniBand Network Topology-Aware Performance Analysis Tool for MPI

被引:0
|
作者
Subramoni, Hari [1 ]
Vienne, Jerome [1 ]
Panda, Dhabaleswar K. [1 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Over the last decade, InfiniBand (IB) has become an increasingly popular interconnect for deploying modern supercomputing systems. As supercomputing systems grow in size and scale, the impact of IB network topology on the performance of high performance computing (HPC) applications also increase. Depending on the kind of network (FAT Tree, Tori, or Mesh), the number of network hops involved in data transfer varies. No tool currently exists that allows users of such large-scale clusters to analyze and visualize the communication pattern of HPC applications in a network topology-aware manner. In this paper, we take up this challenge and design a scalable, low-overhead InfiniBand Network Topology-Aware Performance Analysis Tool for MPI - INTAP-MPI. INTAP-MPI allows users to analyze and visualize the communication pattern of HPC applications on any IB network (FAT Tree, Tori, or Mesh). We integrate INTAP-MPI into the MVAPICH2 MPI library, allowing users of HPC clusters to seamlessly use it for analyzing their applications. Our experimental analysis shows that the INTAP-MPI is able to profile and visualize the communication pattern of applications with very low memory and performance overhead at scale.
引用
收藏
页码:439 / 450
页数:12
相关论文
共 50 条
  • [31] Topology-aware Virtual Network Embedding based on Multiple Characteristics
    Feng, Min
    Liao, Jianxin
    Wang, Jingyu
    Qing, Sude
    Qi, Qi
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2014, : 2956 - 2962
  • [32] Topology-Aware Activation Layer for Neural Network Image Segmentation
    Baxter, John S. H.
    Jannin, Pierre
    [J]. MEDICAL IMAGING 2020: IMAGE PROCESSING, 2021, 11313
  • [33] Topology-Aware Prediction of Virtual Network Function Resource Requirements
    Mijumbi, Rashid
    Hasija, Sidhant
    Davy, Steven
    Davy, Alan
    Jennings, Brendan
    Boutaba, Raouf
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2017, 14 (01): : 106 - 120
  • [34] TA-Net: Topology-Aware Network for Gland Segmentation
    Wang, Haotian
    Xian, Min
    Vakanski, Aleksandar
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3241 - 3249
  • [35] Topology-Aware Data Aggregation for High Performance Collective MPI-IO on a Multi-Core Cluster System
    Tsujita, Yuichi
    Hori, Atsushi
    Kameyama, Toyohisa
    Ishikawa, Yutaka
    [J]. 2016 FOURTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2016, : 37 - 46
  • [36] Topology-Aware Mapping of Spiking Neural Network to Neuromorphic Processor
    Xiao, Chao
    Wang, Yao
    Chen, Jihua
    Wang, Lei
    [J]. ELECTRONICS, 2022, 11 (18)
  • [37] TARMan: Topology-Aware Reliability Management for Softwarized Network Systems
    Gebre-Amlak, Haymanot
    Banala, Goutham
    Song, Sejun
    Choi, Baek-Young
    Choi, Taesang
    Zhu, Henry
    [J]. 2017 23RD IEEE INTERNATIONAL SYMPOSIUM ON LOCAL AND METROPOLITAN AREA NETWORKS (LANMAN), 2017,
  • [38] Interaction Subgraph Sequential Topology-Aware Network for Transferable Recommendation
    Yang, Kang
    Yu, Ruiyun
    Guo, Bingyang
    Li, Jie
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5221 - 5233
  • [39] Topology-Aware Correlated Network Anomaly Event Detection and Diagnosis
    Prasad Calyam
    Manojprasadh Dhanapalan
    Mukundan Sridharan
    Ashok Krishnamurthy
    Rajiv Ramnath
    [J]. Journal of Network and Systems Management, 2014, 22 : 208 - 234
  • [40] Topology-Aware Correlated Network Anomaly Event Detection and Diagnosis
    Calyam, Prasad
    Dhanapalan, Manojprasadh
    Sridharan, Mukundan
    Krishnamurthy, Ashok
    Ramnath, Rajiv
    [J]. JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2014, 22 (02) : 208 - 234