A Scalable InfiniBand Network Topology-Aware Performance Analysis Tool for MPI

被引:0
|
作者
Subramoni, Hari [1 ]
Vienne, Jerome [1 ]
Panda, Dhabaleswar K. [1 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Over the last decade, InfiniBand (IB) has become an increasingly popular interconnect for deploying modern supercomputing systems. As supercomputing systems grow in size and scale, the impact of IB network topology on the performance of high performance computing (HPC) applications also increase. Depending on the kind of network (FAT Tree, Tori, or Mesh), the number of network hops involved in data transfer varies. No tool currently exists that allows users of such large-scale clusters to analyze and visualize the communication pattern of HPC applications in a network topology-aware manner. In this paper, we take up this challenge and design a scalable, low-overhead InfiniBand Network Topology-Aware Performance Analysis Tool for MPI - INTAP-MPI. INTAP-MPI allows users to analyze and visualize the communication pattern of HPC applications on any IB network (FAT Tree, Tori, or Mesh). We integrate INTAP-MPI into the MVAPICH2 MPI library, allowing users of HPC clusters to seamlessly use it for analyzing their applications. Our experimental analysis shows that the INTAP-MPI is able to profile and visualize the communication pattern of applications with very low memory and performance overhead at scale.
引用
收藏
页码:439 / 450
页数:12
相关论文
共 50 条
  • [41] Topology-aware virtual network embedding based on closeness centrality
    Wang, Zihou
    Han, Yanni
    Lin, Tao
    Xu, Yuemei
    Ci, Song
    Tang, Hui
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2013, 7 (03) : 446 - 457
  • [42] A Scalable Network-Based Performance Analysis Tool for MPI on Large-Scale HPC Systems
    Subramoni, Hari
    Lu, Xiaoyi
    Panda, Dhabaleswar K.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 354 - 358
  • [43] INAM2: InfiniBand Network Analysis and Monitoring with MPI
    Subramoni, Hari
    Augustine, Albert Mathews
    Arnold, Mark
    Perkins, Jonathan
    Lu, Xiaoyi
    Hamidouche, Khaled
    Panda, Dhabaleswar K.
    [J]. HIGH PERFORMANCE COMPUTING, 2016, 9697 : 300 - 320
  • [44] The Effect of Topology-Aware Process and Thread Placement on Performance and Energy
    Solernou, Albert
    Thiyagalingam, Jeyarajan
    Duta, Mihai C.
    Trefethen, Anne E.
    [J]. SUPERCOMPUTING (ISC 2013), 2013, 7905 : 357 - 371
  • [45] TOPOLOGY-AWARE DISTRIBUTED ADAPTATION OF LAPLACIAN WEIGHTS FOR IN-NETWORK AVERAGING
    Bertrand, Alexander
    Moonen, Marc
    [J]. 2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [46] FNTAR: A Future Network Topology-aware Routing protocol in UAV networks
    Peng, Jianfei
    Gao, Hang
    Liu, Liang
    Wu, Yuting
    Xu, Xiangyu
    [J]. 2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [47] Topology-Aware Virtual Network Embedding to Survive Multiple Node Failures
    Xiao, Ailing
    Wang, Ying
    Meng, Luoming
    Qiu, Xuesong
    Li, Wenjing
    [J]. 2014 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2014), 2014, : 1823 - 1828
  • [48] Multi-core and Network Aware MPI Topology Functions
    Rashti, Mohammad Javad
    Green, Jonathan
    Balaji, Pavan
    Afsahi, Ahmad
    Gropp, William
    [J]. RECENT ADVANCES IN THE MESSAGE PASSING INTERFACE, 2011, 6960 : 50 - +
  • [49] A TOPOLOGY-AWARE PEER-TO-PEER PROTOCOL APPLICABLE TO WIRELESS NETWORK
    Wang, Shiguo
    Ji, Hong
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT, PROCEEDINGS, 2009, : 1004 - 1009
  • [50] Topology-aware VM Placement for Network Optimization in Cloud Data Centers
    Lian, Zhen
    Li, Xin
    Qin, Xiaolin
    [J]. 2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 558 - 565