Topology-Aware Rank Reordering for MPI Collectives

Cited by: 13
Authors
Mirsadeghi, Seyed H. [1]
Afsahi, Ahmad [1]
Affiliations
[1] Queens Univ, ECE Dept, Kingston, ON K7L 3N6, Canada
Keywords
Collective Communications; Mapping; MPI; Rank Reordering; Topology Awareness; FRAMEWORK;
DOI
10.1109/IPDPSW.2016.139
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
As we move toward the Exascale era, HPC systems are becoming more complex, introducing increasing levels of heterogeneity in communication channels. This leads to variations in communication performance at different levels of the hierarchy within modern HPC systems. Consequently, communicating peers such as MPI processes should be mapped onto the target cores in a topology-aware fashion so as to avoid message transmissions over slower channels. This is especially true for collective communications, due to the global nature of their communication patterns and their wide use in many parallel applications. In this paper, we exploit the rank reordering mechanism of MPI to realize run-time topology awareness for collective communications, in particular MPI_Allgather. To this end, we propose four fine-tuned mapping heuristics for various communication patterns and algorithms commonly used in MPI_Allgather. The heuristics provide a better match between the collective communication pattern and the topology of the target system. Our experimental results with 4096 processes show that MPI rank reordering using the proposed fine-tuned mapping heuristics can provide up to a 78% reduction in MPI_Allgather latency at the micro-benchmark level. At the application level, we can achieve up to a 34% reduction in execution time. The results also show that the proposed heuristics significantly outperform the Scotch library, a general-purpose graph mapping library.
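The idea of matching a collective's communication pattern to the system topology through rank reordering can be illustrated with a toy model. The sketch below is not one of the paper's four heuristics; it assumes a ring-style allgather in which rank r sends to rank (r + 1) mod p, a hypothetical cyclic placement of 8 processes over 2 nodes, and a simple "group by node" reordering. All function names are illustrative.

```python
def ring_internode_messages(node_of_rank):
    """Count ring neighbor pairs (r -> (r + 1) mod p) that sit on different nodes."""
    p = len(node_of_rank)
    return sum(node_of_rank[r] != node_of_rank[(r + 1) % p] for r in range(p))


def reorder_for_ring(node_of_rank):
    """Return new_rank[old_rank] so that processes on one node get consecutive ranks.

    Assumption: a plain sort by node id, not the heuristics from the paper.
    """
    order = sorted(range(len(node_of_rank)), key=lambda r: (node_of_rank[r], r))
    new_rank = [0] * len(node_of_rank)
    for new, old in enumerate(order):
        new_rank[old] = new
    return new_rank


def apply_reordering(node_of_rank, new_rank):
    """Return the node hosting each rank after the reordering is applied."""
    reordered = [None] * len(node_of_rank)
    for old, new in enumerate(new_rank):
        reordered[new] = node_of_rank[old]
    return reordered


# Hypothetical layout: 8 processes placed cyclically over 2 nodes.
cyclic = [r % 2 for r in range(8)]
new_rank = reorder_for_ring(cyclic)
reordered = apply_reordering(cyclic, new_rank)

before = ring_internode_messages(cyclic)
after = ring_internode_messages(reordered)
print(f"inter-node ring messages: {before} before, {after} after reordering")
```

In a real MPI program, one standard way to apply such a permutation is to create a new communicator with MPI_Comm_split, passing the new rank as the key; whether the paper uses that mechanism is not stated in the abstract.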
Pages: 1759 - 1768
Page count: 10
Related papers
50 in total
  • [41] Netloc: A Tool for Topology-Aware Process Mapping
    Bordage, Cyril
    Foyer, Clement
    Goglin, Brice
    [J]. EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 157 - 166
  • [42] Topology-aware Image Compositing using NVLink
    Christensen, Cameron
    Fogal, Thomas
    Luehr, Nathan
    Woolley, Cliff
    [J]. 2016 IEEE 6TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2016, : 93 - 94
  • [43] Topology-Aware Surface Reconstruction for Point Clouds
    Bruel-Gabrielsson, Rickard
    Ganapathi-Subramanian, Vignesh
    Skraba, Primoz
    Guibas, Leonidas J.
    [J]. COMPUTER GRAPHICS FORUM, 2020, 39 (05) : 197 - 207
  • [44] Topology-aware resource management for HPC applications
    Georgiou, Yiannis
    Jeannot, Emmanuel
    Mercier, Guillaume
    Villiermet, Adele
    [J]. 18TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING (ICDCN 2017), 2017,
  • [45] TopoSeg: Topology-Aware Nuclear Instance Segmentation
    He, Hongliang
    Wang, Jun
    Wei, Pengxu
    Xu, Fan
    Ji, Xiangyang
    Liu, Chang
    Chen, Jie
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21250 - 21259
  • [46] Topology-aware Content-centric Networking
    Zhang, Xinggong
    Niu, Tong
    Lao, Feng
    Guo, Zongming
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2013, 43 (04) : 559 - 560
  • [47] A novel approach for topology-aware overlay multicasting
    Chen, Xiao
    Shao, Huagang
    Wang, Weinong
    [J]. PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, 2006, 4330 : 147 - +
  • [48] TOPOLOGY-AWARE LEARNING FOR VOLUMETRIC CEREBROVASCULAR SEGMENTATION
    Banerjee, Subhashis
    Toumpanakis, Dimitrios
    Dhara, Ashis Kumar
    Wikstrom, Johan
    Strand, Robin
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [49] A fast and efficient algorithm for topology-aware coallocation
    Kravtsov, Valentin
    Swain, Martin
    Dubin, Uri
    Dubitzky, Werner
    Schuster, Assaf
    [J]. COMPUTATIONAL SCIENCE - ICCS 2008, PT 1, 2008, 5101 : 274 - +
  • [50] Topology-Aware Data Aggregation for High Performance Collective MPI-IO on a Multi-Core Cluster System
    Tsujita, Yuichi
    Hori, Atsushi
    Kameyama, Toyohisa
    Ishikawa, Yutaka
    [J]. 2016 FOURTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2016, : 37 - 46