Topology-Aware Rank Reordering for MPI Collectives

被引:13
|
作者
Mirsadeghi, Seyed H. [1 ]
Afsahi, Ahmad [1 ]
机构
[1] Queens Univ, ECE Dept, Kingston, ON K7L 3N6, Canada
关键词
Collective Communications; Mapping; MPI; Rank Reordering; Topology Awareness; FRAMEWORK;
D O I
10.1109/IPDPSW.2016.139
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As we move toward the Exascale era, HPC systems are becoming more complex, introducing increasing levels of heterogeneity in communication channels. This leads to variations in communication performance at different levels of hierarchy within modern HPC systems. Consequently, communicating peers such as MPI processes should be mapped onto the target cores in a topology-aware fashion so as to avoid message transmissions over slower channels. This is especially true for collective communications due to the global nature of their communication patterns and their vast use in many of parallel applications. In this paper, we exploit the rank reordering mechanism of MPI to realize run-time topology awareness for collective communications and in particular MPI_Allgather. To this end, we propose four fine-tuned mapping heuristics for various communication patterns and algorithms commonly used in MPI_Allgather. The heuristics provide a better match between the collective communication pattern and the topology of the target system. Our experimental results with 4096 processes show that MPI rank reordering using the proposed fine-tuned mapping heuristics can provide up to 78% reduction in MPI_Allgather latency at the micro-benchmark level. At the application level, we can achieve up to 34% reduction in execution time. The results also show that the proposed heuristics significantly outperform the Scotch library which provides a general-purpose graph mapping library.
引用
收藏
页码:1759 / 1768
页数:10
相关论文
共 50 条
  • [1] Rank reordering strategy for MPI topology creation functions
    Hatazaki, T
    [J]. RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, 1998, 1497 : 188 - 195
  • [2] Portable Topology-Aware MPI-I/O
    Latham, Rob
    Bautista-Gomez, Leonardo
    Balaji, Pavan
    [J]. 2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 710 - 719
  • [3] Topology-Aware Strategy for MPI-IO Operations in Clusters
    Liu, Weifeng
    Zhou, Jie
    Guo, Meng
    [J]. JOURNAL OF OPTIMIZATION, 2018, 2018
  • [4] A Scalable InfiniBand Network Topology-Aware Performance Analysis Tool for MPI
    Subramoni, Hari
    Vienne, Jerome
    Panda, Dhabaleswar K.
    [J]. EURO-PAR 2012: PARALLEL PROCESSING WORKSHOPS, 2013, 7640 : 439 - 450
  • [5] Rank reordering for MPI communication optimization
    Brandfass, B.
    Alrutz, T.
    Gerhold, T.
    [J]. COMPUTERS & FLUIDS, 2013, 80 : 372 - 380
  • [6] TopAwaRe: Topology-Aware Registration
    Nielsen, Rune Kok
    Darkner, Sune
    Feragen, Aasa
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 : 364 - 372
  • [7] Topology-aware Simulated Annealing
    Kerrache, Said
    Benhidour, Hafida
    [J]. 2014 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, MODELLING AND SIMULATION, 2014, : 19 - 24
  • [8] A Topology-Aware Improvement on Chord
    Zhou Xiaofan
    Yang Xudong
    Wang Zhiqian
    [J]. 2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2009, : 637 - 640
  • [9] A Topology-Aware Random Walk
    Yu, InKwan
    Newman, Richard
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2012, E95B (03) : 995 - 998
  • [10] Topology-aware job mapping
    Georgiou, Yiannis
    Jeannot, Emmanuel
    Mercier, Guillaume
    Villiermet, Adele
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2018, 32 (01): : 14 - 27