Process Arrival Pattern Aware Alltoall and Allgather on InfiniBand Clusters

被引:0
|
作者
Ying Qian
Ahmad Afsahi
机构
[1] Queen’s University,Department of Electrical and Computer Engineering
关键词
Process arrival pattern; MPI_Alltoall; MPI_Allgather; RDMA; InfiniBand; Collective communications;
D O I
暂无
中图分类号
学科分类号
摘要
Recent studies show that MPI processes in real applications could arrive at an MPI collective operation at different times. This imbalanced process arrival pattern can significantly affect the performance of the collective operation. MPI_Alltoall() and MPI_Allgather() are communication-intensive collective operations that are used in many scientific applications. Therefore, their efficient implementations under different process arrival patterns are critical to the performance of scientific applications running on modern clusters. In this paper, we propose novel RDMA-based process arrival pattern aware MPI_Alltoall() and MPI_Allgather() for different message sizes over InfiniBand clusters. We also extend the algorithms to be shared memory aware for small to medium size messages under process arrival patterns. The performance results indicate that the proposed algorithms outperform the native MVAPICH implementations as well as other non-process arrival pattern aware algorithms when processes arrive at different times. Specifically, the RDMA-based process arrival pattern aware MPI_Alltoall() and MPI_Allgather() are 3.1 times faster than MVAPICH for 8 KB messages. On average, the applications studied in this paper (FT, RADIX, and N-BODY) achieve a speedup of 1.44 using the proposed algorithms.
引用
收藏
页码:473 / 493
页数:20
相关论文
共 18 条
  • [1] Process Arrival Pattern Aware Alltoall and Allgather on InfiniBand Clusters
    Qian, Ying
    Afsahi, Ahmad
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2011, 39 (04) : 473 - 493
  • [2] Process Arrival Pattern and Shared Memory Aware Alltoall on InfiniBand
    Qian, Ying
    Afsahi, Ahmad
    [J]. RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2009, 5759 : 250 - 260
  • [3] High Performance Alltoall and Allgather designs for InfiniBand MIC Clusters
    Venkatesh, Akshay
    Potluri, Sreeram
    Rajachandrasekar, Raghunath
    Luo, Miao
    Hamidouche, Khaled
    Panda, Dhabaleswar K.
    [J]. 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [4] Designing Topology-Aware Communication Schedules for Alltoall Operations in Large InfiniBand Clusters
    Subramoni, H.
    Kandalla, K.
    Jose, J.
    Tomko, K.
    Schulz, K.
    Pekurovsky, D.
    Panda, D. K.
    [J]. 2014 43RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2014, : 231 - 240
  • [5] Process arrival pattern aware algorithms for acceleration of scatter and gather operations
    Proficz, Jerzy
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (04): : 2735 - 2751
  • [6] Process arrival pattern aware algorithms for acceleration of scatter and gather operations
    Jerzy Proficz
    [J]. Cluster Computing, 2020, 23 : 2735 - 2751
  • [7] Design of Network Topology Aware Scheduling Services for Large InfiniBand Clusters
    Subramoni, H.
    Bureddy, D.
    Kandalla, K.
    Schulz, K.
    Barth, B.
    Perkins, J.
    Arnold, M.
    Panda, D. K.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [8] Minimizing Network Contention in InfiniBand Clusters with a QoS-Aware Data-Staging Framework
    Rajachandrasekar, Raghunath
    Jaswani, Jai
    Subramoni, Hari
    Panda, Dhabaleswar K.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2012, : 329 - 336
  • [9] Design and Evaluation of Network Topology-/Speed- Aware Broadcast Algorithms for InfiniBand Clusters
    Subramoni, H.
    Kandalla, K.
    Vienne, J.
    Sur, S.
    Barth, B.
    Tomko, K.
    McLay, R.
    Schulz, K.
    Panda, D. K.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2011, : 317 - 325
  • [10] Partition-aware routing to improve network isolation in InfiniBand based multi-tenant clusters
    Zahid, Feroz
    Gran, Ernst Gunnar
    Bogdanski, Bartosz
    Johnsen, Bjorn Dag
    Skeie, Tor
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 189 - 198