Sparbit: Towards to a Logarithmic-Cost and Data Locality-Aware MPI Allgather Algorithm

被引:0
|
作者
Wilton Jaciel Loch
Guilherme Piêgas Koslovski
机构
[1] Santa Catarina State University,Graduate Program in Applied Computing
来源
Journal of Grid Computing | 2023年 / 21卷
关键词
MPI; Allgather; Sparbit; Collective communication; Collective algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Collective communication operations are considered critical for improving the performance of exascale-ready and high-performance computing applications. On this work we focus on the Message-Passing Interface (MPI) Allgather and Allgatherv many to many collectives, which are amongst the most utilized and time-consuming operations. Each MPI algorithm for these calls suffers from different operational and performance limitations, that might include only working for restricted cases, requiring linear amounts of communication steps with the growth in number of processes, memory copies and shifts to assure correct data organization, and non-local data exchange patterns, most of which negatively contribute to the total operation time. All these characteristics create an environment that demands careful choices of alternatives to execute the call and where there is no silver bullet algorithm, which is the best for all cases. We propose the Stripe Parallel Binomial Trees (Sparbit) algorithm, which employs the binomial tree distribution to perform data exchanges with optimal time costs and no usage restrictions. It also maintains a much more local communication pattern that minimizes the delays due to long range exchanges, allowing the extraction of more performance from current systems when compared with asymptotically equivalent traditional algorithms. Experimental results indicate that nearly 40% of all calls to Allgather could experience mean reductions from 20% to 28% on execution time by employing Sparbit, with maximum reductions reaching near 74%. For Allgatherv, results are highly variable depending on the distribution of block sizes across the processes.
引用
收藏
相关论文
共 50 条
  • [1] Sparbit: Towards to a Logarithmic-Cost and Data Locality-Aware MPI Allgather Algorithm
    Loch, Wilton Jaciel
    Koslovski, Guilherme Piegas
    [J]. JOURNAL OF GRID COMPUTING, 2023, 21 (02)
  • [2] Sparbit: a new logarithmic-cost and data locality-aware MPI Allgather algorithm
    Loch, Wilton Jaciel
    Koslovski, Guilherme Piegas
    [J]. 2021 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2021), 2021, : 167 - 176
  • [3] LACS: A Locality-Aware Cost-Sensitive Cache Replacement Algorithm
    Kharbutli, Mazen
    Sheikh, Rami
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2014, 63 (08) : 1975 - 1987
  • [4] Availability of Data in Locality-Aware Unreliable Networks
    Geibig, Joanna
    [J]. MESH: 2009 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN MESH NETWORKS, 2009, : 163 - 166
  • [5] Towards Generalizable Deepfake Detection with Locality-Aware AutoEncoder
    Du, Mengnan
    Pentyala, Shiva
    Li, Yuening
    Hu, Xia
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 325 - 334
  • [6] An Efficient Data Layout Transformation Algorithm for Locality-Aware Parallel Sparse FFT
    Wang, Cheng
    Chandrasekaran, Sunita
    Chapman, Barbara
    [J]. PROCEEDINGS OF IA3 2017: SEVENTH WORKSHOP ON IRREGULAR APPLICATIONS: ARCHITECTURES AND ALGORITHMS, 2017,
  • [7] Locality-aware connection management and rank assignment for wide-area MPI
    Saito, Hideo
    Taura, Kenjiro
    [J]. CCGRID 2007: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, 2007, : 249 - +
  • [8] Locality-Aware Connection Management and Rank Assignment for Wide-Area MPI
    Saito, Hideo
    Taura, Kenjiro
    [J]. PROCEEDINGS OF THE 2007 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING PPOPP'07, 2007, : 150 - 151
  • [9] Data-Driven Locality-Aware Batch Scheduling
    Gonthier, Maxime
    Larsson, Elisabeth
    Marchal, Loris
    Nettelblad, Carl
    Thibault, Samuel
    [J]. 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 202 - 211
  • [10] Taming Big Data SVM with Locality-Aware Scheduling
    Ye, Mao
    Wang, Jun
    Yin, Jiangling
    Han, Dezhi
    [J]. 2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 37 - 44