A Scalable MPI_Comm_split Algorithm for Exascale Computing

Cited by: 0
Authors
Sack, Paul [1]
Gropp, William [1]
Affiliation
[1] Univ Illinois, Urbana, IL 61801 USA
Keywords
PERFORMANCE;
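DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract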
Existing algorithms for creating communicators in MPI programs will not scale well to future exascale supercomputers containing millions of cores. In this work, we present a novel communicator-creation algorithm that does scale well into millions of processes using three techniques: replacing the sorting at the end of MPI_Comm_split with merging as the color and key table is built, sorting the color and key table in parallel, and using a distributed table to store the output communicator data rather than a replicated table. This reduces the time cost of MPI_Comm_split in the worst case we consider from 22 seconds to 0.37 seconds. Existing algorithms build a table with as many entries as processes, using vast amounts of memory. Our algorithm uses a small, fixed amount of memory per communicator after MPI_Comm_split has finished and uses a fraction of the memory used by the conventional algorithm for temporary storage during the execution of MPI_Comm_split.
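The first technique named in the abstract (merging while the color and key table is built, instead of sorting it at the end) can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the authors' implementation: the function name `split_by_merging`, the recursive-doubling exchange pattern, and the power-of-two process count are all hypothetical choices made for illustration. It does not cover the parallel sort or the distributed output table, and it ignores MPI_UNDEFINED colors.

```python
from heapq import merge


def split_by_merging(colors, keys):
    """Simulate MPI_Comm_split for len(colors) processes in one Python process.

    Hypothetical helper: returns new_rank[i], the rank of old rank i inside
    its color group, ordered by (key, old rank) as MPI_Comm_split specifies.
    """
    p = len(colors)
    # Assumption of this sketch: the process count is a power of two.
    assert p > 0 and p & (p - 1) == 0, "sketch assumes a power-of-two process count"

    # Each simulated process starts with a one-entry, trivially sorted table.
    tables = [[(colors[r], keys[r], r)] for r in range(p)]

    step = 1
    while step < p:
        # Recursive doubling (assumed pattern): rank r exchanges tables with
        # rank r XOR step, and both keep the merged result. Merging two sorted
        # tables keeps the result sorted, so no final sort is needed.
        tables = [list(merge(tables[r], tables[r ^ step])) for r in range(p)]
        step *= 2

    # Every simulated process now holds the same fully sorted table.
    new_rank = [0] * p
    position = 0
    prev_color = None
    for color, _key, old_rank in tables[0]:
        if color != prev_color:
            # A new color group starts, so ranks restart at zero.
            position = 0
            prev_color = color
        new_rank[old_rank] = position
        position += 1
    return new_rank
```

For example, `split_by_merging([0, 1, 0, 1], [3, 0, 1, 0])` places old ranks 2 and 0 in the color-0 group (in that order, because key 1 sorts before key 3) and old ranks 1 and 3 in the color-1 group (equal keys, so the old rank breaks the tie). Note that this sketch still replicates the full table on every simulated process, which is exactly the memory cost the abstract says the distributed table avoids.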
Pages: 1 / 10
Page count: 10