Synchronizing MPI Processes in Space and Time

Cited by: 1
Authors
Schuchart, Joseph [1]
Hunold, Sascha [2]
Bosilca, George [1]
Affiliations
[1] Univ Tennessee, Innovat Comp Lab, Knoxville, TN 37996 USA
[2] TU Wien, Vienna, Austria
Funding
Austrian Science Fund;
Keywords
MPI; collective communication; process synchronization; clock synchronization; OSU benchmarks; reduce; allreduce; broadcast; barrier; BENCHMARKING;
DOI
10.1145/3615318.3615325
Chinese Library Classification number
TP31 [Computer Software];
Subject classification code
081202 ; 0835 ;
Abstract
Performance benchmarks are an integral part of the development and evaluation of parallel algorithms, both in distributed applications and in MPI implementations themselves. The initial step of the benchmark process is to obtain a common timestamp to mark the start of an operation across all involved processes, and the state of the art in many applications and widely used MPI benchmark suites is the use of MPI barriers. In this paper, we show that the synchronization in space provided by an MPI_Barrier is insufficient for proper benchmark results of parallel distributed algorithms, using MPI collective operations as examples. The resulting lack of a global start timestamp for an operation leads to skewed results, with a significant impact of the barrier algorithm used. In order to mitigate these issues, we propose and discuss the implementation of MPIX_Harmonize, which extends the synchronization in space provided by MPI_Barrier with a time synchronization to guarantee a common starting timestamp across all involved processes. By replacing the use of MPI_Barrier with MPIX_Harmonize, benchmark implementors can eliminate skews resulting from barrier algorithms and achieve stable performance benchmark results. We will show that proper time synchronization can have a significant impact on the benchmark results for various implementations of MPI_Allreduce, MPI_Reduce, and MPI_Bcast.
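The difference between synchronization in space and synchronization in time can be illustrated with a toy model. The sketch below is not the paper's MPIX_Harmonize implementation and does not use MPI at all. It is a minimal simulation under stated assumptions: barrier exit times are given as plain numbers in microseconds, the agreed start time is chosen as the latest barrier exit plus a slack, and each process reaches that start time up to an assumed clock-offset estimation error. All function names and parameters are hypothetical.

```python
def barrier_start_times(exit_times):
    """Barrier-only start: each process begins the timed operation
    as soon as it leaves the barrier, so start times equal exit times."""
    return list(exit_times)


def harmonized_start_times(exit_times, offset_errors, slack):
    """Time-synchronized start (toy model, not the paper's algorithm).

    All processes agree on one global start time that lies after the
    latest barrier exit (assumption: latest exit plus `slack`). Each
    process waits on its own clock until that time, so its true start
    deviates only by its clock-offset estimation error.
    """
    start_global = max(exit_times) + slack
    return [start_global + err for err in offset_errors]


def spread(times):
    """Skew between the earliest and the latest process."""
    return max(times) - min(times)


if __name__ == "__main__":
    # Hypothetical barrier exit times in microseconds for three processes.
    exits = [0.0, 3.0, 7.5]
    # Hypothetical clock-offset estimation errors in microseconds.
    errors = [0.1, -0.1, 0.0]

    print("barrier-only start skew:", spread(barrier_start_times(exits)))
    print("harmonized start skew:",
          spread(harmonized_start_times(exits, errors, slack=10.0)))
```

In this model the start skew after a barrier equals the spread of the barrier exit times, which depends on the barrier algorithm, while the skew after a time-synchronized start depends only on how accurately the clocks are aligned.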
Pages: 11