Extending collective operations with application semantics for improving multi-cluster performance

被引:0
|
作者
Bongo, LA [1 ]
Anshus, O [1 ]
Bjorndalen, JM [1 ]
Larsen, T [1 ]
机构
[1] Univ Tromso, Dept Comp Sci, N-9001 Tromso, Norway
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We identify two ways of increasing the performance of allreduce-style of collective operations in a multi-cluster with large WAN latencies: (i) hiding latency in system noise, and (H) conditional allreduce where knowledge about the application is used to reduce the number of WAN messages. In our multicluster, system noise was not large enough to hide the WAN latency. But, the latency could be hidden using conditional-allreduce, since on many iterations only cluster-local values were needed, and many of the values needed from other clusters were prefetched. A speedup of 2.4 was achieved for a microbenchmark. Prefetching introduced a small overhead in the cluster with the slowest hosts.
引用
收藏
页码:320 / 327
页数:8
相关论文
共 49 条
  • [21] SCTP, XTP and TCP as Transport Protocols for High Performance Computing on Multi-cluster Grid Environments
    Viegas, Diogo R.
    Mendonca, R. P.
    Dantas, Mario A. R.
    Bauer, Michael A.
    [J]. HIGH PERFORMANCE COMPUTING SYSTEMS AND APPLICATIONS, 2010, 5976 : 230 - +
  • [22] Performance Analysis of Communication Networks in Multi-Cluster Systems under Bursty Traffic with Communication Locality
    Wu, Yulei
    Min, Geyong
    Li, Keqiu
    Javadi, Bahman
    [J]. GLOBECOM 2009 - 2009 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-8, 2009, : 5191 - +
  • [23] An Exploration Methodology for a Customizable OpenCL Stereo-Matching Application Targeted to an Industrial Multi-Cluster Architecture
    Paone, Edoardo
    Palermo, Gianluca
    Zaccaria, Vittorio
    Silvano, Cristina
    Melpignano, Diego
    Haugou, Germain
    Lepley, Thierry
    [J]. CODES+ISSS'12:PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON HARDWARE/SOFTWARE-CODESIGN AND SYSTEM SYNTHESIS, 2012, : 503 - 512
  • [24] Performance Evaluation in Single or Multi-Cluster C-RAN Supporting Quasi-Random Traffic
    Chousainov, Iskanter-Alexandros
    Moscholios, Ioannis
    Kaloxylos, Alexandros
    Logothetis, Michael
    [J]. JOURNAL OF COMMUNICATIONS SOFTWARE AND SYSTEMS, 2020, 16 (02) : 170 - 179
  • [25] Performance analysis and optimization of MPI collective operations on multi-core clusters
    Bibo Tu
    Jianping Fan
    Jianfeng Zhan
    Xiaofang Zhao
    [J]. The Journal of Supercomputing, 2012, 60 : 141 - 162
  • [26] Multi-cluster high performance computing method based on multimodal tensor in enterprise resource planning system
    Zhang, Hongjun
    Xia, Ruoyan
    Ye, Hao
    Shi, Desheng
    Li, Peng
    Fan, Weibei
    [J]. PHYSICAL COMMUNICATION, 2024, 62
  • [27] Performance analysis and optimization of MPI collective operations on multi-core clusters
    Tu, Bibo
    Fan, Jianping
    Zhan, Jianfeng
    Zhao, Xiaofang
    [J]. JOURNAL OF SUPERCOMPUTING, 2012, 60 (01): : 141 - 162
  • [28] Development and application of dense multi-cluster fracturing in horizontal wells for low permeability and low pressure coal reservoir
    Cao Y.
    Shi B.
    Tian L.
    Li F.
    Cao Y.
    Dong S.
    Zhou D.
    [J]. Meitan Xuebao/Journal of the China Coal Society, 2020, 45 (10): : 3512 - 3521
  • [29] Improving application layer multicast forwarding performance by offloading multisend operations
    Cao, Jijun
    Su, Jinshu
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2015, 30 (03): : 199 - 210
  • [30] Performance modelling of adaptive routing communication networks in multi-cluster systems under bit-reversal traffic
    Sharifi, Hojjat
    Akbari, Mohammad K.
    Javadi, Bahman
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2014, 12 (04) : 442 - 465