PAARD: Proximity-Aware All-Reduce Communication for Dragonfly Networks

被引:2
|
作者
Ma, Junchao [1 ]
Dong, Dezun [1 ]
Li, Cunlu [1 ]
Wu, Ke [1 ]
Xiao, Liquan [1 ]
机构
[1] Natl Univ Def Technol, Changsha, Peoples R China
基金
国家重点研发计划;
关键词
All-reduce operation; Dragonfly topology; Collective communication; OPTIMIZATION;
D O I
10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00045
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The all-reduce operation is one of the most commonly used collective communication operations, which is widely used in the research and engineering fields of high-performance computing(HPC) and distributed machine learning(DML). Previous optimization work for all-reduce operation is to design new algorithms only for different message size and different number of processors, and ignores the optimization that can be achieved by considering the topology. Dragonfly is a popular topology for current and future high-speed interconnection network. The hierarchical characteristics of dragonfly topology can be utilized to effectively reduce the hardware overhead while ensuring low end-to-end transmission latency. In this paper we propose PAARD, Proximity-Aware All-Reduce Communication on Dragonfly Networks. According to the characteristics of dragonfly topology, PAARD proposes an end-to-end solution to alleviate the congestion which could remarkably boost the performance. We carefully design the algorithm of PAARD to ensure desirable performance with acceptable overhead. To illustrate the effectiveness of PAARD, we analyze the performance of PAARD with the state-of-the-art algorithm, Halving-doubling(HD) algorithm and Ring algorithm. The simulation results demonstrate that in our design the completion time can be reduced up by 75.73 % for HD algorithm and 98.63% for Ring algorithm.
引用
收藏
页码:255 / 262
页数:8
相关论文
共 10 条
  • [1] Proximity-Aware Balanced Allocations in Cache Networks
    Pourmiri, Ali
    Siavoshani, Mahdi Jafari
    Shariatpanahi, Seyed Pooya
    2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 1068 - 1077
  • [2] An all-reduce operation in star networks using all-to-all broadcast communication pattern
    Oh, E
    Choi, H
    Primeaux, D
    COMPUTATIONAL SCIENCE - ICCS 2005, PT 1, PROCEEDINGS, 2005, 3514 : 419 - 426
  • [3] Proximity-aware offloading of person-to-person communications in LTE networks
    Quadri, Christian
    Gaito, Sabrina
    Rossi, Gian Paolo
    2016 13TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2016,
  • [4] Permutation-Equivariant and Proximity-Aware Graph Neural Networks With Stochastic Message Passing
    Zhang, Ziwei
    Niu, Chenhao
    Cui, Peng
    Pei, Jian
    Zhang, Bo
    Zhu, Wenwu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 6182 - 6193
  • [5] Proximity-aware research leadership recommendation in research collaboration via deep neural networks
    He, Chaocheng
    Wu, Jiang
    Zhang, Qingpeng
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2022, 73 (01) : 70 - 89
  • [6] A Method for Designing Proximity-Aware Regular Graph-Based Structured Overlay Networks
    Shiraishi, Youki
    Manada, Akiko
    Taenaka, Yuzo
    Kadobayashi, Youki
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 430 - 435
  • [7] Probabilistic Proximity-aware Resource Location in Peer-to-Peer Networks Using Resource Replication
    Analoui, M.
    Sharifi, M.
    Rezvani, M. H.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2010, 5 (04) : 418 - 431
  • [8] Distributed proximity-aware peer Clustering in BitTorrent-like peer-to-peer networks
    Xiao, Bin
    Yu, Jiadi
    Shao, Zili
    Li, Minglu
    EMBEDDED AND UBIQUITOUS COMPUTING, PROCEEDINGS, 2006, 4096 : 375 - 384
  • [9] A Proximity-Aware Technique for Distributing Replicas in DHT-Based P2P Networks
    Li Wen-xiang
    Du Zhao-jun
    Sheng Zhi-chao
    Zhu Yan-li
    Hu Tao
    2009 INTERNATIONAL SYMPOSIUM ON COMPUTER NETWORK AND MULTIMEDIA TECHNOLOGY (CNMT 2009), VOLUMES 1 AND 2, 2009, : 670 - 673
  • [10] CBT: A proximity-aware peer clustering system in large-scale BitTorrent-like peer-to-peer networks
    Yu, Jiadi
    Li, Minglu
    COMPUTER COMMUNICATIONS, 2008, 31 (03) : 591 - 602