ddRingAllreduce: a high-precision RingAllreduce algorithm

被引:2
|
作者
Lei, Xiaojun [1 ]
Gu, Tongxiang [2 ]
Xu, Xiaowen [2 ,3 ]
机构
[1] China Acad Engn Phys, Grad Sch, 6 Huayuan Rd, Beijing 100193, Peoples R China
[2] Inst Appl Phys & Computat Math, Lab Computat Phys, 6 Huayuan Rd, Beijing 100088, Peoples R China
[3] CAEP Software Ctr Numer Simulat, 6 Huayuan Rd, Beijing 100088, Peoples R China
基金
中国国家自然科学基金;
关键词
RingAllreduce; ddRingAllreduce; Collective communication; Double-double arithmetic; High precision; FLOATING-POINT SUMMATION; ACCURATE; FAITHFUL;
D O I
10.1007/s42514-023-00150-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
For complex problems in scientific computing, parallel computing is almost the only way to solve them, in which global reduction is one of the most frequently used operations. Due to the existence of floating-point rounding errors, the existing global reduction algorithm may result in inaccurate or different between two runs, which are difficult to meet the needs of complex applications. Since the communication cost of RingAllreduce is a constant, independent of the number of processes, it is an effective algorithm when a large amount of data needs to be communicated. However, it faces the same problem as the general global reduction operation, and it is necessary to develop a high-precision RingAllreduce algorithm. In this paper, by combining double-double arithmetic and RingAllreduce algorithm, we propose a high-precision RingAllreduce algorithm, called ddRingAllreduce algorithm. The theoretical error of the proposed algorithm is analyzed and the compact error bounds are derived. We have carried out a large number of parallel numerical experiments and obtained numerical results consistent with the theoretical analysis, and ddRingAllreduce is accurate in the case that RingAllreduce is inaccurate or miscalculated. At the same time, we also analyze the relationship between the problem size and the cost of using double-double arithmetic through experiments, at a small scale, the ddRingAllreduce algorithm can achieve higher accuracy with relatively less time overhead.
引用
收藏
页码:245 / 257
页数:13
相关论文
共 50 条
  • [1] ddRingAllreduce: a high-precision RingAllreduce algorithm
    Xiaojun Lei
    Tongxiang Gu
    Xiaowen Xu
    CCF Transactions on High Performance Computing, 2023, 5 : 245 - 257
  • [2] A high-precision algorithm for axisymmetric flow
    Gokhman, A
    Gokhman, D
    MATHEMATICAL PROBLEMS IN ENGINEERING, 1995, 1 (01) : 11 - 25
  • [3] Fastand High-Precision Rectangle Detection Algorithm
    Lu Lu
    Yang Jiong
    Liang Jie
    Jiang Yulin
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (18)
  • [4] ON THE PERCEPTRON LEARNING ALGORITHM ON DATA WITH HIGH-PRECISION
    SIU, KY
    DEMBO, A
    KAILATH, T
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1994, 48 (02) : 347 - 356
  • [5] Detection Algorithm of Remainder in High-Precision Spacecraft
    Liu H.
    Liu J.
    Wei C.
    1600, Science Press (48): : 716 - 724
  • [6] High-Precision Positioning Algorithm Based on GPS
    Li, Cui-Xia
    Liu, Wei-Ming
    Wu, Jian-Wei
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 5, 2010, : 364 - 368
  • [7] High-precision accounting for high-precision network services
    Clemm, Alexander
    Strassner, John
    2021 IEEE 22ND INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (IEEE HPSR), 2021,
  • [8] High-Precision and Lightweight Facial Landmark Detection Algorithm
    Xu Lihuai
    Li Zhe
    Jiang Jiajia
    Duan Fajie
    Fu Xiao
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (24)
  • [9] High-Precision ventilator with Compensation Algorithm and Automatic Regulation
    An Peng
    Song Jia-tao
    2010 2ND INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS PROCEEDINGS (DBTA), 2010,
  • [10] A High-Precision Collaboration Positioning Algorithm With Indoor Environment
    Zhao Junhui
    Jin, Xiaocen
    Yi, Gong
    2014 9TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA (CHINACOM), 2014, : 22 - 25