Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision Accelerators

被引:0
|
作者
Luszczek, Piotr [1 ]
Yamazaki, Ichitaro [2 ]
Dongarra, Jack [3 ,4 ]
机构
[1] Univ Tennessee, Knoxville, TN 37996 USA
[2] Sandia Natl Labs, Livermore, CA 94550 USA
[3] Univ Tennessee, Oak Ridge Natl Lab, Knoxville, TN 37996 USA
[4] Univ Manchester, Manchester, Lancs, England
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of deep learning as a leading computational workload for machine learning tasks on large-scale cloud infrastructure installations has led to plethora of accelerator hardware releases. However, the reduced precision and range of the floating-point numbers on these new platforms makes it a non-trivial task to leverage these unprecedented advances in computational power for numerical linear algebra operations that come with a guarantee of robust error bounds. In order to address these concerns, we present a number of strategies that can be used to increase the accuracy of limited-precision iterative refinement. By limited precision, we mean 16-bit floating-point formats implemented in modern hardware accelerators and are not necessarily compliant with the IEEE half-precision specification. We include the explanation of a broader context and connections to established IEEE floating-point standards and existing high-performance computing (HPC) benchmarks. We also present a new formulation of LU factorization that we call signed square root LU which produces more numerically balanced L and U factors which directly address the problems of limited range of the low-precision storage formats. The experimental results indicate that it is possible to recover substantial amounts of the accuracy in the system solution that would otherwise be lost. Previously, this could only be achieved by using iterative refinement based on single-precision floating-point arithmetic. The discussion will also explore the numerical stability issues that are important for robust linear solvers on these new hardware platforms.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Towards Numerical Benchmark for Half-Precision Floating Point Arithmetic
    Luszczek, Piotr
    Kurzak, Jakub
    Yamazaki, Ichitaro
    Dongarra, Jack
    [J]. 2017 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2017,
  • [2] Half-Precision Floating-Point Formats for PageRank: Opportunities and Challenges
    Molahosseini, Amir Sabbagh
    Vandierendonck, Hans
    [J]. 2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [3] Seismic modeling and inversion using half-precision floating-point numbers
    Fabien-Ouellet, Gabriel
    [J]. GEOPHYSICS, 2020, 85 (03) : F64 - F75
  • [4] ARBITRARY PRECISION FLOATING-POINT ARITHMETIC
    MOTTELER, FC
    [J]. DR DOBBS JOURNAL, 1993, 18 (09): : 28 - &
  • [5] Reconfigurable half-precision floating-point real/complex fused multiply and add unit
    Jean Jenifer Nesam, J.
    Sivanantham, S.
    [J]. International Journal of Materials and Product Technology, 2020, 60 (01) : 58 - 72
  • [6] Reconfigurable half-precision floating-point real/complex fused multiply and add unit
    Nesam, J. Jean Jenifer
    Sivanantham, S.
    [J]. INTERNATIONAL JOURNAL OF MATERIALS & PRODUCT TECHNOLOGY, 2020, 60 (01): : 58 - 72
  • [7] A Study on Convolution using Half-Precision Floating-Point Numbers on GPU for Radio Astronomy Deconvolution
    Seznec, Mickael
    Gac, Nicolas
    Ferrari, Andre
    Orieux, Francois
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2018, : 170 - 175
  • [8] Double precision floating-point arithmetic on FPGAs
    Paschalakis, S
    Lee, P
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), PROCEEDINGS, 2003, : 352 - 358
  • [9] SIMULATING LOW PRECISION FLOATING-POINT ARITHMETIC
    Higham, Nicholas J.
    Pranesh, Srikara
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2019, 41 (05): : C585 - C602
  • [10] The Design of Fast and Energy-Efficient Linear Solvers: On the Potential of Half-Precision Arithmetic and Iterative Refinement Techniques
    Haidar, Azzam
    Abdelfattah, Ahmad
    Zounon, Mawussi
    Wu, Panruo
    Pranesh, Srikara
    Tomov, Stanimire
    Dongarra, Jack
    [J]. COMPUTATIONAL SCIENCE - ICCS 2018, PT I, 2018, 10860 : 586 - 600