Increasing Accuracy of Iterative Refinement in Limited Floating-Point Arithmetic on Half-Precision Accelerators

Cited by: 0

Authors:

Luszczek, Piotr [1]

Yamazaki, Ichitaro [2]

Dongarra, Jack [3,4]

Affiliations:

[1] Univ Tennessee, Knoxville, TN 37996 USA

[2] Sandia Natl Labs, Livermore, CA 94550 USA

[3] Univ Tennessee, Oak Ridge Natl Lab, Knoxville, TN 37996 USA

[4] Univ Manchester, Manchester, Lancs, England

Source:

2019 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC) | 2019

Keywords:

DOI:

Not available

Chinese Library Classification:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

The emergence of deep learning as a leading computational workload for machine learning tasks on large-scale cloud infrastructure installations has led to a plethora of accelerator hardware releases. However, the reduced precision and range of the floating-point numbers on these new platforms make it a non-trivial task to leverage these unprecedented advances in computational power for numerical linear algebra operations that come with a guarantee of robust error bounds. In order to address these concerns, we present a number of strategies that can be used to increase the accuracy of limited-precision iterative refinement. By limited precision, we mean 16-bit floating-point formats that are implemented in modern hardware accelerators and are not necessarily compliant with the IEEE half-precision specification. We include an explanation of the broader context and of connections to established IEEE floating-point standards and existing high-performance computing (HPC) benchmarks. We also present a new formulation of LU factorization, which we call signed square root LU, that produces more numerically balanced L and U factors and thereby directly addresses the limited range of the low-precision storage formats. The experimental results indicate that it is possible to recover substantial amounts of the accuracy in the system solution that would otherwise be lost. Previously, this could only be achieved by using iterative refinement based on single-precision floating-point arithmetic. The discussion will also explore the numerical stability issues that are important for robust linear solvers on these new hardware platforms.
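As a rough illustration of the limited-precision iterative refinement the abstract refers to, the following minimal sketch factors a small matrix with every stored value rounded to 16 bits, then corrects the solution using residuals computed in double precision. It rests on several assumptions: the 16-bit format is emulated as IEEE binary16 through Python's `struct` format `"e"` (the accelerator formats discussed in the paper need not match it), the factorization is ordinary LU with partial pivoting rather than the paper's signed square root LU, and the names `fp16`, `lu_fp16`, `lu_solve`, and `refine` are hypothetical helpers introduced only for this example.

```python
import struct

def fp16(x):
    """Round a Python float to the nearest IEEE binary16 value (emulated storage)."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

def lu_fp16(A):
    """In-place LU with partial pivoting; each stored result is rounded to fp16."""
    n = len(A)
    LU = [[fp16(v) for v in row] for row in A]
    perm = list(range(n))
    for k in range(n):
        # Pick the largest pivot in column k and swap rows.
        p = max(range(k, n), key=lambda i: abs(LU[i][k]))
        LU[k], LU[p] = LU[p], LU[k]
        perm[k], perm[p] = perm[p], perm[k]
        for i in range(k + 1, n):
            LU[i][k] = fp16(LU[i][k] / LU[k][k])  # multiplier, stored in L
            for j in range(k + 1, n):
                LU[i][j] = fp16(LU[i][j] - fp16(LU[i][k] * LU[k][j]))
    return LU, perm

def lu_solve(LU, perm, b):
    """Forward and back substitution in double precision with the fp16 factors."""
    n = len(b)
    y = [b[perm[i]] for i in range(n)]
    for i in range(n):                 # solve L y = P b (unit diagonal)
        for j in range(i):
            y[i] -= LU[i][j] * y[j]
    for i in reversed(range(n)):       # solve U x = y
        for j in range(i + 1, n):
            y[i] -= LU[i][j] * y[j]
        y[i] /= LU[i][i]
    return y

def refine(A, b, iters=10):
    """Iterative refinement: low-precision factors, double-precision residuals."""
    n = len(b)
    LU, perm = lu_fp16(A)
    x = lu_solve(LU, perm, b)
    for _ in range(iters):
        r = [b[i] - sum(A[i][j] * x[j] for j in range(n)) for i in range(n)]
        d = lu_solve(LU, perm, r)      # correction from the cheap factors
        x = [xi + di for xi, di in zip(x, d)]
    return x

# Small well-conditioned example with a known solution.
A = [[4.1, 1.3, 2.2], [1.3, 3.7, 0.6], [2.2, 0.6, 5.9]]
x_true = [1.0, -2.0, 3.0]
b = [sum(A[i][j] * x_true[j] for j in range(3)) for i in range(3)]
x = refine(A, b)
```

Calling `refine(A, b, iters=0)` returns the unrefined solve, which makes it easy to compare the error before and after refinement. Values outside the binary16 range make `struct.pack` raise `OverflowError`, which is one way the limited range mentioned in the abstract shows up in this emulation.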


Pages: 6

