Accurate Floating-point Operation using Controlled Floating-point Precision

被引:0
|
作者
Zaki, Ahmad M. [1 ]
Bahaa-Eldin, Ayman M. [1 ]
El-Shafey, Mohamed H. [1 ]
Aly, Gamal M. [1 ]
机构
[1] Ain Shams Univ, Dept Comp & Syst Engn, Cairo, Egypt
关键词
dot-Product; floating-point; Hilbert matrix; accurate multiplication; accurate sum; ill-conditioned matrix; machine-epsilon; relative error;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Rounding and accumulation of errors when using floating point numbers are important factors in computer arithmetic. Many applications suffer from these problems. The underlying machine architecture and representation of floating point numbers play the major role in the level and value of errors in this type of calculations. A quantitative measure of a system error level is the machine epsilon. In the current representation of floating point numbers, the machine epsilon can be as small as 9.63E-35 in the 128 bit version of IEEE standard floating point representation system. In this work a novel solution that guarantees achieving the desired minimum error regardless of the machine architecture is presented. The proposed model can archive a machine epsilon of about 4.94E-324. A new representation model is given and a complete arithmetic system with basic operations is presented. The accuracy of the proposed method is verified by inverting a high order, Hilbert matrix, an ill-conditioned matrix that cannot be solved in the traditional floating point standard. Finally some comparisons are given.
引用
收藏
页码:696 / 701
页数:6
相关论文
共 50 条
  • [21] Accurate and Reliable Computing in Floating-Point Arithmetic
    Rump, Siegfried M.
    MATHEMATICAL SOFTWARE - ICMS 2010, 2010, 6327 : 105 - 108
  • [22] Precimonious: Tuning Assistant for Floating-Point Precision
    Rubio-Gonzalez, Cindy
    Cuong Nguyen
    Hong Diep Nguyen
    Demmel, James
    Kahan, William
    Sen, Koushik
    Bailey, David H.
    Iancu, Costin
    Hough, David
    2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2013,
  • [23] Accurate Complex Multiplication in Floating-Point Arithmetic
    Lefevre, Vincent
    Muller, Jean-Michel
    2019 IEEE 26TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2019, : 23 - 29
  • [24] Accurate floating-point summation: a new approach
    Eisinberg, A.
    Fedele, G.
    APPLIED MATHEMATICS AND COMPUTATION, 2007, 189 (01) : 410 - 424
  • [25] Double precision floating-point arithmetic on FPGAs
    Paschalakis, S
    Lee, P
    2003 IEEE INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), PROCEEDINGS, 2003, : 352 - 358
  • [26] Static analyses of the precision of floating-point operations
    Goubault, E
    STATIC ANALYSIS, PROCEEDINGS, 2001, 2126 : 234 - 259
  • [27] A dual precision IEEE floating-point multiplier
    Even, G
    Mueller, SM
    Seidel, PM
    INTEGRATION-THE VLSI JOURNAL, 2000, 29 (02) : 167 - 180
  • [28] FLOATING-POINT TECHNIQUE FOR EXTENDING AVAILABLE PRECISION
    DEKKER, TJ
    NUMERISCHE MATHEMATIK, 1971, 18 (03) : 224 - +
  • [29] SIMULATING LOW PRECISION FLOATING-POINT ARITHMETIC
    Higham, Nicholas J.
    Pranesh, Srikara
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2019, 41 (05): : C585 - C602
  • [30] MULTIPLE PRECISION FLOATING-POINT COMPUTATION IN FORTRAN
    VERMA, SB
    SHARAN, M
    SOFTWARE-PRACTICE & EXPERIENCE, 1980, 10 (03): : 163 - 173