Accurate Floating-point Operation using Controlled Floating-point Precision

被引:0
|
作者
Zaki, Ahmad M. [1 ]
Bahaa-Eldin, Ayman M. [1 ]
El-Shafey, Mohamed H. [1 ]
Aly, Gamal M. [1 ]
机构
[1] Ain Shams Univ, Dept Comp & Syst Engn, Cairo, Egypt
关键词
dot-Product; floating-point; Hilbert matrix; accurate multiplication; accurate sum; ill-conditioned matrix; machine-epsilon; relative error;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Rounding and accumulation of errors when using floating point numbers are important factors in computer arithmetic. Many applications suffer from these problems. The underlying machine architecture and representation of floating point numbers play the major role in the level and value of errors in this type of calculations. A quantitative measure of a system error level is the machine epsilon. In the current representation of floating point numbers, the machine epsilon can be as small as 9.63E-35 in the 128 bit version of IEEE standard floating point representation system. In this work a novel solution that guarantees achieving the desired minimum error regardless of the machine architecture is presented. The proposed model can archive a machine epsilon of about 4.94E-324. A new representation model is given and a complete arithmetic system with basic operations is presented. The accuracy of the proposed method is verified by inverting a high order, Hilbert matrix, an ill-conditioned matrix that cannot be solved in the traditional floating point standard. Finally some comparisons are given.
引用
收藏
页码:696 / 701
页数:6
相关论文
共 50 条
  • [1] ACCURATE FLOATING-POINT SUMMATION
    MALCOLM, MA
    COMMUNICATIONS OF THE ACM, 1971, 14 (11) : 731 - &
  • [2] ACCURATE FLOATING-POINT SUMMATION
    LINZ, P
    COMMUNICATIONS OF THE ACM, 1970, 13 (06) : 361 - &
  • [3] Practically Accurate Floating-Point Math
    Toronto, Neil
    McCarthy, Jay
    COMPUTING IN SCIENCE & ENGINEERING, 2014, 16 (04) : 80 - +
  • [4] ARBITRARY PRECISION FLOATING-POINT ARITHMETIC
    MOTTELER, FC
    DR DOBBS JOURNAL, 1993, 18 (09): : 28 - &
  • [5] Accurate Parallel Floating-Point Accumulation
    Kadric, Edin
    Gurniak, Paul
    DeHon, Andre
    2013 21ST IEEE SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2013, : 153 - 162
  • [6] Accurate Floating-Point Product and Exponentiation
    Graillat, Stef
    IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (07) : 994 - 1000
  • [7] Synthesizing Accurate Floating-Point Formulas
    Ioualalen, Arnault
    Martel, Matthieu
    PROCEEDINGS OF THE 2013 IEEE 24TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 13), 2013, : 113 - 116
  • [8] Accurate Parallel Floating-Point Accumulation
    Kadric, Edin
    Gurniak, Paul
    DeHon, Andre
    IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (11) : 3224 - 3238
  • [9] Sinking Point: Dynamic Precision Tracking for Floating-Point
    Zorn, Bill
    Grossman, Dan
    Tatlock, Zach
    CONFERENCE FOR NEXT GENERATION ARITHMETIC 2019 (CONGA), 2019,
  • [10] An IEEE 754 Double-Precision Floating-Point Multiplier for Denormalized and Normalized Floating-Point Numbers
    Thompson, Ross
    Stine, James E.
    PROCEEDINGS OF THE ASAP2015 2015 IEEE 26TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, 2015, : 62 - 63