MACcelerator: Approximate Arithmetic Unit for Computational Acceleration

被引:1
|
作者
Sokolova, Alice [1 ,3 ]
Imani, Mohsen [2 ]
Huang, Andrew [1 ]
Garcia, Ricardo [1 ]
Morris, Justin [1 ,3 ]
Rosing, Tajana [1 ]
Aksanli, Baris [3 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92697 USA
[3] San Diego State Univ, Dept Elect & Comp Engn, San Diego, CA 92182 USA
基金
美国国家科学基金会;
关键词
Approximate computing; Energy Efficiency; Multiply-accumulator; Machine learning acceleration;
D O I
10.1109/ISQED51717.2021.9424293
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As computationally expensive applications such as neural networks gain popularity, approximate computing has emerged as a solution for significantly reducing the energy and latency costs of extensive computational workloads. In this paper, we propose a highly accurate approximate floating point Multiply-and-Accumulate (MAC) unit for GPUs which significantly decreases power and delay costs of a MAC operation. We propose an intelligent input analysis scheme to approximate the addition stage of a MAC operation and an efficient Approximate Multiplier to simplify the multiplication stage. Our design has tunable accuracy, offering the flexibility of exchanging accuracy for increased efficiency. We evaluated our proposed design over a range of multimedia and machine learning applications. Our design offers up to 2.18 x and 3.21 x Energy-Delay Product improvement for machine learning and multimedia applications respectively while providing comparable quality to an exact GPU.
引用
收藏
页码:444 / 449
页数:6
相关论文
共 50 条