Modern embedded systems are responsible for an increasing number of tasks that extensively employ floating-point (FP) computations. Ever-increasing efficiency requirements, coupled with the additional computational effort demanded by FP computations, motivate several microarchitectural optimizations of the FPU. This manuscript presents a novel modular FPU microarchitecture that targets modern embedded systems and considers heterogeneous workloads including both best-effort and accuracy-sensitive applications. The design optimizes the energy-delay-product (EDP)-accuracy-area figure of merit by allowing the precision of each FP operation to be configured independently at design time, while the FP dynamic range is kept common to the entire FPU to deliver a simpler microarchitecture. To ensure the correct execution of accuracy-sensitive applications, a novel compiler pass substitutes each FP operation for which only low-precision hardware support is offered with the corresponding soft-float function call. The assessment considers seven FPU variants encompassing three different state-of-the-art designs. The results on several representative use cases show that the binary32 FPU implementation offers an EDP gain of 15%, while, when the FPU implements a mix of binary32 and bfloat16 operations, the EDP gain is 19%, the reduction in resource utilization is 21%, and the average accuracy loss is less than 2.5%. Moreover, the resource utilization of our FPU variants is aligned with that of FPUs employing state-of-the-art, highly specialized FP hardware accelerators. Based on this assessment, a set of guidelines is drawn to steer the design of FP hardware support in modern embedded systems.
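
The abstract does not include code; the C sketch below is only an illustration of the kind of substitution the compiler pass is described as performing, written under assumed names. The function `softfloat_mul32` and the two dot-product kernels are hypothetical placeholders: in a real toolchain the call would target an existing soft-float runtime (for example, libgcc's `__mulsf3` or Berkeley SoftFloat), and the rewriting would be done automatically by the pass rather than by hand.

```c
/*
 * Illustrative sketch only, not the paper's compiler pass.
 * Idea: when the FPU offers only reduced-precision hardware for an
 * operation, accuracy-sensitive code is rewritten so that the operation
 * is carried out by a soft-float routine instead of the FPU, while
 * best-effort code keeps using the (low-precision) FPU instructions.
 */
#include <stdio.h>

/* Hypothetical placeholder for a full-precision software binary32 multiply.
 * Here it simply forwards to the native operator so the sketch stays short
 * and runnable; a real build would link a soft-float library routine. */
static float softfloat_mul32(float a, float b)
{
    return a * b;
}

/* Accuracy-sensitive kernel: the pass would emit the soft-float call
 * in place of the FPU multiply instruction. */
static float dot_accurate(const float *x, const float *y, int n)
{
    float acc = 0.0f;
    for (int i = 0; i < n; i++)
        acc += softfloat_mul32(x[i], y[i]);   /* soft-float substitution */
    return acc;
}

/* Best-effort kernel: left untouched, free to use the low-precision FPU. */
static float dot_best_effort(const float *x, const float *y, int n)
{
    float acc = 0.0f;
    for (int i = 0; i < n; i++)
        acc += x[i] * y[i];                   /* native FP multiply */
    return acc;
}

int main(void)
{
    const float x[4] = {1.5f, 2.25f, -0.5f, 4.0f};
    const float y[4] = {0.25f, 1.0f, 8.0f, -2.0f};

    printf("accurate    : %f\n", dot_accurate(x, y, 4));
    printf("best-effort : %f\n", dot_best_effort(x, y, 4));
    return 0;
}
```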