Exploring FPGA designs for MX and beyond

Cited by: 0

Authors:

Samson, Ebby [1]

Mellempudi, Naveen [2]

Luk, Wayne [1]

Constantinides, George A. [1]

Affiliations:

[1] Imperial College London, London, England

[2] AMD, Austin, TX USA

Source:

2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024 | 2024

Keywords:

MX; FPGA; Brevitas; quantization; scale;

DOI:

10.1109/FPL64840.2024.00049

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

A number of companies recently worked together to release the new Open Compute Project MX standard for low-precision computation, aimed at efficient neural network implementation. In this paper, we describe and evaluate the first open-source FPGA implementation of the arithmetic defined in the standard. Our designs fully support all the standard's concrete formats for conversion into and out of MX formats and for the standard-defined arithmetic operations, as well as arbitrary fixed-point and floating-point formats. Certain elements of the standard are left as implementation-defined, and we present the first concrete FPGA-inspired choices for these elements, which we outline in the paper. Our library of optimized hardware components is available open source, and can be used to build larger systems. For this purpose, we also describe and release an open-source PyTorch library for quantization into the new standard, integrated with the Brevitas library so that the community can develop novel neural network designs quantized with MX formats in mind. We demonstrate the usability and efficacy of our libraries via the implementation of example neural networks such as ResNet-18 on the ImageNet ILSVRC12 dataset. Our testing shows that MX is very effective for formats such as INT5 or FP6, which are not natively supported on GPUs. This gives FPGAs an advantage, as they have the flexibility to implement a custom datapath and take advantage of the smaller area footprints offered by these formats.


Pages: 304-310

Page count: 7
