An End-to-End Workflow to Efficiently Compress and Deploy DNN Classifiers on SoC/FPGA

被引：2

作者：

Molina, Romina Soledad ^{[1
,2
]}

Morales, Ivan Rene ^{[1
,2
]}

Crespo, Maria Liz ^{[1
]}

Costa, Veronica Gil ^{[3
]}

Carrato, Sergio ^{[4
]}

Ramponi, Giovanni

机构：

[1] Abdus Salam Int Ctr Theoret Phys, STI Unit, Multidisciplinary Lab MLab, I-34151 Trieste, Italy

[2] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, Italy

[3] Natl Univ San Luis, Dept Geol, San Luis, Argentina

[4] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, Italy

来源：

IEEE EMBEDDED SYSTEMS LETTERS | 2024年 / 16卷 / 03期

关键词：

Compression; deep neural networks; FPGA/SoC; machine learning (ML); workflow;

D O I：

10.1109/LES.2023.3343030

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Machine learning (ML) models have demonstrated discriminative and representative learning capabilities over a wide range of applications, even at the cost of high-computational complexity. Due to their parallel processing capabilities, reconfigurability, and low-power consumption, systems on chip based on a field programmable gate array (SoC/FPGA) have been used to face this challenge. Nevertheless, SoC/FPGA devices are resource-constrained, which implies the need for optimal use of technology for the computation and storage operations involved in ML-based inference. Consequently, mapping a deep neural network (DNN) architecture to a SoC/FPGA requires compression strategies to obtain a hardware design with a good compromise between effectiveness, memory footprint, and inference time. This letter presents an efficient end-to-end workflow for deploying DNNs on an SoC/FPGA by integrating hyperparameter tuning through Bayesian optimization (BO) with an ensemble of compression techniques.

引用

页码：255 / 258

页数：4

共 50 条

[31] Joint Training of Expanded End-to-end DNN for Text-dependent Speaker Verification
Heo, Hee-soo
Jung, Jee-weon
Yang, Il-ho
Yoon, Sung-hyun
Yu, Ha-jin
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1532 - 1536
[32] END-TO-END DNN BASED SPEAKER RECOGNITION INSPIRED BY I-VECTOR AND PLDA
Rohdin, Johan
Silnova, Anna
Diez, Mireia
Plchot, Oldrich
Matejka, Pavel
Burget, Lukas
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4874 - 4878
[33] SOMATIC-CELLS EFFICIENTLY JOIN UNRELATED DNA SEGMENTS END-TO-END
WILSON, JH
BERGET, PB
PIPAS, JM
MOLECULAR AND CELLULAR BIOLOGY, 1982, 2 (10) : 1258 - 1269
[34] DIANA: An End-to-End Hybrid DIgital and ANAlog Neural Network SoC for the Edge
Houshmand P.
Sarda G.M.
Jain V.
Ueyoshi K.
Papistas I.A.
Shi M.
Zheng Q.
Bhattacharjee D.
Mallik A.
Debacker P.
Verkest D.
Verhelst M.
IEEE Journal of Solid-State Circuits, 2023, 58 (01) : 203 - 215
[35] MCTE: MARRYING CONVOLUTION AND TRANSFORMER EFFICIENTLY FOR END-TO-END MEDICAL IMAGE SEGMENTATION
Li, Jiuqiang
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1100 - 1104
[36] A High-Performance Neural Network SoC for End-to-End Speaker Verification
Tsai, Tsung-Han
Chiang, Meng-Jui
IEEE ACCESS, 2024, 12 : 165482 - 165496
[37] A Regression-based Model for End-to-End Latency Prediction for DNN Execution on GPUs
Li, Ying
Sun, Yifan
Jog, Adwait
2023 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, ISPASS, 2023, : 343 - 345
[38] On-FPGA Spiking Neural Networks for End-to-End Neural Decoding
Leone, Gianluca
Raffo, Luigi
Meloni, Paolo
IEEE ACCESS, 2023, 11 : 41387 - 41399
[39] FlexiGAN: An End-to-End Solution for FPGA Acceleration of Generative Adversarial Networks
Yazdanbakhsh, Amir
Brzozowski, Michael
Khaleghi, Behnam
Ghodrati, Soroush
Samadi, Kambiz
Kim, Nam Sung
Esmaeilzadeh, Hadi
PROCEEDINGS 26TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2018), 2018, : 65 - 72
[40] End-to-End Rapid FPGA Prototyping for Embedded Proactive BMI Control
Huang, Nan-Sheng
Braun, Jan-Matthias
Carmo, Ricardo Rodrigues do
Larsen, Jorgen Christian
Manoonpong, Poramate
2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,

← 1 2 3 4 5 →