An End-to-End Workflow to Efficiently Compress and Deploy DNN Classifiers on SoC/FPGA

Cited by: 2
Authors
Molina, Romina Soledad [1, 2]
Morales, Ivan Rene [1, 2]
Crespo, Maria Liz [1 ]
Costa, Veronica Gil [3 ]
Carrato, Sergio [4 ]
Ramponi, Giovanni
Affiliations
[1] Abdus Salam Int Ctr Theoret Phys, STI Unit, Multidisciplinary Lab MLab, I-34151 Trieste, Italy
[2] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, Italy
[3] Natl Univ San Luis, Dept Geol, San Luis, Argentina
[4] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, Italy
Keywords
Compression; deep neural networks; FPGA/SoC; machine learning (ML); workflow
DOI
10.1109/LES.2023.3343030
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Machine learning (ML) models have demonstrated discriminative and representative learning capabilities over a wide range of applications, albeit at the cost of high computational complexity. Owing to their parallel processing capabilities, reconfigurability, and low power consumption, systems on chip based on a field-programmable gate array (SoC/FPGA) have been used to address this challenge. Nevertheless, SoC/FPGA devices are resource-constrained, so the computation and storage operations involved in ML-based inference must make optimal use of the available hardware. Consequently, mapping a deep neural network (DNN) architecture to an SoC/FPGA requires compression strategies to obtain a hardware design with a good compromise between effectiveness, memory footprint, and inference time. This letter presents an efficient end-to-end workflow for deploying DNNs on an SoC/FPGA by integrating hyperparameter tuning through Bayesian optimization (BO) with an ensemble of compression techniques.
Pages: 255 - 258
Number of pages: 4