An End-to-End Workflow to Efficiently Compress and Deploy DNN Classifiers on SoC/FPGA

被引:2
|
作者
Molina, Romina Soledad [1 ,2 ]
Morales, Ivan Rene [1 ,2 ]
Crespo, Maria Liz [1 ]
Costa, Veronica Gil [3 ]
Carrato, Sergio [4 ]
Ramponi, Giovanni
机构
[1] Abdus Salam Int Ctr Theoret Phys, STI Unit, Multidisciplinary Lab MLab, I-34151 Trieste, Italy
[2] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, Italy
[3] Natl Univ San Luis, Dept Geol, San Luis, Argentina
[4] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, Italy
关键词
Compression; deep neural networks; FPGA/SoC; machine learning (ML); workflow;
D O I
10.1109/LES.2023.3343030
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML) models have demonstrated discriminative and representative learning capabilities over a wide range of applications, even at the cost of high-computational complexity. Due to their parallel processing capabilities, reconfigurability, and low-power consumption, systems on chip based on a field programmable gate array (SoC/FPGA) have been used to face this challenge. Nevertheless, SoC/FPGA devices are resource-constrained, which implies the need for optimal use of technology for the computation and storage operations involved in ML-based inference. Consequently, mapping a deep neural network (DNN) architecture to a SoC/FPGA requires compression strategies to obtain a hardware design with a good compromise between effectiveness, memory footprint, and inference time. This letter presents an efficient end-to-end workflow for deploying DNNs on an SoC/FPGA by integrating hyperparameter tuning through Bayesian optimization (BO) with an ensemble of compression techniques.
引用
收藏
页码:255 / 258
页数:4
相关论文
共 50 条
  • [21] FlexCNN: An End-to-end Framework for Composing CNN Accelerators on FPGA
    Basalama, Suhail
    Sohrabizadeh, Atefeh
    Wang, Jie
    Guo, Licheng
    Cong, Jason
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (02)
  • [22] End-to-End Scalable FPGA Accelerator for Deep Residual Networks
    Ma, Yufei
    Kim, Minkyu
    Cao, Yu
    Vrudhula, Sarma
    Seo, Jae-sun
    2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, : 456 - 459
  • [23] An open-source, end-to-end workflow for multidimensional photoemission spectroscopy
    Xian, R. Patrick
    Acremann, Yves
    Agustsson, Steinn Y.
    Dendzik, Maciej
    Buhlmann, Kevin
    Curcio, Davide
    Kutnyakhov, Dmytro
    Pressacco, Federico
    Heber, Michael
    Dong, Shuo
    Pincelli, Tommaso
    Demsar, Jure
    Wurth, Wilfried
    Hofmann, Philip
    Wolf, Martin
    Scheidgen, Markus
    Rettig, Laurenz
    Ernstorfer, Ralph
    SCIENTIFIC DATA, 2020, 7 (01)
  • [24] An end-to-end workflow pipeline for large-scale Grid computing
    McGough A.S.
    Cohen J.
    Darlington J.
    Katsiri E.
    Lee W.
    Panagiotidi S.
    Patel Y.
    Journal of Grid Computing, 2005, 3 (3-4) : 259 - 281
  • [25] Transforming Workflow Models into Automated End-to-End Acceptance Test Cases
    Boucher, Mathieu
    Mussbacher, Gunter
    2017 IEEE/ACM 9TH INTERNATIONAL WORKSHOP ON MODELLING IN SOFTWARE ENGINEERING (MISE), 2017, : 68 - 74
  • [26] An open-source, end-to-end workflow for multidimensional photoemission spectroscopy
    R. Patrick Xian
    Yves Acremann
    Steinn Y. Agustsson
    Maciej Dendzik
    Kevin Bühlmann
    Davide Curcio
    Dmytro Kutnyakhov
    Federico Pressacco
    Michael Heber
    Shuo Dong
    Tommaso Pincelli
    Jure Demsar
    Wilfried Wurth
    Philip Hofmann
    Martin Wolf
    Markus Scheidgen
    Laurenz Rettig
    Ralph Ernstorfer
    Scientific Data, 7
  • [27] Quantitative evaluation of footwear evidence: Initial workflow for an end-to-end system
    Venkatasubramanian, Gautham
    Hegde, Vighnesh
    Lund, Steven P.
    Iyer, Hari
    Herman, Martin
    JOURNAL OF FORENSIC SCIENCES, 2021, 66 (06) : 2232 - 2251
  • [28] Fully Automated End-to-End Neuroimaging Workflow for Mental Health Screening
    Thomas, Nikita
    Perumalla, Akhila
    Rao, Srinivasa
    Thangaraj, Venkatesan
    Ravi, Keerthi Sravan
    Geethanath, Sairam
    Kim, Hansuk
    Srinivasan, Girish
    2020 IEEE 20TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE 2020), 2020, : 642 - 647
  • [29] End-to-End Joint Antenna Selection Strategy and Distributed Compress and Forward Strategy for Relay Channels
    Rahul Vaze
    Robert W. Heath
    EURASIP Journal on Wireless Communications and Networking, 2009
  • [30] End-to-End Joint Antenna Selection Strategy and Distributed Compress and Forward Strategy for Relay Channels
    Vaze, Rahul
    Heath, Robert W., Jr.
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2009,