An End-to-End Workflow to Efficiently Compress and Deploy DNN Classifiers on SoC/FPGA

被引:2
|
作者
Molina, Romina Soledad [1 ,2 ]
Morales, Ivan Rene [1 ,2 ]
Crespo, Maria Liz [1 ]
Costa, Veronica Gil [3 ]
Carrato, Sergio [4 ]
Ramponi, Giovanni
机构
[1] Abdus Salam Int Ctr Theoret Phys, STI Unit, Multidisciplinary Lab MLab, I-34151 Trieste, Italy
[2] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, Italy
[3] Natl Univ San Luis, Dept Geol, San Luis, Argentina
[4] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, Italy
关键词
Compression; deep neural networks; FPGA/SoC; machine learning (ML); workflow;
D O I
10.1109/LES.2023.3343030
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning (ML) models have demonstrated discriminative and representative learning capabilities over a wide range of applications, even at the cost of high-computational complexity. Due to their parallel processing capabilities, reconfigurability, and low-power consumption, systems on chip based on a field programmable gate array (SoC/FPGA) have been used to face this challenge. Nevertheless, SoC/FPGA devices are resource-constrained, which implies the need for optimal use of technology for the computation and storage operations involved in ML-based inference. Consequently, mapping a deep neural network (DNN) architecture to a SoC/FPGA requires compression strategies to obtain a hardware design with a good compromise between effectiveness, memory footprint, and inference time. This letter presents an efficient end-to-end workflow for deploying DNNs on an SoC/FPGA by integrating hyperparameter tuning through Bayesian optimization (BO) with an ensemble of compression techniques.
引用
收藏
页码:255 / 258
页数:4
相关论文
共 50 条
  • [41] An End-to-End Tool Flow for FPGA-Accelerated Scientific Computing
    Stitt, Greg
    George, Alan
    Lam, Herman
    Smith, Melissa
    Aggarwal, Vikas
    Wang, Gongyu
    Coole, James
    Reardon, Casey
    Holland, Brian
    Koehler, Seth
    IEEE DESIGN & TEST OF COMPUTERS, 2011, 28 (04): : 68 - 77
  • [42] Design of Cryptographic model for End-to-End Encryption in FPGA based systems
    Chatterjee, Runa
    Chakraborty, Rajdeep
    Mandal, J. K.
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 459 - 465
  • [43] END-TO-END WORKFLOW FOR FINITE ELEMENT ANALYSIS OF TUMOR TREATING FIELDS IN GLIOBLASTOMAS
    Timmons, Joshua
    Lok, Edwin
    San, Pyay
    Bui, Kevin
    Wong, Eric
    NEURO-ONCOLOGY, 2017, 19 : 220 - 220
  • [44] Accelerating End-to-End Deep Learning Workflow With Codesign of Data Preprocessing and Scheduling
    Cheng, Yang
    Li, Dan
    Guo, Zhiyuan
    Jiang, Binyao
    Geng, Jinkun
    Bai, Wei
    Wu, Jianping
    Xiong, Yongqiang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (07) : 1802 - 1814
  • [45] Evaluating In-Workflow Messages for Improving Mental Models of End-to-End Encryption
    Akgul, Omer
    Bai, Wei
    Das, Shruti
    Mazurek, Michelle L.
    PROCEEDINGS OF THE 30TH USENIX SECURITY SYMPOSIUM, 2021, : 447 - 464
  • [46] Seq2science: an end-to-end workflow for functional genomics analysis
    van der Sande, Maarten
    Frolich, Siebren
    Schafers, Tilman
    Smits, Jos G. A.
    Snabel, Rebecca R.
    Rinzema, Sybren
    van Heeringen, Simon J.
    PEERJ, 2023, 11
  • [47] SCALABLE AND AUTOMATED END-TO-END WORKFLOW FOR TUMOR INFILTRATING LYMPHOCYTE (TIL) MANUFACTURING
    Xu, F.
    Lin, W.
    Carmichael, S.
    Bulsara, S.
    Iannarelli, J.
    Eaker, S.
    CYTOTHERAPY, 2022, 24 (05) : S185 - S186
  • [48] End-to-end workflow for finite element analysis of tumor treating fields in glioblastomas
    Timmons, Joshua J.
    Lok, Edwin
    San, Pyay
    Bui, Kevin
    Wong, Eric T.
    PHYSICS IN MEDICINE AND BIOLOGY, 2017, 62 (21): : 8264 - 8282
  • [49] Conceptual Framework to Support Clinical Trial Optimization and End-to-End Enrollment Workflow
    Jain, Neha M.
    Culley, Alison
    Knoop, Teresa
    Micheel, Christine
    Osterman, Travis
    Levy, Mia
    JCO CLINICAL CANCER INFORMATICS, 2019, 3
  • [50] The Machine Learning Pod (MLPod™) Canvas: An End-To-End Methodology to Design, Build, and Deploy ML Applications
    Gerogiannis, Demetrios P.
    Sotiriadis, Paul P.
    Nikou, Christophoros
    FRONTIERS OF ARTIFICIAL INTELLIGENCE, ETHICS, AND MULTIDISCIPLINARY APPLICATIONS, FAIEMA 2023, 2024, : 51 - 58