A Design Space Exploration Framework for Deployment of Resource-Constrained Deep Neural Networks

被引:0
|
作者
Zhang, Yan [1 ]
Pan, Lei [1 ]
Berkowitz, Phillip [2 ]
Lee, Mun Wai [2 ]
Riggan, Benjamin [3 ]
Bhattacharyya, Shuvra S. [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Intelligent Automat, Rockville, MD 20855 USA
[3] Univ Nebraska, Lincoln, NE 68588 USA
关键词
Design space exploration; Deep Neural Networks; Dataflow Modeling; Resource-constrained deployment; PARTICLE SWARM OPTIMIZATION;
D O I
10.1117/12.3014043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed great progress in the development of deep neural networks (DNNs), which has led to growing interest in deploying DNNs in resource-constrained environments such as network-edge and edge-cloud environments. To address objectives of efficient DNN inference, numerous approaches as well as specialized platforms have been designed for inference acceleration. The flexibility and diverse capabilities offered by these approaches and platforms result in large design spaces with complex trade-offs for DNN deployment. Relevant objectives involved in these trade-offs include inference accuracy, latency, throughput, memory requirements, and energy consumption. Tools that can effectively assist designers in deriving efficient DNN configurations for specific deployment scenarios are therefore needed. In this work, we present a design space exploration framework for this purpose. In the proposed framework, DNNs are represented as dataflow graphs using a lightweight-dataflow-based modeling tool, and schedules (strategies for managing processing resources across different DNN tasks) are modeled in a formal, abstract form using dataflow methods as well. The dataflow-based application and schedule representations are integrated systematically with a multiobjective particle swarm optimization (PSO) strategy, which enables efficient evaluation of implementation trade-offs and derivation of Pareto fronts involving alternative deployment configurations. Experimental results using different DNN architectures demonstrate the effectiveness of our proposed framework in exploring design spaces for DNN deployment.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Network-level Design Space Exploration of Resource-constrained Networks-of-Systems
    Zhao, Zhuoran
    Barijough, Kamyar Mirzazad
    Gerstlauer, Andreas
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2020, 19 (04)
  • [2] Survey of Progress in Deep Neural Networks for Resource-Constrained Applications
    Stuart, Morgan
    Manic, Milos
    IECON 2017 - 43RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2017, : 7259 - 7266
  • [3] AERO: Design Space Exploration Framework for Resource-Constrained CNN Mapping on Tile-Based Accelerators
    Yang, Simei
    Bhattacharjee, Debjyoti
    Kumar, Vinay B. Y.
    Chatterjee, Saikat
    De, Sayandip
    Debacker, Peter
    Verkest, Diederik
    Mallik, Arindam
    Catthoor, Francky
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (02) : 508 - 521
  • [4] Lightweight Run-Time Working Memory Compression for Deployment of Deep Neural Networks on Resource-Constrained MCUs
    Wang, Zhepeng
    Wu, Yawen
    Jia, Zhenge
    Shi, Yiyu
    Hu, Jingtong
    2021 26TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2021, : 607 - 614
  • [5] Selective Binarization based Architecture Design Methodology for Resource-constrained Computation of Deep Neural Networks
    Chandrapu, Ramesh Reddy
    Gyaneshwar, Dubacharla
    Channappayya, Sumohana
    Acharyya, Amit
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [6] NeuralScale: Efficient Scaling of Neurons for Resource-Constrained Deep Neural Networks
    Lee, Eugene
    Lee, Chen-Yi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1475 - 1484
  • [7] Resource-constrained maximum network throughput on space networks
    Yanling Xing
    Ning Ge
    Youzheng Wang
    JournalofSystemsEngineeringandElectronics, 2015, 26 (02) : 215 - 223
  • [8] Resource-constrained maximum network throughput on space networks
    Xing, Yanling
    Ge, Ning
    Wang, Youzheng
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2015, 26 (02) : 215 - 223
  • [9] A Design Strategy for the Efficient Implementation of Random Basis Neural Networks on Resource-Constrained Devices
    Edoardo Ragusa
    Christian Gianoglio
    Rodolfo Zunino
    Paolo Gastaldo
    Neural Processing Letters, 2020, 51 : 1611 - 1629
  • [10] A Design Strategy for the Efficient Implementation of Random Basis Neural Networks on Resource-Constrained Devices
    Ragusa, Edoardo
    Gianoglio, Christian
    Zunino, Rodolfo
    Gastaldo, Paolo
    NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1611 - 1629