A Design Space Exploration Framework for Deployment of Resource-Constrained Deep Neural Networks

被引:0
|
作者
Zhang, Yan [1 ]
Pan, Lei [1 ]
Berkowitz, Phillip [2 ]
Lee, Mun Wai [2 ]
Riggan, Benjamin [3 ]
Bhattacharyya, Shuvra S. [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Intelligent Automat, Rockville, MD 20855 USA
[3] Univ Nebraska, Lincoln, NE 68588 USA
关键词
Design space exploration; Deep Neural Networks; Dataflow Modeling; Resource-constrained deployment; PARTICLE SWARM OPTIMIZATION;
D O I
10.1117/12.3014043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed great progress in the development of deep neural networks (DNNs), which has led to growing interest in deploying DNNs in resource-constrained environments such as network-edge and edge-cloud environments. To address objectives of efficient DNN inference, numerous approaches as well as specialized platforms have been designed for inference acceleration. The flexibility and diverse capabilities offered by these approaches and platforms result in large design spaces with complex trade-offs for DNN deployment. Relevant objectives involved in these trade-offs include inference accuracy, latency, throughput, memory requirements, and energy consumption. Tools that can effectively assist designers in deriving efficient DNN configurations for specific deployment scenarios are therefore needed. In this work, we present a design space exploration framework for this purpose. In the proposed framework, DNNs are represented as dataflow graphs using a lightweight-dataflow-based modeling tool, and schedules (strategies for managing processing resources across different DNN tasks) are modeled in a formal, abstract form using dataflow methods as well. The dataflow-based application and schedule representations are integrated systematically with a multiobjective particle swarm optimization (PSO) strategy, which enables efficient evaluation of implementation trade-offs and derivation of Pareto fronts involving alternative deployment configurations. Experimental results using different DNN architectures demonstrate the effectiveness of our proposed framework in exploring design spaces for DNN deployment.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Improving training datasets for resource-constrained speaker recognition neural networks
    Bousquet, Pierre-Michel
    Rouvier, Mickael
    INTERSPEECH 2023, 2023, : 3167 - 3171
  • [32] MorphNet: Fast & Simple Resource-Constrained Structure Learning of Deep Networks
    Gordon, Ariel
    Eban, Elad
    Nachum, Ofir
    Chen, Bo
    Wu, Hao
    Yang, Tien-Ju
    Choi, Edward
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1586 - 1595
  • [33] Squeezing Accumulators in Binary Neural Networks for Extremely Resource-Constrained Applications
    Azamat, Azat
    Park, Jaewoo
    Lee, Jongeun
    2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
  • [34] Design Patterns for Resource-Constrained Automated Deep-Learning Methods
    Tuggener, Lukas
    Amirian, Mohammadreza
    Benites, Fernando
    von Daniken, Pius
    Gupta, Prakhar
    Schilling, Frank-Peter
    Stadelmann, Thilo
    AI, 2020, 1 (04) : 510 - 538
  • [35] FeatherNet: An Accelerated Convolutional Neural Network Design for Resource-constrained FPGAs
    Morcel, Raghid
    Hajj, Hazem M.
    Saghir, Mazen A. R.
    Akkary, Haitham
    Artail, Hassan
    Khanna, Rahul
    Keshavamurthy, Anil
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2019, 12 (02)
  • [36] A Design Space Exploration Framework for Convolutional Neural Networks Implemented on Edge Devices
    Tsimpourlas, Foivos
    Papadopoulos, Lazaros
    Bartsokas, Anastasios
    Soudris, Dimitrios
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (11) : 2212 - 2221
  • [37] DESIGN AND MODELING OF RESOURCE-CONSTRAINED IOT BASED BODY AREA NETWORKS
    Zahoor, Saniya
    Mir, Roohie Naaz
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2020, 21 (03): : 497 - 514
  • [38] Design and modeling of resource-constrained IoT based body area networks
    Zahoor S.
    Mir R.N.
    Scalable Computing, 2020, 21 (03): : 497 - 514
  • [39] Smart-DNN: Efficiently Reducing the Memory Requirements of Running Deep Neural Networks on Resource-constrained Platforms
    Hu, Zhenbo
    Zou, Xiangyu
    Xia, Wen
    Zhao, Yuhong
    Zhang, Weizhe
    Wu, Donglei
    2021 IEEE 39TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2021), 2021, : 533 - 541
  • [40] Design Space Exploration of FPGA-Based Deep Convolutional Neural Networks
    Motamedi, Mohammad
    Gysel, Philipp
    Akella, Venkatesh
    Ghiasi, Soheil
    2016 21ST ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2016, : 575 - 580