A Runtime Switchable Multi-Phase Convolutional Neural Network for Resource-Constrained Systems

被引:0
|
作者
Jang, Jeonggyu [1 ]
Yang, Hoeseok [2 ]
机构
[1] Ajou Univ, Dept Elect & Comp Engn, Suwon 16499, South Korea
[2] Santa Clara Univ, Dept Elect & Comp Engn, Santa Clara, CA 95053 USA
关键词
Deep learning; convolutional neural network; neural network optimization; resource-constrained system; MULTIOBJECTIVE OPTIMIZATION;
D O I
10.1109/ACCESS.2023.3287998
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) are widely used in various systems, including text resource-constrained embedded systems or IoT devices. In such systems, it is typical to deploy compressed or pruned CNNs, instead of original ones, at the cost of reduced accuracy. Existing CNN pruning techniques have primarily focused on minimizing resource requirements. However, today's embedded systems are increasingly dynamic in both resource demands and availability. Thus, the previous techniques that only consider given static cases are no longer efficient. In this paper, we propose a novel text multi-phase CNN that enables a text multi-objective exploration of a number of pruning candidates out of a single CNN. In the proposed technique, a CNN can operate in various versions depending on which subsets of weights are used and can be transformed to the one best matches to the given constraint adaptively and efficiently. For that, a CNN is first pruned to the sparsest form; then a set of parameters (sub-network) is additionally supplemented as the phase goes by. As a result, a number of network versions for all different phases can be represented by a single network and they form a pareto solution over the accuracy and resource usage trade-off. In this work, we target CPU-based CNN inference engines as most embedded systems do not have the luxury of specialized text co-processor support such as GPUs or HW accelerators. The proposed technique has been implemented in a publicly available CPU inference engine, Darknet, and its effectiveness has been validated with a popular CNN in terms of design space exploration capability and runtime switchability.
引用
收藏
页码:62449 / 62461
页数:13
相关论文
共 50 条
  • [41] Energy Autonomy for Resource-Constrained Multi Robot Missions
    Fouad, Hassan
    Beltrame, Giovanni
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 7006 - 7013
  • [42] CommNets: Communicating Neural Network Architectures for Resource Constrained Systems
    Abudu, Prince M.
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9909 - 9910
  • [43] Resource-constrained maximum network throughput on space networks
    Xing, Yanling
    Ge, Ning
    Wang, Youzheng
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2015, 26 (02) : 215 - 223
  • [44] HEURISTIC PERFORMANCE AND NETWORK RESOURCE CHARACTERISTICS IN RESOURCE-CONSTRAINED PROJECT SCHEDULING
    ULUSOY, G
    OZDAMAR, L
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1989, 40 (12) : 1145 - 1152
  • [45] Neural network-based prediction of effective thermal conductivity of loose multi-phase systems
    Bhoopal, R. S.
    Sharma, P. K.
    Kumar, Sajjan
    Singh, Ramvir
    Beniwal, R. S.
    INDIAN JOURNAL OF PURE & APPLIED PHYSICS, 2013, 51 (02) : 118 - 124
  • [46] Network-level Design Space Exploration of Resource-constrained Networks-of-Systems
    Zhao, Zhuoran
    Barijough, Kamyar Mirzazad
    Gerstlauer, Andreas
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2020, 19 (04)
  • [47] Gradual Data Aggregation in Multi-granular Fact Tables on Resource-Constrained Systems
    Iftikhar, Nadeem
    Pedersen, Torben Bach
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT III, 2010, 6278 : 349 - 358
  • [48] Data-driven control for switched systems over a vulnerable and resource-constrained network
    Qi, Yiwen
    Zhao, Xiujuan
    Li, Xin
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2022, 359 (17): : 9569 - 9590
  • [49] Adaptive Sparse Deep Neural Network Inference on Resource-Constrained Cost-Efficient GPUs
    Dun, Ming
    Zhang, Xu
    Cao, Huawei
    Zhang, Yuan
    Huang, Junying
    Ye, Xiaochun
    2023 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE, HPEC, 2023,
  • [50] MRNDA: A Multicast Mechanism for Resource-Constrained Noc-Based Deep Neural Network Accelerators
    Ouyang Y.-M.
    Wang Q.
    Tang F.-Y.
    Zhou W.
    Li J.-H.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (03): : 872 - 884