Efficient architecture for deep neural networks with heterogeneous sensitivity

被引:2
|
作者
Cho, Hyunjoong [1 ]
Jang, Jinhyeok [2 ]
Lee, Chanhyeok [1 ]
Yang, Seungjoon [1 ]
机构
[1] Ulsan Natl Inst Sci & Technol UNIST, Sch Elect & Comp Engn, Ulsan, South Korea
[2] Elect & Telecommun Res Inst ETRI, Daejeon, South Korea
基金
新加坡国家研究基金会;
关键词
Deep neural networks; Efficient architecture; Heterogeneous sensitivity; Constrained optimization; Simultaneous regularization parameter selection; L-CURVE; REGULARIZATION;
D O I
10.1016/j.neunet.2020.10.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we present a neural network that consists of nodes with heterogeneous sensitivity. Each node in a network is assigned a variable that determines the sensitivity with which it learns to perform a given task. The network is trained via a constrained optimization that maximizes the sparsity of the sensitivity variables while ensuring optimal network performance. As a result, the network learns to perform a given task using only a few sensitive nodes. Insensitive nodes, which are nodes with zero sensitivity, can be removed from a trained network to obtain a computationally efficient network. Removing zero-sensitivity nodes has no effect on the performance of the network because the network has already been trained to perform the task without them. The regularization parameter used to solve the optimization problem was simultaneously found during the training of the networks. To validate our approach, we designed networks with computationally efficient architectures for various tasks such as autoregression, object recognition, facial expression recognition, and object detection using various datasets. In our experiments, the networks designed by our proposed method provided the same or higher performances but with far less computational complexity. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页码:95 / 106
页数:12
相关论文
共 50 条
  • [1] Efficient Architecture Search for Deep Neural Networks
    Gottapu, Ram Deepak
    Dagli, Cihan H.
    [J]. COMPLEX ADAPTIVE SYSTEMS, 2020, 168 : 19 - 25
  • [2] Efficient Softmax Hardware Architecture for Deep Neural Networks
    Du, Gaoming
    Tian, Chao
    Li, Zhenmin
    Zhang, Duoli
    Yin, Yongsheng
    Ouyang, Yiming
    [J]. GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 75 - 80
  • [3] An Energy-Efficient Deep Learning Processor with Heterogeneous Multi-Core Architecture for Convolutional Neural Networks and Recurrent Neural Networks
    Shin, Dongjoo
    Lee, Jinmook
    Lee, Jinsu
    Lee, Juhyoung
    Yoo, Hoi-Jun
    [J]. 2017 IEEE SYMPOSIUM IN LOW-POWER AND HIGH-SPEED CHIPS (COOL CHIPS), 2017,
  • [4] Efficient multiscale modeling of heterogeneous materials using deep neural networks
    Aldakheel, Fadi
    Elsayed, Elsayed S. S.
    Zohdi, Tarek I. I.
    Wriggers, Peter
    [J]. COMPUTATIONAL MECHANICS, 2023, 72 (01) : 155 - 171
  • [5] Efficient multiscale modeling of heterogeneous materials using deep neural networks
    Fadi Aldakheel
    Elsayed S. Elsayed
    Tarek I. Zohdi
    Peter Wriggers
    [J]. Computational Mechanics, 2023, 72 : 155 - 171
  • [6] An Efficient and Fast Softmax Hardware Architecture (EFSHA) for Deep Neural Networks
    Hussain, Muhammad Awais
    Tsai, Tsung-Han
    [J]. 2021 IEEE 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS), 2021,
  • [7] HGNAS plus plus : Efficient Architecture Search for Heterogeneous Graph Neural Networks
    Gao, Yang
    Zhang, Peng
    Zhou, Chuan
    Yang, Hong
    Li, Zhao
    Hu, Yue
    Yu, Philip S. S.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 9448 - 9461
  • [8] An efficient and flexible inference system for serving heterogeneous ensembles of deep neural networks
    Pochelu, Pierrick
    Petiton, Serge G.
    Conche, Bruno
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5225 - 5232
  • [9] Architecture Disentanglement for Deep Neural Networks
    Hu, Jie
    Cao, Liujuan
    Tong, Tong
    Ye, Qixiang
    Zhang, Shengchuan
    Li, Ke
    Huang, Feiyue
    Shao, Ling
    Ji, Rongrong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 652 - 661
  • [10] An Efficient Event-driven Neuromorphic Architecture for Deep Spiking Neural Networks
    Duy-Anh Nguyen
    Duy-Hicu Bui
    Iacopi, Francesca
    Xuan-Tu Tran
    [J]. 32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 144 - 149