Efficient architecture for deep neural networks with heterogeneous sensitivity

被引:2
|
作者
Cho, Hyunjoong [1 ]
Jang, Jinhyeok [2 ]
Lee, Chanhyeok [1 ]
Yang, Seungjoon [1 ]
机构
[1] Ulsan Natl Inst Sci & Technol UNIST, Sch Elect & Comp Engn, Ulsan, South Korea
[2] Elect & Telecommun Res Inst ETRI, Daejeon, South Korea
基金
新加坡国家研究基金会;
关键词
Deep neural networks; Efficient architecture; Heterogeneous sensitivity; Constrained optimization; Simultaneous regularization parameter selection; L-CURVE; REGULARIZATION;
D O I
10.1016/j.neunet.2020.10.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we present a neural network that consists of nodes with heterogeneous sensitivity. Each node in a network is assigned a variable that determines the sensitivity with which it learns to perform a given task. The network is trained via a constrained optimization that maximizes the sparsity of the sensitivity variables while ensuring optimal network performance. As a result, the network learns to perform a given task using only a few sensitive nodes. Insensitive nodes, which are nodes with zero sensitivity, can be removed from a trained network to obtain a computationally efficient network. Removing zero-sensitivity nodes has no effect on the performance of the network because the network has already been trained to perform the task without them. The regularization parameter used to solve the optimization problem was simultaneously found during the training of the networks. To validate our approach, we designed networks with computationally efficient architectures for various tasks such as autoregression, object recognition, facial expression recognition, and object detection using various datasets. In our experiments, the networks designed by our proposed method provided the same or higher performances but with far less computational complexity. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页码:95 / 106
页数:12
相关论文
共 50 条
  • [31] Bit Efficient Quantization for Deep Neural Networks
    Nayak, Prateeth
    Zhang, David
    Chai, Sek
    FIFTH WORKSHOP ON ENERGY EFFICIENT MACHINE LEARNING AND COGNITIVE COMPUTING - NEURIPS EDITION (EMC2-NIPS 2019), 2019, : 52 - 56
  • [32] The Efficient Hedging Frontier with Deep Neural Networks
    Gong, Zheng
    Ventre, Carmine
    O'Hara, John
    ICAIF 2021: THE SECOND ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, 2021,
  • [33] Deep Neural Networks with Efficient Guaranteed Invariances
    Rath, Matthias
    Condurache, Alexandru Paul
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [34] Learning deep morphological networks with neural architecture search
    Hu, Yufei
    Belkhir, Nacim
    Angulo, Jesus
    Yao, Angela
    Franchi, Gianni
    PATTERN RECOGNITION, 2022, 131
  • [35] A Fast Compressed Hardware Architecture for Deep Neural Networks
    Ansari, Anaam
    Shelton, Allen
    Ogunfunmi, Tokunbo
    Panchbhaiyye, Vineet
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 370 - 374
  • [36] A Domain-Specific Architecture for Deep Neural Networks
    Jouppi, Norman P.
    Young, Cliff
    Patil, Nishant
    Patterson, David
    COMMUNICATIONS OF THE ACM, 2018, 61 (09) : 50 - 59
  • [37] NeuroUnlock: Unlocking the Architecture of Obfuscated Deep Neural Networks
    Ahmadi, Mahya Morid
    Alrahis, Lilas
    Colucci, Alessio
    Sinanoglu, Ozgur
    Shafique, Muhammad
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [38] EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS
    Sun, Fangxuan
    Lin, Jun
    Wang, Zhongfeng
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1145 - 1149
  • [39] An Efficient Hardware Architecture for Multilayer Spiking Neural Networks
    Luo, Yuling
    Wan, Lei
    Liu, Junxiu
    Zhang, Jinlei
    Cao, Yi
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 786 - 795
  • [40] Energy-Efficient and High-Performance NoC Architecture and Mapping Solution for Deep Neural Networks
    Reza, Md Farhadur
    Ampadu, Paul
    PROCEEDINGS OF THE 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS'19), 2019,