Efficient architecture for deep neural networks with heterogeneous sensitivity

被引：2

作者：

Cho, Hyunjoong ^{[1
]}

Jang, Jinhyeok ^{[2
]}

Lee, Chanhyeok ^{[1
]}

Yang, Seungjoon ^{[1
]}

机构：

[1] Ulsan Natl Inst Sci & Technol UNIST, Sch Elect & Comp Engn, Ulsan, South Korea

[2] Elect & Telecommun Res Inst ETRI, Daejeon, South Korea

来源：

NEURAL NETWORKS | 2021年 / 134卷

基金：

新加坡国家研究基金会;

关键词：

Deep neural networks; Efficient architecture; Heterogeneous sensitivity; Constrained optimization; Simultaneous regularization parameter selection; L-CURVE; REGULARIZATION;

D O I：

10.1016/j.neunet.2020.10.017

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this study, we present a neural network that consists of nodes with heterogeneous sensitivity. Each node in a network is assigned a variable that determines the sensitivity with which it learns to perform a given task. The network is trained via a constrained optimization that maximizes the sparsity of the sensitivity variables while ensuring optimal network performance. As a result, the network learns to perform a given task using only a few sensitive nodes. Insensitive nodes, which are nodes with zero sensitivity, can be removed from a trained network to obtain a computationally efficient network. Removing zero-sensitivity nodes has no effect on the performance of the network because the network has already been trained to perform the task without them. The regularization parameter used to solve the optimization problem was simultaneously found during the training of the networks. To validate our approach, we designed networks with computationally efficient architectures for various tasks such as autoregression, object recognition, facial expression recognition, and object detection using various datasets. In our experiments, the networks designed by our proposed method provided the same or higher performances but with far less computational complexity. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页码：95 / 106

页数：12

共 50 条

[31] Bit Efficient Quantization for Deep Neural Networks
Nayak, Prateeth
Zhang, David
Chai, Sek
FIFTH WORKSHOP ON ENERGY EFFICIENT MACHINE LEARNING AND COGNITIVE COMPUTING - NEURIPS EDITION (EMC2-NIPS 2019), 2019, : 52 - 56
[32] The Efficient Hedging Frontier with Deep Neural Networks
Gong, Zheng
Ventre, Carmine
O'Hara, John
ICAIF 2021: THE SECOND ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, 2021,
[33] Deep Neural Networks with Efficient Guaranteed Invariances
Rath, Matthias
Condurache, Alexandru Paul
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
[34] Learning deep morphological networks with neural architecture search
Hu, Yufei
Belkhir, Nacim
Angulo, Jesus
Yao, Angela
Franchi, Gianni
PATTERN RECOGNITION, 2022, 131
[35] A Fast Compressed Hardware Architecture for Deep Neural Networks
Ansari, Anaam
Shelton, Allen
Ogunfunmi, Tokunbo
Panchbhaiyye, Vineet
2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 370 - 374
[36] A Domain-Specific Architecture for Deep Neural Networks
Jouppi, Norman P.
Young, Cliff
Patil, Nishant
Patterson, David
COMMUNICATIONS OF THE ACM, 2018, 61 (09) : 50 - 59
[37] NeuroUnlock: Unlocking the Architecture of Obfuscated Deep Neural Networks
Ahmadi, Mahya Morid
Alrahis, Lilas
Colucci, Alessio
Sinanoglu, Ozgur
Shafique, Muhammad
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[38] EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS
Sun, Fangxuan
Lin, Jun
Wang, Zhongfeng
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1145 - 1149
[39] An Efficient Hardware Architecture for Multilayer Spiking Neural Networks
Luo, Yuling
Wan, Lei
Liu, Junxiu
Zhang, Jinlei
Cao, Yi
NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 786 - 795
[40] Energy-Efficient and High-Performance NoC Architecture and Mapping Solution for Deep Neural Networks
Reza, Md Farhadur
Ampadu, Paul
PROCEEDINGS OF THE 13TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS'19), 2019,

← 1 2 3 4 5 →