On a Sparse Shortcut Topology of Artificial Neural Networks

被引:8
|
作者
Fan F.-L. [1 ]
Wang D. [2 ]
Guo H. [1 ]
Zhu Q. [1 ]
Yan P. [1 ]
Wang G. [1 ]
Yu H. [2 ]
机构
[1] Rensselaer Polytechnic Institute, Department of Biomedical Engineering, Troy, 12180, NY
[2] University of Massachusetts, Department of Electrical and Computer Engineering, Lowell, 01854, MA
来源
关键词
Expressivity; generalizability; network architec- ture; shortcut network; theoretical deep learning;
D O I
10.1109/TAI.2021.3128132
中图分类号
学科分类号
摘要
In established network architectures, shortcut connections are often used to take the outputs of earlier layers as additional inputs to later layers. Despite the extraordinary effectiveness of shortcuts, there remain open questions on the mechanism and characteristics. For example, why are shortcuts powerful? Why do shortcuts generalize well? In this article, we investigate the expressivity and generalizability of a novel sparse shortcut topology. First, we demonstrate that this topology can empower a one-neuron-wide deep network to approximate any univariate continuous function. Then, we present a novel width-bounded universal approximator in contrast to depth-bounded universal approximators and extend the approximation result to a family of equally competent networks. Furthermore, with generalization bound theory, we show that the proposed shortcut topology enjoys excellent generalizability. Finally, we corroborate our theoretical analyses by comparing the proposed topology with popular architectures, including ResNet and DenseNet, on well-known benchmarks and perform a saliency map analysis to interpret the proposed topology. Our work helps understand the role of shortcuts and suggests further opportunities to innovate neural architectures. © 2020 IEEE.
引用
收藏
页码:595 / 608
页数:13
相关论文
共 50 条
  • [21] Topology and Toughening of Sparse Elastic Networks
    Yamaguchi, Tetsuo
    Onoue, Yudai
    Sawae, Yoshinori
    PHYSICAL REVIEW LETTERS, 2020, 124 (06)
  • [22] Shortcut Convolutional Neural Networks for Classification of Gender and Texture
    Zhang, Ting
    Li, Yujian
    Liu, Zhaoying
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 30 - 39
  • [23] A Sparse and Ergonomic Tomographic Image Reconstruction Technique based on Artificial Neural Networks
    Tomazinaki, M-E
    Lytrosyngounis, I
    Stiliaris, E.
    2018 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE PROCEEDINGS (NSS/MIC), 2018,
  • [24] Optimizing bags of artificial neural networks for the prediction of viability from sparse data
    Daly, Clyde A., Jr.
    Hernandez, Rigoberto
    JOURNAL OF CHEMICAL PHYSICS, 2020, 153 (05):
  • [25] Sparse and burst spiking in artificial neural networks inspired by synaptic retrograde signaling
    Faghihi, Faramarz
    Moustafa, Ahmed A.
    INFORMATION SCIENCES, 2017, 421 : 30 - 42
  • [26] Asymptotic properties of one-layer artificial neural networks with sparse connectivity
    Hirsch, Christian
    Neumann, Matthias
    Schmidt, Volker
    STATISTICS & PROBABILITY LETTERS, 2023, 193
  • [27] Power System Topology Verification using Network Examinations and Artificial Neural Networks
    Lukomski, Robert
    Wilkosz, Kazimierz
    PROCEEDINGS OF THE 7TH INTERNATIONAL SCIENTIFIC CONFERENCE ELECTRIC POWER ENGINEERING 2006, 2006, : 105 - 110
  • [28] Robust topology optimization with low rank approximation using artificial neural networks
    Vahid Keshavarzzadeh
    Robert M. Kirby
    Akil Narayan
    Computational Mechanics, 2021, 68 : 1297 - 1323
  • [29] Machine fusion to enhance the topology preservation of vector quantization artificial neural networks
    Salas, R.
    Saavedra, C.
    Allende, H.
    Moraga, C.
    PATTERN RECOGNITION LETTERS, 2011, 32 (07) : 962 - 972
  • [30] Topology and weight evolving artificial neural networks in cooperative transport by a robotic swarm
    Hiraga, Motoaki
    Ohkura, Kazuhiro
    ARTIFICIAL LIFE AND ROBOTICS, 2022, 27 (02) : 324 - 332