A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction

被引:200
|
作者
Wiatowski, Thomas [1 ]
Bolcskei, Helmut [1 ]
机构
[1] ETH, Dept Informat Technol & Elect Engn, CH-8092 Zurich, Switzerland
关键词
Machine learning; deep convolutional neural networks; scattering networks; feature extraction; frame theory; TEXTURE CLASSIFICATION; WAVELET; RECOGNITION; RIDGELET; REPRESENTATIONS; FRAMES;
D O I
10.1109/TIT.2017.2776228
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep convolutional neural networks (DCNNs) have led to breakthrough results in numerous practical machine learning tasks, such as classification of images in the ImageNet data set, control-policy-learning to play Atari games or the board game Go, and image captioning. Many of these applications first perform feature extraction and then feed the results thereof into a classifier. The mathematical analysis of DCNNs for feature extraction was initiated by Mallat, 2012. Specifically, Mallat considered so-called scattering networks based on a wavelet transform followed by the modulus non-linearity in each network layer, and proved translation invariance (asymptotically in the wavelet scale parameter) and deformation stability of the corresponding feature extractor. This paper complements Mallat's results by developing a theory that encompasses general convolutional transforms, or in more technical parlance, general semi-discrete frames (including Weyl-Heisenberg filters, curvelets, shearlets, ridgelets, wavelets, and learned filters), general Lipschitz-continuous non-linearities (e.g., rectified linear units, shifted logistic sigmoids, hyperbolic tangents, and modulus functions), and general Lipschitz-continuous pooling operators emulating, e.g., sub-sampling and averaging. In addition, all of these elements can be different in different network layers. For the resulting feature extractor, we prove a translation invariance result of vertical nature in the sense of the features becoming progressively more translation-invariant with increasing network depth, and we establish deformation sensitivity bounds that apply to signal classes such as, e.g., band-limited functions, cartoon functions, and Lipschitz functions.
引用
收藏
页码:1845 / 1866
页数:22
相关论文
共 50 条
  • [1] Regularized Deep Convolutional Neural Networks for Feature Extraction and Classification
    Jayech, Khaoula
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 431 - 439
  • [2] Deep Convolutional Neural Networks for Feature Extraction in Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    [J]. HUMAN-COMPUTER INTERACTION. RECOGNITION AND INTERACTION TECHNOLOGIES, HCI 2019, PT II, 2019, 11567 : 117 - 132
  • [3] Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks
    Chen, Yushi
    Jiang, Hanlu
    Li, Chunyang
    Jia, Xiuping
    Ghamisi, Pedram
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10): : 6232 - 6251
  • [4] Feature Extraction and Fusion Using Deep Convolutional Neural Networks for Face Detection
    Lu, Xiaojun
    Duan, Xu
    Mao, Xiuping
    Li, Yuanyuan
    Zhang, Xiangde
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [5] Deep Convolutional Neural Networks for Feature Extraction of Images Generated from Complex Networks Topologies
    Xu, Ye
    Chi, Yun
    Tian, Ye
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2018, 103 (01) : 327 - 338
  • [6] Deep Convolutional Neural Networks for Feature Extraction of Images Generated from Complex Networks Topologies
    Ye Xu
    Yun Chi
    Ye Tian
    [J]. Wireless Personal Communications, 2018, 103 : 327 - 338
  • [7] Improving Language-Universal Feature Extraction with Deep Maxout and Convolutional Neural Networks
    Miao, Yajie
    Metze, Florian
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 800 - 804
  • [8] Hyperspectral Remote Sensing Images Deep Feature Extraction Based on Mixed Feature and Convolutional Neural Networks
    Liu, Jing
    Yang, Zhe
    Liu, Yi
    Mu, Caihong
    [J]. REMOTE SENSING, 2021, 13 (13)
  • [9] Topology Reduction in Deep Convolutional Feature Extraction Networks
    Wiatowski, Thomas
    Grohs, Philipp
    Bolcskei, Helmut
    [J]. WAVELETS AND SPARSITY XVII, 2017, 10394
  • [10] Development of Convolutional Neural Networks (CNNs) for Feature Extraction
    Eikmeier, Nicole
    Westerkamp, Rachel
    Zelnio, Edmund
    [J]. ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY XXV, 2018, 10647