A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction

Cited by: 200

Authors:

Wiatowski, Thomas [1]

Bolcskei, Helmut [1]

Affiliations:

[1] ETH, Dept Informat Technol & Elect Engn, CH-8092 Zurich, Switzerland

Source:

IEEE TRANSACTIONS ON INFORMATION THEORY | 2018 / Vol. 64 / No. 03

Keywords:

Machine learning; deep convolutional neural networks; scattering networks; feature extraction; frame theory; TEXTURE CLASSIFICATION; WAVELET; RECOGNITION; RIDGELET; REPRESENTATIONS; FRAMES;

DOI:

10.1109/TIT.2017.2776228

Chinese Library Classification (CLC):

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

Deep convolutional neural networks (DCNNs) have led to breakthrough results in numerous practical machine learning tasks, such as classification of images in the ImageNet data set, control-policy-learning to play Atari games or the board game Go, and image captioning. Many of these applications first perform feature extraction and then feed the results thereof into a classifier. The mathematical analysis of DCNNs for feature extraction was initiated by Mallat, 2012. Specifically, Mallat considered so-called scattering networks based on a wavelet transform followed by the modulus non-linearity in each network layer, and proved translation invariance (asymptotically in the wavelet scale parameter) and deformation stability of the corresponding feature extractor. This paper complements Mallat's results by developing a theory that encompasses general convolutional transforms, or in more technical parlance, general semi-discrete frames (including Weyl-Heisenberg filters, curvelets, shearlets, ridgelets, wavelets, and learned filters), general Lipschitz-continuous non-linearities (e.g., rectified linear units, shifted logistic sigmoids, hyperbolic tangents, and modulus functions), and general Lipschitz-continuous pooling operators emulating, e.g., sub-sampling and averaging. In addition, all of these elements can be different in different network layers. For the resulting feature extractor, we prove a translation invariance result of vertical nature in the sense of the features becoming progressively more translation-invariant with increasing network depth, and we establish deformation sensitivity bounds that apply to signal classes such as, e.g., band-limited functions, cartoon functions, and Lipschitz functions.
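The layer structure described in the abstract (a convolutional transform, then a Lipschitz-continuous non-linearity, then pooling) can be illustrated with a small sketch. This is a toy 1-D illustration only, not the paper's construction: the Haar-like filter pair `LOW`/`HIGH`, the circular convolution, the helper names `circ_conv`, `layer`, and `extract_features`, and the choice to emit a low-pass output at every layer are all assumptions made for this example.

```python
import numpy as np

# Hypothetical Haar-like filter pair (an assumption for illustration).
# Their squared frequency responses sum to 1 at every frequency.
LOW = np.array([0.5, 0.5])
HIGH = np.array([0.5, -0.5])

def circ_conv(x, h):
    """Circular convolution of x with a short filter h, zero-padded to len(x)."""
    n = len(x)
    return np.fft.ifft(np.fft.fft(x) * np.fft.fft(h, n))

def layer(signals, filters, nonlinearity=np.abs, stride=2):
    """Convolve every input with every filter, apply a Lipschitz
    non-linearity (modulus by default), then pool by sub-sampling."""
    return [nonlinearity(circ_conv(x, h))[::stride]
            for x in signals for h in filters]

def extract_features(x, depth=2):
    """Collect a low-pass output from every layer into one feature vector."""
    signals = [np.asarray(x, dtype=float)]
    features = []
    for _ in range(depth + 1):
        # Inputs and filters are real, so the imaginary part is round-off only.
        features.extend(circ_conv(s, LOW).real for s in signals)
        signals = layer(signals, [HIGH])
    return np.concatenate(features)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal(16)
    print(extract_features(x).shape)
```

With this particular filter pair, modulus, and sub-sampling, each building block is 1-Lipschitz, so the toy extractor is non-expansive: the distance between two feature vectors does not exceed the distance between the input signals. Swapping in other filters or non-linearities would require re-checking that property.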

Pages: 1845-1866

Page count: 22
