Revisiting Sparse Convolutional Model for Visual Recognition

被引：0

作者：

Dai, Xili ^{[1
]}

Li, Mingyang ^{[2
]}

Zhai, Pengyuan ^{[3
]}

Tong, Shengbang ^{[4
]}

Gao, Xingjian ^{[4
]}

Huang, Shao-Lun ^{[2
]}

Zhu, Zhihui ^{[5
]}

You, Chong ^{[4
]}

Ma, Yi ^{[2
,4
]}

机构：

[1] Hong Kong Univ Sci & Technol, Guangzhou, Peoples R China

[2] Tsinghua Univ, TBSI, Shenzhen, Peoples R China

[3] Harvard Univ, Cambridge, MA 02138 USA

[4] Univ Calif Berkeley, Berkeley, CA 94720 USA

[5] Ohio State Univ, Columbus, OH 43210 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年

基金：

国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Despite strong empirical performance for image classification, deep neural networks are often regarded as "black boxes" and they are difficult to interpret. On the other hand, sparse convolutional models, which assume that a signal can be expressed by a linear combination of a few elements from a convolutional dictionary, are powerful tools for analyzing natural images with good theoretical interpretability and biological plausibility. However, such principled models have not demonstrated competitive performance when compared with empirically designed deep networks. This paper revisits the sparse convolutional modeling for image classification and bridges the gap between good empirical performance (of deep learning) and good interpretability (of sparse convolutional models). Our method uses differentiable optimization layers that are defined from convolutional sparse coding as drop-in replacements of standard convolutional layers in conventional deep neural networks. We show that such models have equally strong empirical performance on CIFAR-10, CIFAR-100 and ImageNet datasets when compared to conventional neural networks. By leveraging stable recovery property of sparse modeling, we further show that such models can be much more robust to input corruptions as well as adversarial perturbations in testing through a simple proper trade-off between sparse regularization and data reconstruction terms. Source code can be found at https://github.com/Delay- Xili/SDNet.

引用

页数：13

共 50 条

[1] DATA DRIVEN CONVOLUTIONAL SPARSE CODING FOR VISUAL RECOGNITION
Zeng, Yijie
Chen, Jichao
Huang, Guang-Bin
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2736 - 2740
[2] Wider or Deeper: Revisiting the ResNet Model for Visual Recognition
Wu, Zifeng
Shen, Chunhua
van den Hengel, Anton
PATTERN RECOGNITION, 2019, 90 : 119 - 133
[3] Sparse convolutional model with semantic expression for waste electrical appliances recognition
Han, HongGui
Liu, YiMing
Li, FangYu
Du, YongPing
SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (09) : 2881 - 2893
[4] Sparse convolutional model with semantic expression for waste electrical appliances recognition
HAN HongGui
LIU YiMing
LI FangYu
DU YongPing
Science China(Technological Sciences), 2024, 67 (09) : 2881 - 2893
[5] Convolutional Sparse Coding for Face Recognition
Jin, Junwei
Chen, C. L. Philip
2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION, CYBERNETICS AND COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2017, : 137 - 141
[6] BAG OF GROUPS OF CONVOLUTIONAL FEATURES MODEL FOR VISUAL OBJECT RECOGNITION
Singh, Jaspreet
Singh, Chandan
2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
[7] A Visual Recognition Model Based on Improved Convolutional Neural Network
Zhou, Jin
Zhang, Yonglin
Song, Shaoyun
BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 260 - 260
[8] Sparse Decomposition of Convolutional Features for Scene Recognition
Xie, Lin
Lee, Feifei
Yan, Yan
Chen, Qiu
2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2017, : 345 - 348
[9] Bidirectional Convolutional Recurrent Sparse Network (BCRSN): An Efficient Model for Music Emotion Recognition
Dong, Yizhuo
Yang, Xinyu
Zhao, Xi
Li, Juan
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) : 3150 - 3163
[10] Visual tracking using convolutional features with sparse coding
Abbass, Mohammed Y.
Kwon, Ki-Chul
Kim, Nam
Abdelwahab, Safey A.
Abd El-Samie, Fathi E.
Khalaf, Ashraf A. M.
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) : 3349 - 3360

← 1 2 3 4 5 →