Revisiting Sparse Convolutional Model for Visual Recognition

被引:0
|
作者
Dai, Xili [1 ]
Li, Mingyang [2 ]
Zhai, Pengyuan [3 ]
Tong, Shengbang [4 ]
Gao, Xingjian [4 ]
Huang, Shao-Lun [2 ]
Zhu, Zhihui [5 ]
You, Chong [4 ]
Ma, Yi [2 ,4 ]
机构
[1] Hong Kong Univ Sci & Technol, Guangzhou, Peoples R China
[2] Tsinghua Univ, TBSI, Shenzhen, Peoples R China
[3] Harvard Univ, Cambridge, MA 02138 USA
[4] Univ Calif Berkeley, Berkeley, CA 94720 USA
[5] Ohio State Univ, Columbus, OH 43210 USA
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite strong empirical performance for image classification, deep neural networks are often regarded as "black boxes" and they are difficult to interpret. On the other hand, sparse convolutional models, which assume that a signal can be expressed by a linear combination of a few elements from a convolutional dictionary, are powerful tools for analyzing natural images with good theoretical interpretability and biological plausibility. However, such principled models have not demonstrated competitive performance when compared with empirically designed deep networks. This paper revisits the sparse convolutional modeling for image classification and bridges the gap between good empirical performance (of deep learning) and good interpretability (of sparse convolutional models). Our method uses differentiable optimization layers that are defined from convolutional sparse coding as drop-in replacements of standard convolutional layers in conventional deep neural networks. We show that such models have equally strong empirical performance on CIFAR-10, CIFAR-100 and ImageNet datasets when compared to conventional neural networks. By leveraging stable recovery property of sparse modeling, we further show that such models can be much more robust to input corruptions as well as adversarial perturbations in testing through a simple proper trade-off between sparse regularization and data reconstruction terms. Source code can be found at https://github.com/Delay- Xili/SDNet.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Face Recognition Based on Stacked Convolutional Autoencoder and Sparse Representation
    Chang, Liping
    Yang, Jianjun
    Li, Sheng
    Xu, Hong
    Liu, Kai
    Huang, Chaogeng
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [22] GENERIC SPARSE GRAPH BASED CONVOLUTIONAL NETWORKS FOR FACE RECOGNITION
    Wu, Renjie
    Kamata, Sei-ichiro
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1589 - 1593
  • [23] Sparse Deep LSTMs with Convolutional Attention for Human Action Recognition
    Aghaei A.
    Nazari A.
    Moghaddam M.E.
    SN Computer Science, 2021, 2 (3)
  • [24] Clothing recognition based on deep sparse convolutional neural network
    Xiang, Jun
    Pan, Ruru
    Gao, Weidong
    INTERNATIONAL JOURNAL OF CLOTHING SCIENCE AND TECHNOLOGY, 2022, 34 (01) : 119 - 133
  • [25] GENERIC SPARSE GRAPH BASED CONVOLUTIONAL NETWORKS FOR FACE RECOGNITION
    Wu, Renjie
    Kamata, Sei-Ichiro
    Proceedings - International Conference on Image Processing, ICIP, 2021, 2021-September : 1589 - 1593
  • [26] Visual Attributes Based Sparse Multitask Action Recognition
    Wang, Qicong
    Zhao, Jinhao
    Shen, Yehu
    Li, Maozhen
    Wu, Yuxiang
    Lei, Yunqi
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1767 - 1772
  • [27] Sparse Spatial Coding: A Novel Approach to Visual Recognition
    Oliveira, Gabriel Leivas
    Nascimento, Erickson R.
    Vieira, Antonio Wilson
    Montenegro Campos, Mario Fernando
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (06) : 2719 - 2731
  • [28] Sparse representation and learning in visual recognition: Theory and applications
    Cheng, Hong
    Liu, Zicheng
    Yang, Lu
    Chen, Xuewen
    SIGNAL PROCESSING, 2013, 93 (06) : 1408 - 1425
  • [29] Revisiting Convolutional Sparse Coding for Image Denoising: From a Multi-Scale Perspective
    Xu, Jingyi
    Deng, Xin
    Xu, Mai
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1202 - 1206
  • [30] Delving into Fully Convolutional Networks Activations for Visual Recognition
    Zhang, Longfei
    Guo, Yanming
    PROCEEDINGS OF 2018 THE 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP 2018), 2018, : 99 - 104