Revisiting Sparse Convolutional Model for Visual Recognition

被引:0
|
作者
Dai, Xili [1 ]
Li, Mingyang [2 ]
Zhai, Pengyuan [3 ]
Tong, Shengbang [4 ]
Gao, Xingjian [4 ]
Huang, Shao-Lun [2 ]
Zhu, Zhihui [5 ]
You, Chong [4 ]
Ma, Yi [2 ,4 ]
机构
[1] Hong Kong Univ Sci & Technol, Guangzhou, Peoples R China
[2] Tsinghua Univ, TBSI, Shenzhen, Peoples R China
[3] Harvard Univ, Cambridge, MA 02138 USA
[4] Univ Calif Berkeley, Berkeley, CA 94720 USA
[5] Ohio State Univ, Columbus, OH 43210 USA
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite strong empirical performance for image classification, deep neural networks are often regarded as "black boxes" and they are difficult to interpret. On the other hand, sparse convolutional models, which assume that a signal can be expressed by a linear combination of a few elements from a convolutional dictionary, are powerful tools for analyzing natural images with good theoretical interpretability and biological plausibility. However, such principled models have not demonstrated competitive performance when compared with empirically designed deep networks. This paper revisits the sparse convolutional modeling for image classification and bridges the gap between good empirical performance (of deep learning) and good interpretability (of sparse convolutional models). Our method uses differentiable optimization layers that are defined from convolutional sparse coding as drop-in replacements of standard convolutional layers in conventional deep neural networks. We show that such models have equally strong empirical performance on CIFAR-10, CIFAR-100 and ImageNet datasets when compared to conventional neural networks. By leveraging stable recovery property of sparse modeling, we further show that such models can be much more robust to input corruptions as well as adversarial perturbations in testing through a simple proper trade-off between sparse regularization and data reconstruction terms. Source code can be found at https://github.com/Delay- Xili/SDNet.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] DATA DRIVEN CONVOLUTIONAL SPARSE CODING FOR VISUAL RECOGNITION
    Zeng, Yijie
    Chen, Jichao
    Huang, Guang-Bin
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2736 - 2740
  • [2] Wider or Deeper: Revisiting the ResNet Model for Visual Recognition
    Wu, Zifeng
    Shen, Chunhua
    van den Hengel, Anton
    PATTERN RECOGNITION, 2019, 90 : 119 - 133
  • [3] Sparse convolutional model with semantic expression for waste electrical appliances recognition
    Han, HongGui
    Liu, YiMing
    Li, FangYu
    Du, YongPing
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (09) : 2881 - 2893
  • [4] Sparse convolutional model with semantic expression for waste electrical appliances recognition
    HAN HongGui
    LIU YiMing
    LI FangYu
    DU YongPing
    Science China(Technological Sciences), 2024, 67 (09) : 2881 - 2893
  • [5] Convolutional Sparse Coding for Face Recognition
    Jin, Junwei
    Chen, C. L. Philip
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION, CYBERNETICS AND COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2017, : 137 - 141
  • [6] BAG OF GROUPS OF CONVOLUTIONAL FEATURES MODEL FOR VISUAL OBJECT RECOGNITION
    Singh, Jaspreet
    Singh, Chandan
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [7] A Visual Recognition Model Based on Improved Convolutional Neural Network
    Zhou, Jin
    Zhang, Yonglin
    Song, Shaoyun
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 260 - 260
  • [8] Sparse Decomposition of Convolutional Features for Scene Recognition
    Xie, Lin
    Lee, Feifei
    Yan, Yan
    Chen, Qiu
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2017, : 345 - 348
  • [9] Bidirectional Convolutional Recurrent Sparse Network (BCRSN): An Efficient Model for Music Emotion Recognition
    Dong, Yizhuo
    Yang, Xinyu
    Zhao, Xi
    Li, Juan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) : 3150 - 3163
  • [10] Visual tracking using convolutional features with sparse coding
    Abbass, Mohammed Y.
    Kwon, Ki-Chul
    Kim, Nam
    Abdelwahab, Safey A.
    Abd El-Samie, Fathi E.
    Khalaf, Ashraf A. M.
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) : 3349 - 3360