Revisiting Sparse Convolutional Model for Visual Recognition

Cited by: 0
Authors
Dai, Xili [1]
Li, Mingyang [2]
Zhai, Pengyuan [3]
Tong, Shengbang [4]
Gao, Xingjian [4]
Huang, Shao-Lun [2]
Zhu, Zhihui [5]
You, Chong [4]
Ma, Yi [2, 4]
Affiliations
[1] Hong Kong Univ Sci & Technol, Guangzhou, Peoples R China
[2] Tsinghua Univ, TBSI, Shenzhen, Peoples R China
[3] Harvard Univ, Cambridge, MA 02138 USA
[4] Univ Calif Berkeley, Berkeley, CA 94720 USA
[5] Ohio State Univ, Columbus, OH 43210 USA
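Funding
National Key Research and Development Program of China;
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes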
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Despite strong empirical performance for image classification, deep neural networks are often regarded as "black boxes" and are difficult to interpret. On the other hand, sparse convolutional models, which assume that a signal can be expressed by a linear combination of a few elements from a convolutional dictionary, are powerful tools for analyzing natural images with good theoretical interpretability and biological plausibility. However, such principled models have not demonstrated competitive performance when compared with empirically designed deep networks. This paper revisits sparse convolutional modeling for image classification and bridges the gap between good empirical performance (of deep learning) and good interpretability (of sparse convolutional models). Our method uses differentiable optimization layers that are defined from convolutional sparse coding as drop-in replacements for standard convolutional layers in conventional deep neural networks. We show that such models have equally strong empirical performance on the CIFAR-10, CIFAR-100 and ImageNet datasets when compared to conventional neural networks. By leveraging the stable recovery property of sparse modeling, we further show that such models can be much more robust to input corruptions as well as adversarial perturbations at test time through a simple, proper trade-off between the sparse regularization and data reconstruction terms. Source code can be found at https://github.com/Delay-Xili/SDNet.
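The trade-off between sparse regularization and data reconstruction mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a dense (non-convolutional) dictionary `A`, uses plain ISTA as a stand-in solver, and all names (`soft_threshold`, `ista`, `objective`, `lam`) are hypothetical. It minimizes 0.5 * ||x - A z||^2 + lam * ||z||_1, where a larger `lam` favors sparsity over reconstruction accuracy.

```python
import numpy as np

def soft_threshold(v, t):
    # Proximal operator of t * ||.||_1: shrink each entry toward zero by t.
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def objective(A, x, z, lam):
    # Data reconstruction term plus sparse regularization term.
    return 0.5 * np.sum((x - A @ z) ** 2) + lam * np.sum(np.abs(z))

def ista(A, x, lam, n_iter=200):
    # Assumed solver: ISTA with step size 1/L, where L is the Lipschitz
    # constant of the gradient of the reconstruction term.
    L = np.linalg.norm(A, 2) ** 2
    z = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ z - x)
        z = soft_threshold(z - grad / L, lam / L)
    return z

# Synthetic example: a signal built from three dictionary atoms.
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 50))
A /= np.linalg.norm(A, axis=0)          # unit-norm atoms
z_true = np.zeros(50)
z_true[[3, 17, 41]] = [1.5, -2.0, 1.0]
x = A @ z_true

z_hat = ista(A, x, lam=0.05)            # small lam: favors reconstruction
lam_max = np.max(np.abs(A.T @ x))
z_zero = ista(A, x, lam=1.1 * lam_max)  # very large lam: all-zero code
```

In a convolutional variant, the products `A @ z` and `A.T @ r` would be replaced by a convolution with the dictionary and its adjoint; that substitution is left out here to keep the sketch short.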
Pages: 13
Related papers
50 in total
  • [41] Revisiting the variable memory model of visual search
    Horowitz, Todd S.
    VISUAL COGNITION, 2006, 14 (4-8) : 668 - 684
  • [42] CONVOLUTIONAL SPARSE CODING CLASSIFICATION MODEL FOR IMAGE CLASSIFICATION
    Chen, Boheng
    Li, Jie
    Ma, Biyun
    Wei, Gang
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1918 - 1922
  • [43] Visual recognition and inference using dynamic overcomplete sparse learning
    Murray, Joseph F.
    Kreutz-Delgado, Kenneth
    NEURAL COMPUTATION, 2007, 19 (09) : 2301 - 2352
  • [44] Object class recognition based on compressive sensing with sparse features inspired by hierarchical model in visual cortex
    Lu Pei
    Xu Zhiyong
    Yu Huapeng
    Chang Yongxin
    Fu Chengyu
    Shao Jianxin
    OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY II, 2012, 8558
  • [45] Sparse Output Coding for Large-Scale Visual Recognition
    Zhao, Bin
    Xing, Eric P.
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3350 - 3357
  • [46] A Sparse Representation Model Using the Complete Marginal Fisher Analysis Framework and Its Applications to Visual Recognition
    Puthenputhussery, Ajit
    Liu, Qingfeng
    Liu, Chengjun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (08) : 1757 - 1770
  • [47] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) : 1904 - 1916
  • [48] PART-BASED CONVOLUTIONAL NEURAL NETWORK FOR VISUAL RECOGNITION
    Yang, Lingxiao
    Xie, Xiaohua
    Li, Peihua
    Zhang, David
    Zhang, Lei
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1772 - 1776
  • [49] A convolutional neural network for visual object recognition in marine sector
    Kumar, Aiswarya S.
    Sherly, Elizabeth
    2017 2ND INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2017, : 304 - 307
  • [50] A Lightweight Visual Font Style Recognition With Quantized Convolutional Autoencoder
    Tonmoy, Moshiur Rahman
    Rakib, Abdul Fattah
    Rahman, Rashik
    Adnan, Md. Akhtaruzzaman
    Mridha, M. F.
    Huang, Jie
    Shin, Jungpil
    IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2024, 5 : 120 - 130