SAM: A Rethinking of Prominent Convolutional Neural Network Architectures for Visual Object Recognition

被引:0
|
作者
Wang, Zhenyang [1 ]
Deng, Zhidong [1 ]
Wang, Shiyao [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci, Tsinghua Natl Lab Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
基金
美国国家科学基金会;
关键词
RECEPTIVE FIELDS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural networks play an increasingly important role in computer vision tasks, especially in the field of visual object recognition. Many prominent models, such as Inception, Maxout, ResNet, and NIN, have been proposed to significantly improve recognition performance. Inspired from those models, we propose a novel module called self-adaptive module (SAM). SAM consists of four passes and one selector. Specifically, the four passes include two direct passes with different receptive fields and depths, one residual pass, and one Maxout pass. Actually, the residual pass is used to speed up convergence, while we take advantage of the Maxout pass to enhance approximate capabilities of SAM. The selector is further designed to help choose reasonable output. Basically, SAM is intended to simplify design of any new deep learning architecture, since it no longer requires consideration of how to select receptive fields and depths. Our SAM is tested on the visual object recognition datasets including CIFAR-10, CIFAR-100, MNIST, and SVHN. The experimental results demonstrate that the SAM-Net has superior recognition performances on the four benchmarks, which achieve test errors of 5.76%, 28.56%, 0.31%, and 1.98%, respectively.
引用
收藏
页码:1008 / 1014
页数:7
相关论文
共 50 条
  • [1] A convolutional neural network for visual object recognition in marine sector
    Kumar, Aiswarya S.
    Sherly, Elizabeth
    [J]. 2017 2ND INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2017, : 304 - 307
  • [2] Recurrent Convolutional Neural Network for Object Recognition
    Liang, Ming
    Hu, Xiaolin
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3367 - 3375
  • [3] Inception recurrent convolutional neural network for object recognition
    Alom, Md Zahangir
    Hasan, Mahmudul
    Yakopcic, Chris
    Taha, Tarek M.
    Asari, Vijayan K.
    [J]. MACHINE VISION AND APPLICATIONS, 2021, 32 (01)
  • [4] Radar Based Object Recognition with Convolutional Neural Network
    Loi, Kin Chong
    Cheong, Pedro
    Choi, Wai Wa
    [J]. PROCEEDINGS OF THE 2019 IEEE ASIA-PACIFIC MICROWAVE CONFERENCE (APMC), 2019, : 87 - 89
  • [5] Inception recurrent convolutional neural network for object recognition
    Md Zahangir Alom
    Mahmudul Hasan
    Chris Yakopcic
    Tarek M. Taha
    Vijayan K. Asari
    [J]. Machine Vision and Applications, 2021, 32
  • [6] A visual neural network for moving object recognition
    Sato, N
    Hagiwara, M
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2004, 87 (09): : 46 - 55
  • [7] Object Recognition in Images using Convolutional Neural Network
    Duth, Sudharshan P.
    Raj, Swathi
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2018), 2018, : 718 - 722
  • [8] On the Use of Convolutional Neural Network Architectures for Facial Emotion Recognition
    Espinel, Andres
    Perez, Noel
    Riofrio, Daniel
    Benitez, Diego S.
    Flores Moyano, Ricardo
    [J]. APPLICATIONS OF COMPUTATIONAL INTELLIGENCE, COLCACI 2021, 2022, 1471 : 18 - 30
  • [9] Convolutional neural network-based encoding and decoding of visual object recognition in space and time
    Seeliger, K.
    Fritsche, M.
    Guclu, U.
    Schoenmakers, S.
    Schoffelen, J. -M.
    Bosch, S. E.
    van Gerven, M. A. J.
    [J]. NEUROIMAGE, 2018, 180 : 253 - 266
  • [10] Convolutional Neural Network with Structural Input for Visual Object Tracking
    Fiaz, Mustansar
    Mahmood, Arif
    Jung, Soon Ki
    [J]. SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 1345 - 1352