BAG OF GROUPS OF CONVOLUTIONAL FEATURES MODEL FOR VISUAL OBJECT RECOGNITION

被引:0
|
作者
Singh, Jaspreet [1 ]
Singh, Chandan [1 ]
机构
[1] Punjabi Univ, Dept Comp Sci, Patiala 147002, Punjab, India
关键词
Rotation; equivariance; invariance; classification; MOMENTS; SCALE;
D O I
10.1109/MLSP52302.2021.9596432
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional neural networks (CNNs) are only equivariant to translation. Recently, equivariant CNNs are proposed for the task of image classification which are not only equivariant to translation but also to other affine geometric transformations. Moreover, CNNs and equivariant CNNs require a large amount of labeled training data to generalize its parameters which also limit their application areas. We propose a bag of groups of convolutional features (BoGCFs) model for the CNNs and group-equivariant CNNs (G-CNNs)[1], which preserves the fundamental property of equivariance of G-CNNs and generate the global invariant features by dividing the convolutional feature maps of the deeper layers of the network into groups. The proposed model for CNNs and G-CNNs, referred as CNN-BoGCFs and G-CNN-BoGCFs, performs significantly high when trained on a small amount of labeled data for image classification. The proposed method is evaluated using rotated MNIST, SIMPLIcity and Oxford flower 17 datasets.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] What are the visual features underlying rapid object recognition?
    Crouzet, Sebastien M.
    Serre, Thomas
    FRONTIERS IN PSYCHOLOGY, 2011, 2
  • [22] Revisiting Sparse Convolutional Model for Visual Recognition
    Dai, Xili
    Li, Mingyang
    Zhai, Pengyuan
    Tong, Shengbang
    Gao, Xingjian
    Huang, Shao-Lun
    Zhu, Zhihui
    You, Chong
    Ma, Yi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [23] BAG-OF-FEATURES REPRESENTATIONS USING SPATIAL VISUAL VOCABULARIES FOR OBJECT CLASSIFICATION
    Grzeszick, Rene
    Rothacker, Leonard
    Fink, Gernot A.
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2857 - 2861
  • [24] Random interest regions for object recognition based on texture descriptors and bag of features
    Nanni, Loris
    Brahnam, Sheryl
    Lumini, Alessandra
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 973 - 977
  • [25] Performance evaluation of large-scale object recognition system using bag-of-visual words model
    Kim, Min-Uk
    Yoon, Kyoungro
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (07) : 2499 - 2517
  • [26] Performance evaluation of large-scale object recognition system using bag-of-visual words model
    Min-Uk Kim
    Kyoungro Yoon
    Multimedia Tools and Applications, 2015, 74 : 2499 - 2517
  • [27] Bag of Tricks for Long-Tailed Visual Recognition with Deep Convolutional Neural Networks
    Zhang, Yongshun
    Wei, Xiu-Shen
    Zhou, Boyan
    Wu, Jianxin
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3447 - 3455
  • [28] Transportation Object Detection with Bag of Visual Words Model by PLSA and MLP
    Hyun Chul Song
    Kwang Nam Choi
    Mobile Networks and Applications, 2018, 23 : 1103 - 1110
  • [29] Transportation Object Detection with Bag of Visual Words Model by PLSA and MLP
    Song, Hyun Chul
    Choi, Kwang Nam
    MOBILE NETWORKS & APPLICATIONS, 2018, 23 (04): : 1103 - 1110
  • [30] Object recognition by learning informative, biologically inspired visual features
    Wu, Yang
    Zheng, Nanning
    You, Qubo
    Du, Shaoyi
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 181 - 184