BAG OF GROUPS OF CONVOLUTIONAL FEATURES MODEL FOR VISUAL OBJECT RECOGNITION

Authors
Singh, Jaspreet [1]
Singh, Chandan [1]
Affiliations
[1] Punjabi Univ, Dept Comp Sci, Patiala 147002, Punjab, India
Keywords
Rotation; equivariance; invariance; classification; moments; scale
DOI
10.1109/MLSP52302.2021.9596432
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep convolutional neural networks (CNNs) are equivariant only to translation. Recently, equivariant CNNs have been proposed for image classification that are equivariant not only to translation but also to other affine geometric transformations. Moreover, both CNNs and equivariant CNNs require a large amount of labeled training data for their parameters to generalize, which limits their application areas. We propose a bag of groups of convolutional features (BoGCFs) model for CNNs and group-equivariant CNNs (G-CNNs) [1], which preserves the fundamental equivariance property of G-CNNs and generates global invariant features by dividing the convolutional feature maps of the deeper layers of the network into groups. The proposed models for CNNs and G-CNNs, referred to as CNN-BoGCFs and G-CNN-BoGCFs, achieve high performance when trained on a small amount of labeled data for image classification. The proposed method is evaluated on the rotated MNIST, SIMPLIcity, and Oxford Flower 17 datasets.
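The abstract does not spell out how the groups of feature maps are turned into invariant features, so the following is only a minimal sketch of the general idea, not the authors' BoGCF procedure. It assumes p4-style features (groups of four orientation channels) and assumes that a 90-degree rotation of the input rotates every map spatially and cyclically shifts the channels within each group. The function names `rot90`, `act_like_p4_rotation`, and `group_pooled_descriptor` are hypothetical. Under those assumptions, pooling over each whole group gives a descriptor that does not change when the input is rotated.

```python
import random


def rot90(fmap):
    """Rotate a square 2-D list by 90 degrees counter-clockwise."""
    n = len(fmap)
    return [[fmap[c][n - 1 - r] for c in range(n)] for r in range(n)]


def act_like_p4_rotation(features):
    """Assumed effect of a 90-degree input rotation on p4-style features:
    every map is rotated spatially and the four orientation channels
    inside each group are cyclically shifted by one position."""
    return [[rot90(group[(k - 1) % 4]) for k in range(4)] for group in features]


def group_pooled_descriptor(features):
    """One value per group: the maximum over all orientation channels
    and all spatial positions belonging to that group."""
    return [max(v for fmap in group for row in fmap for v in row)
            for group in features]


# Toy features: 3 groups, each with 4 orientation channels of size 5 x 5.
random.seed(0)
features = [[[[random.random() for _ in range(5)] for _ in range(5)]
             for _ in range(4)] for _ in range(3)]

before = group_pooled_descriptor(features)
after = group_pooled_descriptor(act_like_p4_rotation(features))
print(before == after)
```

The rotation only permutes values inside a group, so any pooling that ignores position and channel order within the group (max, sum, or a histogram of quantized values) is unaffected by it. Pooling each channel separately would not have this property, because the channel order changes.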
Pages: 6