GENERALIZED POOLING PYRAMID WITH HIERARCHICAL DICTIONARY SPARSE CODING FOR EVENT AND OBJECT RECOGNITION

被引:0
|
作者
Chen, Shuai [1 ]
Ma, Bo [1 ]
Luo, Pei [1 ]
机构
[1] Beijing Inst Technol, Beijing Lab Intelligent Informat Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
bag-of-visual-words; image recognition; sparse codes; hierarchical dictionary; generalized pooling; IMAGE CLASSIFICATION;
D O I
暂无
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Feature coding and vector pooling are essential for image recognition in bag-of-visual-words (BoW) method. Encoding the low-level feature to rich one and pooling it without any information loss are very challenging works. In this paper, generalized pooling pyramid with hierarchical dictionary sparse coding is introduced to get rich sparse codes and alleviate the information loss in the phase of pooling. It includes two modules: First, with the low-level feature, hierarchical dictionary is learned for sparse coding to generate the hierarchical sparse representation. Second, in the phase of vector pooling, we present generalized pooling pyramid by utilizing the probabilistic function to model the statistical distribution of sparse codes. In the generalized pooling pyramid, the Fisher vectors which are computed with Gaussian Mixture (GMM) in different levels, are fused to represent the images. The performance of our method outperforms state-of-the-art performance in a large number of image categorization experiments on the event dataset (URIC-Sport dataset) and the object recognition dataset (Caltech101 dataset).
引用
收藏
页码:2349 / 2353
页数:5
相关论文
共 50 条
  • [1] Hierarchical spatial pyramid max pooling based on SIFT features and sparse coding for image classification
    Han, Hong
    Han, Qiqiang
    Li, Xiaojun
    Gu, Jianyin
    [J]. IET COMPUTER VISION, 2013, 7 (02) : 144 - 150
  • [2] Group-based Sparse Coding Dictionary Learning for Object Recognition
    Zhao, Yanqin
    Li, Jinhua
    Zhong, Zhun
    [J]. PROCEEDINGS OF 2014 IEEE WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS (WARTIA), 2014, : 331 - 334
  • [3] ORDINAL PYRAMID POOLING FOR ROTATION INVARIANT OBJECT RECOGNITION
    Wang, Guoli
    Fan, Bin
    Pan, Chunhong
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1349 - 1353
  • [4] INVARIANT HIERARCHICAL SPARSE CODING FOR OBJECT RECOGNITION VIA BAGS OF ATOMS
    Sun, Xiaoxia
    Nasrabadi, Nasser M.
    Tran, Trac D.
    [J]. 2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 212 - 216
  • [5] Sparse cortical coding and object recognition
    Bell, C.
    Moorhead, I.
    Haig, N.
    Ayre, C.
    [J]. PERCEPTION, 1999, 28 : 128 - 128
  • [6] SPATIAL PYRAMID ALIGNMENT FOR SPARSE CODING BASED OBJECT CLASSIFICATION
    Kim, Joonsoo
    Tahboub, Khalid
    Delp, Edward J.
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 1950 - 1954
  • [7] Sparse Weighting for Pyramid Pooling-Based SAR Image Target Recognition
    Wang, Shaona
    Liu, Yang
    Li, Linlin
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (07):
  • [8] Object Recognition Using Sparse Representation of Overcomplete Dictionary
    Loo, Chu-Kiong
    Memariani, Ali
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2012, PT IV, 2012, 7666 : 75 - 82
  • [9] Hierarchical Dictionary Learning and Sparse Coding for Static Signature Verification
    Zois, Elias N.
    Papagiannopoulou, Marianna
    Tsourounis, Dimitrios
    Economou, George
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 545 - 555
  • [10] Hierarchical sparse coding framework for speech emotion recognition
    Torres-Boza, Diana
    Oveneke, Meshia Cedric
    Wang, Fengna
    Jiang, Dongmei
    Verhelst, Werner
    Sahli, Hichem
    [J]. SPEECH COMMUNICATION, 2018, 99 : 80 - 89