Beyond bag of latent topics: spatial pyramid matching for scene category recognition

被引:0
|
作者
Fu-xiang Lu
Jun Huang
机构
[1] Lanzhou University,School of Information Science & Engineering
[2] Chinese Academy of Sciences,Shanghai Advanced Research Institute
关键词
Scene category recognition; Probabilistic latent semantic analysis; Bag-of-words; Adaptive boosting; A; TP391.4;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a heterogeneous, mid-level feature based method for recognizing natural scene categories. The proposed feature introduces spatial information among the latent topics by means of spatial pyramid, while the latent topics are obtained by using probabilistic latent semantic analysis (pLSA) based on the bag-of-words representation. The proposed feature always performs better than standard pLSA because the performance of pLSA is adversely affected in many cases due to the loss of spatial information. By combining various interest point detectors and local region descriptors used in the bag-of-words model, the proposed feature can make further improvement for diverse scene category recognition tasks. We also propose a two-stage framework for multi-class classification. In the first stage, for each of possible detector/descriptor pairs, adaptive boosting classifiers are employed to select the most discriminative topics and further compute posterior probabilities of an unknown image from those selected topics. The second stage uses the prod-max rule to combine information coming from multiple sources and assigns the unknown image to the scene category with the highest ‘final’ posterior probability. Experimental results on three benchmark scene datasets show that the proposed method exceeds most state-of-the-art methods.
引用
收藏
页码:817 / 828
页数:11
相关论文
共 31 条
  • [21] Spatial pyramid face feature representation and weighted dissimilarity matching for improved face recognition
    Choi, Jae Young
    VISUAL COMPUTER, 2018, 34 (11): : 1535 - 1549
  • [22] Recognizing in the depth: Selective 3D Spatial Pyramid Matching Kernel for object and scene categorization
    Redondo-Cabrera, Carolina
    Lopez-Sastre, Roberto J.
    Acevedo-Rodriguez, Javier
    Maldonado-Bascon, Saturnino
    IMAGE AND VISION COMPUTING, 2014, 32 (12) : 965 - 978
  • [23] Spatial multi-scale gradient orientation consistency for place instance and Scene category recognition
    Gao, Changxin
    Sang, Nong
    Huang, Rui
    INFORMATION SCIENCES, 2016, 372 : 84 - 97
  • [24] Multitask joint spatial pyramid matching using sparse representation with dynamic coefficients for object recognition
    Hajigholam, Mohammad-Hossein
    Raie, Abolghasem-Asadollah
    Faez, Karim
    JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (02)
  • [25] Sparse coded spatial pyramid matching and multi-kernel integrated SVM for non-linear scene classification
    Gajjar, Bhavinkumar
    Mewada, Hiren
    Patani, Ashwin
    JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2021, 72 (06): : 374 - 380
  • [26] Fusing integrated visual vocabularies-based bag of visual words and weighted colour moments on spatial pyramid layout for natural scene image classification
    Yousef Alqasrawi
    Daniel Neagu
    Peter I. Cowling
    Signal, Image and Video Processing, 2013, 7 : 759 - 775
  • [27] Fusing integrated visual vocabularies-based bag of visual words and weighted colour moments on spatial pyramid layout for natural scene image classification
    Alqasrawi, Yousef
    Neagu, Daniel
    Cowling, Peter I.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2013, 7 (04) : 759 - 775
  • [28] Intestinal Polyp Recognition Based on Salient Codebook Locality-Constrained Linear Coding with Annular Spatial Pyramid Matching
    Dongwei He
    Sheng Li
    Xiongxiong He
    Liping Chang
    Ni Zhang
    Qianru Jiang
    Journal of Medical and Biological Engineering, 2020, 40 : 473 - 483
  • [29] Intestinal Polyp Recognition Based on Salient Codebook Locality-Constrained Linear Coding with Annular Spatial Pyramid Matching
    He, Dongwei
    Li, Sheng
    He, Xiongxiong
    Chang, Liping
    Zhang, Ni
    Jiang, Qianru
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2020, 40 (04) : 473 - 483
  • [30] Improving the Bag-of-Words model with Spatial Pyramid matching using data augmentation for fine-grained arbitrary-oriented ship classification
    Viet Hung Luu
    Van Kiet Dinh
    Nguyen Hoang Hoa Luong
    Quang Hung Bui
    Thi Nhat Thanh Nguyen
    REMOTE SENSING LETTERS, 2019, 10 (09) : 826 - 834