Harvesting Discriminative Meta Objects with Deep CNN Features for Scene Classification

被引:82
|
作者
Wu, Ruobing [1 ]
Wang, Baoyuan [2 ]
Wang, Wenping [1 ]
Yu, Yizhou [1 ]
机构
[1] Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[2] Microsoft Technol & Res, Redmond, WA USA
关键词
D O I
10.1109/ICCV.2015.152
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent work on scene classification still makes use of generic CNN features in a rudimentary manner. In this paper, we present a novel pipeline built upon deep CNN features to harvest discriminative visual objects and parts for scene classification. We first use a region proposal technique to generate a set of high-quality patches potentially containing objects, and apply a pre-trained CNN to extract generic deep features from these patches. Then we perform both unsupervised and weakly supervised learning to screen these patches and discover discriminative ones representing category-specific objects and parts. We further apply discriminative clustering enhanced with local CNN fine-tuning to aggregate similar objects and parts into groups, called meta objects. A scene image representation is constructed by pooling the feature response maps of all the learned meta objects at multiple spatial scales. We have confirmed that the scene image representation obtained using this new pipeline is capable of delivering state-of-the-art performance on two popular scene benchmark datasets, MIT Indoor 67 [22] and Sun397 [31].
引用
收藏
页码:1287 / 1295
页数:9
相关论文
共 50 条
  • [1] Learning Discriminative and Shareable Features for Scene Classification
    Zuo, Zhen
    Wang, Gang
    Shuai, Bing
    Zhao, Lifan
    Yang, Qingxiong
    Jiang, Xudong
    COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 : 552 - 568
  • [2] Integrating Local Features into Discriminative Graphlets for Scene Classification
    Zhang, Luming
    Bian, Wei
    Song, Mingli
    Tao, Dacheng
    Liu, Xiao
    NEURAL INFORMATION PROCESSING, PT III, 2011, 7064 : 657 - +
  • [3] Deep Discriminative Representation Learning with Attention Map for Scene Classification
    Li, Jun
    Lin, Daoyu
    Wang, Yang
    Xu, Guangluan
    Zhang, Yunyan
    Ding, Chibiao
    Zhou, Yanhai
    REMOTE SENSING, 2020, 12 (09)
  • [4] Efficient Learning of Sample-Specific Discriminative Features for Scene Classification
    Han, Yina
    Liu, Guizhong
    IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (11) : 683 - 686
  • [5] Scene Classification with the Discriminative Representation
    Sun, Hao
    Chen, Yaxiong
    Chen, Wenjing
    Huang, Zhangcan
    2017 2ND INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP), 2017, : 267 - 271
  • [6] Capturing Discriminative Information Using a Deep Architecture in Acoustic Scene Classification
    Shim, Hye-jin
    Jung, Jee-weon
    Kim, Ju-ho
    Yu, Ha-jin
    APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [7] Deriving High-Level Scene Descriptions from Deep Scene CNN Features
    Bayat, Akram
    Pomptun, Marc
    PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017), 2017,
  • [8] Scene Categorization Through Using Objects Represented by Deep Features
    Bai, Shuang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (09)
  • [9] Sphere Loss: Learning Discriminative Features for Scene Classification in a Hyperspherical Feature Space
    Wang, Jue
    Chen, He
    Ma, Long
    Chen, Liang
    Gong, Xiaodong
    Liu, Wenchao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [10] Deep Discriminative CNN with Temporal Ensembling for Ambiguously-Labeled Image Classification
    Yao, Yao
    Deng, Jiehui
    Chen, Xiuhua
    Gong, Chen
    Wu, Jianxin
    Yang, Jian
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12669 - 12676