Multi-scale Orderless Pooling of Deep Convolutional Activation Features

被引:0
|
作者
Gong, Yunchao [1 ]
Wang, Liwei [2 ]
Guo, Ruiqi [2 ]
Lazebnik, Svetlana [2 ]
机构
[1] Univ N Carolina, Chapel Hill, NC 27515 USA
[2] Univ Illinois, Urbana, IL 61801 USA
来源
基金
美国国家科学基金会;
关键词
CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional neural networks (CNN) have shown their promise as a universal representation for recognition. However, global CNN activations lack geometric invariance, which limits their robustness for classification and matching of highly variable scenes. To improve the invariance of CNN activations without degrading their discriminative power, this paper presents a simple but effective scheme called multiscale orderless pooling (MOP-CNN). This scheme extracts CNN activations for local patches at multiple scale levels, performs orderless VLAD pooling of these activations at each level separately, and concatenates the result. The resulting MOP-CNN representation can be used as a generic feature for either supervised or unsupervised recognition tasks, from image classification to instance-level retrieval; it consistently outperforms global CNN activations without requiring any joint training of prediction layers for a particular target dataset. In absolute terms, it achieves state-of-the-art results on the challenging SUN397 and MIT Indoor Scenes classification datasets, and competitive results on ILSVRC2012/2013 classification and INRIA Holidays retrieval datasets.
引用
收藏
页码:392 / 407
页数:16
相关论文
共 50 条
  • [1] No-Reference Image Quality Assessment with Multi-Scale Orderless Pooling of Deep Features
    Varga, Domonkos
    [J]. JOURNAL OF IMAGING, 2021, 7 (07)
  • [2] Multi-scale Pyramid Pooling for Deep Convolutional Representation
    Yoo, Donggeun
    Park, Sunggyun
    Lee, Joon-Young
    Kweon, In So
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [3] Multi-scale orderless cross-regions-pooling of deep attributes for image retrieval
    Luo, Jianwei
    Jiang, Zhiguo
    Li, Jianguo
    [J]. ELECTRONICS LETTERS, 2016, 52 (04) : 276 - +
  • [4] Scene understanding based on Multi-Scale Pooling of deep learning features
    Li, DongYang
    Zhou, Yue
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 1732 - 1737
  • [5] Multi-scale Hybrid Pooling Convolutional Neural Network Algorithm
    Zhao, Nan
    Wang, Xin
    Li, Ying-na
    Wu, Sheng
    [J]. 2018 INTERNATIONAL CONFERENCE ON ELECTRICAL, CONTROL, AUTOMATION AND ROBOTICS (ECAR 2018), 2018, 307 : 339 - 342
  • [6] Part-based Player Identification using Deep Convolutional Representation and Multi-scale Pooling
    Senocak, Arda
    Oh, Tae-Hyun
    Kim, Junsik
    Kweon, In So
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1813 - 1820
  • [7] Multi-Scale Correlation Tracking with Convolutional Features
    Xu, Yulong
    Li, Yang
    Wang, Jiabao
    Zou, Shan
    Miao, Zhuang
    Zhang, Yafei
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, : 310 - 314
  • [8] Applying Multi-scale Features in Deep Convolutional Neural Networks for Myanmar Speech Recognition
    Soe, Thandar
    Maung, Su Su
    Oo, Nyein Nyein
    [J]. 2018 15TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2018, : 98 - 101
  • [9] Transferring scale-independent features to support multi-scale object recognition with deep convolutional neural network
    Zhou, Xiran
    [J]. 26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, : 614 - 615
  • [10] Multi-scale texture classification from generalized locally orderless images
    van Ginneken, B
    Romeny, BMT
    [J]. PATTERN RECOGNITION, 2003, 36 (04) : 899 - 911