Discriminative part model for visual recognition

被引:15
|
作者
Sicre, Ronan [1 ]
Jurie, Frederic [1 ]
机构
[1] Univ Caen Basse Normandie, CNRS, UMR 6072, ENSICAEN, Caen, France
关键词
Computer vision; Image classification; Visual recognition; Part-based models;
D O I
10.1016/j.cviu.2015.08.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent literature on visual recognition and image classification has been mainly focused on Deep Convolutional Neural Networks (Deep CNN) [A. Krizhevsky, I. Sutskever, G. E. Hinton, Imagenet classification with deep convolutional neural networks, in: Advances in neural information processing systems, 2012, pp. 1097-1105.] and their variants, which has resulted in a significant progression of the performance of these algorithms. Building on these recent advances, this paper proposes to explicitly add translation and scale invariance to Deep CNN-based local representations, by introducing a new algorithm for image recognition which is modeling image categories as a collection of automatically discovered distinctive parts. These parts are matched across images while learning their visual model and are finally pooled to provide images signatures. The appearance model of the parts is learnt from the training images to allow the distinction between the categories to be recognized. A key ingredient of the approach is a softassign-like matching algorithm that simultaneously learns the model of each part and automatically assigns image regions to the model's parts. Once the model of the category is trained, it can be used to classify new images by finding image's regions similar to the learned parts and encoding them in a single compact signature. The experimental validation shows that the performance of the proposed approach is better than those of the latest Deep Convolutional Neural Networks approaches, hence providing state-of-the art results on several publicly available datasets. (c) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:28 / 37
页数:10
相关论文
共 50 条
  • [21] Enhancing discriminative appearance model for visual tracking
    He, Xuedong
    Chen, Calvin Yu-Chian
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 219
  • [22] A Discriminative Model for Age Invariant Face Recognition
    Li, Zhifeng
    Park, Unsang
    Jain, Anil K.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2011, 6 (03) : 1028 - 1037
  • [23] DISCRIMINATIVE MODEL SELECTION FOR OBJECT MOTION RECOGNITION
    Nascimento, Jacinto C.
    Marques, Jorge S.
    Figueiredo, Mario A. T.
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3953 - 3956
  • [24] Unsupervised part learning for visual recognition
    Sicre, Ronan
    Avrithis, Yannis
    Kijak, Ewa
    Jurie, Frederic
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3116 - 3124
  • [25] Dynamic visual features based on discriminative speech class projection for visual speech recognition
    Lei, X
    Cai, XL
    Fu, ZH
    Zhao, RC
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 687 - 690
  • [26] Fast Learning Discriminative Dictionaries for Large-scale Visual Recognition
    Zhao, Tianyi
    Qu, Yanyun
    Fan, Jianping
    2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
  • [27] SDE: A Novel Selective, Discriminative and Equalizing Feature Representation for Visual Recognition
    Xie, Guo-Sen
    Zhang, Xu-Yao
    Yan, Shuicheng
    Liu, Cheng-Lin
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 124 (02) : 145 - 168
  • [28] Learning discriminative visual semantic embedding for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Yuan, Jianying
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 115
  • [29] Discriminative Learning of Relaxed Hierarchy for Large-scale Visual Recognition
    Gao, Tianshi
    Koller, Daphne
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 2072 - 2079
  • [30] Learning Spatially Embedded Discriminative Part Detectors for Scene Character Recognition
    Wang, Yanna
    Shi, Cunzhao
    Xiao, Baihua
    Wang, Chunheng
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 363 - 368