Improving visual vocabularies: A more discriminative, representative and compact bag of visual words

Cited by: 0
Authors
Chang, Leonardo [1]
Pérez-Suárez, Airel [1]
Hernández-Palancar, José [1]
Arias-Estrada, Miguel [2]
Sucar, L. Enrique [2]
Affiliations
[1] Advanced Technologies Application Center, 7A #21406 Siboney, Playa, Havana, C.P. 12220, Cuba
[2] Instituto Nacional de Astrofísica, Óptica y Electrónica, Luis Enrique Erro No. 1, Sta. María Tonantzintla, Puebla, C.P. 72840, Mexico
Source
Informatica (Slovenia) | 2017 / Vol. 41 / No. 3
Keywords
Classification (of information) - Object recognition - Computer vision
DOI
Not available
Abstract
In this paper, we introduce three properties, together with their corresponding quantitative evaluation measures, to assess the ability of a visual word to represent and discriminate an object class in the context of the bag of visual words (BoW) approach. Based on these properties, we also propose a methodology for reducing the size of the visual vocabulary, retaining those visual words that best describe an object class. Reducing the vocabulary provides a more reliable and compact image representation. Our proposal does not depend on the quantization method used for building the set of visual words, the feature descriptor, or the weighting scheme, which makes our approach suitable for any visual vocabulary. Throughout the experiments, we show that using only the most discriminative and representative visual words obtained by our proposed methodology improves classification performance; the best results obtained with our proposed method are statistically superior to those obtained with the entire vocabularies. On the Caltech-101 dataset, the average best results outperformed the baseline by 4.6% and 4.8% in mean classification accuracy using SVM and KNN, respectively. On the Pascal VOC 2006 dataset, there was a 1.6% and 4.7% improvement for SVM and KNN, respectively. Furthermore, these accuracy improvements were always obtained with more compact representations. Vocabularies 10 times smaller always obtained better accuracy than the baseline vocabularies on the Caltech-101 dataset, and did so in 93.75% of the experiments on the Pascal VOC dataset.
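The abstract does not spell out the three measures, so the following is only a minimal sketch of the general idea of vocabulary reduction: score each visual word, keep the top fraction, and drop the remaining columns from every image histogram. The score used here (the share of a word's mass held by its dominant class) is a hypothetical stand-in and not the measures proposed in the paper; the function names `select_visual_words` and `reduce_histograms` and the `keep_fraction` parameter are likewise assumptions made for illustration.

```python
import numpy as np

def select_visual_words(histograms, labels, keep_fraction=0.1):
    """Return sorted indices of the visual words to keep.

    histograms: (n_images, n_words) array of BoW counts.
    labels: (n_images,) array of class labels.
    NOTE: the score below is a hypothetical stand-in, not the paper's measures.
    """
    histograms = np.asarray(histograms, dtype=float)
    labels = np.asarray(labels)
    classes = np.unique(labels)
    # Mean count of every word within each class: shape (n_classes, n_words).
    class_means = np.stack(
        [histograms[labels == c].mean(axis=0) for c in classes]
    )
    totals = class_means.sum(axis=0)
    # Share of a word's mass held by its dominant class
    # (1.0 means the word occurs in a single class only).
    safe_totals = np.where(totals > 0, totals, 1.0)
    score = class_means.max(axis=0) / safe_totals
    score[totals == 0] = 0.0  # words that never occur carry no information
    n_keep = max(1, int(round(keep_fraction * histograms.shape[1])))
    keep = np.argsort(-score, kind="stable")[:n_keep]
    return np.sort(keep)

def reduce_histograms(histograms, keep):
    """Project BoW histograms onto the retained visual words."""
    return np.asarray(histograms)[:, keep]
```

With `keep_fraction=0.1` the reduced vocabulary is ten times smaller than the original, which matches the compression ratio quoted in the abstract; the reduced histograms can then be passed to any classifier such as SVM or KNN.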
Pages: 333-347
Related papers
50 in total
  • [1] Improving the Discriminative Power of Bag of Visual Words Model
    Ouni, Achref
    Urruty, Thierry
    Visani, Muriel
    MULTIMEDIA MODELING, MMM 2017, PT II, 2017, 10133 : 245 - 256
  • [2] Creating Compact and Discriminative Visual Vocabularies using Visual Bits
    Kirishanthy, T.
    Ramanan, A.
    2015 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2015, : 258 - 263
  • [3] Improving Bag of Visual Words Representations with Genetic Programming
    Jair Escalante, Hugo
    Martinez-Carraza, Jose
    Escalera, Sergio
    Ponce-Lopez, Victor
    Baro, Xavier
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [4] A quantitative evaluation of the conceptual consistency of visual words and visual vocabularies
    Stommel, M.
    Herzog, O.
    Xu, W. L.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 28 : 120 - 129
  • [5] Compact and Discriminative Approach for Encoding Spatial-Relationship of Visual Words
    Pedrosa, Glauco V.
    Traina, Agma J. M.
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 92 - 95
  • [6] A Quantitative Metric of Visual-words Separability for A More Discriminative Visual Vocabulary in An Unsupervised Manner
    Feng, Xin
    Li, Bo
    Ge, Yongxin
    Tan, Jiaxing
    2013 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2013,
  • [7] Discriminative Bag-of-Words-Based Adaptive Appearance Model for Robust Visual Tracking
    Zeng, Fanxiang
    Huang, Zhitong
    Ji, Yuefeng
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (06) : 907 - 911
  • [8] Image bag generator based on bag of visual words
    Zhao, Shu
    Xu, Chao
    Xu, Xiansheng
    Xu, Chenchu
    Zhang, Yanping
    Ye, Hong
    Journal of Information and Computational Science, 2013, 10 (05) : 1453 - 1462
  • [9] High discriminative SIFT feature and feature pair selection to improve the bag of visual words model
    Liu, Lifeng
    Ma, Yan
    Zhang, Xiangfen
    Zhang, Yuping
    Li, Shunbao
    IET IMAGE PROCESSING, 2017, 11 (11) : 994 - 1001
  • [10] Improving bag-of-visual-words image retrieval with predictive clustering trees
    Dimitrovski, Ivica
    Kocev, Dragi
    Loskovska, Suzana
    Dzeroski, Saso
    INFORMATION SCIENCES, 2016, 329 : 851 - 865