Improving bag-of-visual-words image retrieval with predictive clustering trees

被引:36
|
作者
Dimitrovski, Ivica [1 ]
Kocev, Dragi [2 ]
Loskovska, Suzana [1 ]
Dzeroski, Saso [2 ]
机构
[1] Univ Ss Cyril & Methodius, Fac Comp Sci & Engn, Skopje, North Macedonia
[2] Jozef Stefan Inst, Dept Knowledge Technol, Ljubljana, Slovenia
关键词
Image retrieval; Feature extraction; Visual codebook; Predictive clustering; SCALE; GEOMETRY;
D O I
10.1016/j.ins.2015.05.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recent overwhelming increase in the amount of available visual information, especially digital images, has brought up a pressing need to develop efficient and accurate systems for image retrieval. State-of-the-art systems for image retrieval use the bag-of-visual-words representation of images. However, the computational bottleneck in all such systems is the construction of the visual codebook, i.e., obtaining the visual words. This is typically performed by clustering hundreds of thousands or millions of local descriptors, where the resulting clusters correspond to visual words. Each image is then represented by a histogram of the distribution of its local descriptors across the codebook. The major issue in retrieval systems is that by increasing the sizes of the image databases, the number of local descriptors to be clustered increases rapidly: Thus, using conventional clustering techniques is infeasible. Considering this, we propose to construct the visual codebook by using predictive clustering trees (PCTs), which can be constructed and executed efficiently and have good predictive performance. Moreover, to increase the stability of the model, we propose to use random forests of predictive clustering trees. We create a random forest of PCTs that represents both the codebook and the indexing structure. We evaluate the proposed improvement of the bag-of-visual-words approach on three reference datasets and two additional datasets of 100 K images and 1 M images, compare it to two state-of-the-art methods based on approximate k-means and extremely randomized tree ensembles. The results reveal that the proposed method produces a visual codebook with superior discriminative power and thus better retrieval performance while maintaining excellent computational efficiency. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:851 / 865
页数:15
相关论文
共 50 条
  • [41] A study of Bag-of-Visual-Words representations for handwritten keyword spotting
    David Aldavert
    Marçal Rusiñol
    Ricardo Toledo
    Josep Lladós
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2015, 18 : 223 - 234
  • [42] Audio Sentiment Analysis using Spectrogram and Bag-of-Visual-Words
    Luitel, Sophina
    Anwar, Mohd
    [J]. 2022 IEEE 23RD INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2022), 2022, : 200 - 205
  • [43] A study of Bag-of-Visual-Words representations for handwritten keyword spotting
    Aldavert, David
    Rusinol, Marcal
    Toledo, Ricardo
    Llados, Josep
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (03) : 223 - 234
  • [44] A Novel Bag-of-Visual-Words Approach for Geospatial Object Detection
    Aytekin, Caglar
    Alatan, A. Aydin
    [J]. OPTICAL PATTERN RECOGNITION XXII, 2011, 8055
  • [45] Bag-of-visual-words model for artificial pornographic images recognition
    Fang-fang Li
    Si-wei Luo
    Xi-yao Liu
    Bei-ji Zou
    [J]. Journal of Central South University, 2016, 23 : 1383 - 1389
  • [46] Pedestrian Detection Based on Bag-of-Visual-Words and SVM method
    Li, Jun
    Liao, Yuanjiang
    Zhang, Hongmei
    [J]. ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 189 - 192
  • [47] Bag-of-visual-words model for artificial pornographic images recognition
    Li Fang-fang
    Luo Si-wei
    Liu Xi-yao
    Zou Bei-ji
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2016, 23 (06) : 1383 - 1389
  • [48] Bag-of-Visual-Words Model for Classification of Interferometric SAR Images
    Cagatay, Nazli Deniz
    Datcu, Mihai
    [J]. 11TH EUROPEAN CONFERENCE ON SYNTHETIC APERTURE RADAR (EUSAR 2016), 2016, : 243 - 246
  • [49] Robust Acoustic Event Classification using Bag-of-Visual-Words
    Mulimani, Manjunath
    Koolagudi, Shashidhar G.
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3319 - 3322
  • [50] Collaborative Clustering Approach Based on Dempster-Shafer Theory for Bag-of-Visual-Words Codebook Generation
    Hafdhellaoui, Sabrine
    Boualleg, Yaakoub
    Farah, Mohamed
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 263 - 273