Improving bag-of-visual-words image retrieval with predictive clustering trees

被引:36
|
作者
Dimitrovski, Ivica [1 ]
Kocev, Dragi [2 ]
Loskovska, Suzana [1 ]
Dzeroski, Saso [2 ]
机构
[1] Univ Ss Cyril & Methodius, Fac Comp Sci & Engn, Skopje, North Macedonia
[2] Jozef Stefan Inst, Dept Knowledge Technol, Ljubljana, Slovenia
关键词
Image retrieval; Feature extraction; Visual codebook; Predictive clustering; SCALE; GEOMETRY;
D O I
10.1016/j.ins.2015.05.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recent overwhelming increase in the amount of available visual information, especially digital images, has brought up a pressing need to develop efficient and accurate systems for image retrieval. State-of-the-art systems for image retrieval use the bag-of-visual-words representation of images. However, the computational bottleneck in all such systems is the construction of the visual codebook, i.e., obtaining the visual words. This is typically performed by clustering hundreds of thousands or millions of local descriptors, where the resulting clusters correspond to visual words. Each image is then represented by a histogram of the distribution of its local descriptors across the codebook. The major issue in retrieval systems is that by increasing the sizes of the image databases, the number of local descriptors to be clustered increases rapidly: Thus, using conventional clustering techniques is infeasible. Considering this, we propose to construct the visual codebook by using predictive clustering trees (PCTs), which can be constructed and executed efficiently and have good predictive performance. Moreover, to increase the stability of the model, we propose to use random forests of predictive clustering trees. We create a random forest of PCTs that represents both the codebook and the indexing structure. We evaluate the proposed improvement of the bag-of-visual-words approach on three reference datasets and two additional datasets of 100 K images and 1 M images, compare it to two state-of-the-art methods based on approximate k-means and extremely randomized tree ensembles. The results reveal that the proposed method produces a visual codebook with superior discriminative power and thus better retrieval performance while maintaining excellent computational efficiency. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:851 / 865
页数:15
相关论文
共 50 条
  • [1] Image Retrieval using Extended Bag-of-Visual-Words
    Bhattacharya, Nandita
    Sil, Jaya
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1969 - 1975
  • [2] Weighting scheme for image retrieval based on bag-of-visual-words
    Zhu, Lei
    Jin, Hai
    Zheng, Ran
    Feng, Xiaowen
    [J]. IET IMAGE PROCESSING, 2014, 8 (09) : 509 - 518
  • [3] Image Reconstruction from Bag-of-Visual-Words
    Kato, Hiroharu
    Harada, Tatsuya
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 955 - 962
  • [4] Improving Bag-of-Visual-Words Towards Effective Facial Expressive Image Classification
    Al Chanti, Dawood
    Caplier, Alice
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 145 - 152
  • [5] Image Retrieval via Generalized I-Divergence in the Bag-of-Visual-Words Framework
    Rocha, B. M.
    Nogueira, E. A.
    Guliato, D.
    Ferreira, D. L. P.
    Barcelos, C. A. Z.
    [J]. 2014 21ST IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2014, : 734 - 737
  • [6] Attacking image classification based on Bag-of-Visual-Words
    Melloni, A.
    Bestagini, P.
    Costanzo, A.
    Barni, M.
    Tagliasacchi, M.
    Tubaro, S.
    [J]. PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS'13), 2013, : 103 - 108
  • [7] Bag-of-Visual-Words vs Global Image Descriptors on Two-Stage Multimodal Retrieval
    Zagoris, Konstantinos
    Chatzichristofis, Savvas A.
    Arampatzis, Avi
    [J]. PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1251 - 1252
  • [8] Spatial Weighting for Bag-of-Visual-Words and Its Application in Content-Based Image Retrieval
    Chen, Xin
    Hu, Xiaohua
    Shen, Xiajiong
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 867 - +
  • [9] Fisher Vectors: Beyond Bag-of-Visual-Words Image Representations
    Csurka, Gabriela
    Perronnin, Florent
    [J]. COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS: THEORY AND APPLICATIONS, 2011, 229 : 28 - 42
  • [10] An Effective Bag-of-Visual-Words Framework for SAR Image Classification
    Feng, Jie
    Jiao, L. C.
    Zhang, Xiangrong
    Niu, Ruican
    [J]. MIPPR 2011: REMOTE SENSING IMAGE PROCESSING, GEOGRAPHIC INFORMATION SYSTEMS, AND OTHER APPLICATIONS, 2011, 8006