Compact and Discriminative Approach for Encoding Spatial-Relationship of Visual Words

被引:3
|
作者
Pedrosa, Glauco V. [1 ]
Traina, Agma J. M. [1 ]
机构
[1] Univ Sao Paulo, ICMC, Sao Carlos, SP, Brazil
关键词
image representation; local features; bag-of-features; spatial-relationship; visual words; visual dictionaries;
D O I
10.1145/2695664.2695951
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Bag-of-Visual-Words approach has been successfully used for video and image analysis by encoding local features as visual words, and the final representation is a histogram of the visual words detected in the image. One limitation of this approach relies on its inability of encoding spatial distribution of the visual words within an image, which is important for similarity measurement between images. In this paper, we present a novel technique to incorporate spatial information, called Global Spatial Arrangement (GSA). The idea is to split the image space into quadrants using each detected point as origin. To ensure rotation invariance, we use the information of the gradient of each detected point to define each quarter of the quadrant. The final representation uses only two extra information into the final feature vector to encode the spatial arrangement of visual words, with the advantage of being invariant to rotation. We performed representative experimental evaluations using several public datasets. Compared to other techniques, such as the Spatial Pyramid (SP), the proposed method needs 90% less information to encode spatial information of visual words. The results in image retrieval and classification demonstrated that our proposed approach improved the retrieval accuracy compared to other traditional techniques, while being the most compact descriptor.
引用
收藏
页码:92 / 95
页数:4
相关论文
共 50 条
  • [21] Visual Tracking via Subspace Learning: A Discriminative Approach
    Yao Sui
    Yafei Tang
    Li Zhang
    Guanghui Wang
    International Journal of Computer Vision, 2018, 126 : 515 - 536
  • [22] Bag of Visual Words Approach for Classification of Benign and Malignant Masses in Mammograms Using Voting Based Feature Encoding
    Suhail, Zobia
    Mahmood, Arif
    Denton, Erika R. E.
    Zwiggelaar, Reyer
    14TH INTERNATIONAL WORKSHOP ON BREAST IMAGING (IWBI 2018), 2018, 10718
  • [23] Spatial Verification via Compact Words for Mobile Instance Search
    Wang, Bo
    Shao, Jie
    He, Chengkun
    Hu, Gang
    Xu, Xing
    MULTIMEDIA MODELING, MMM 2017, PT II, 2017, 10133 : 356 - 367
  • [24] MINING THE SPATIAL DISTRIBUTION OF VISUAL WORDS FOR SCENE CLASSIFICATION
    Hu, Jingwen
    Lp, Menghan
    Xia, Gui-Song
    Zhang, Liangpei
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 2415 - 2418
  • [25] A new bag of visual words encoding method for human action recognition
    Cortes, Xavier
    Conte, Donatello
    Cardot, Hubert
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2480 - 2485
  • [26] Is there a cost at encoding words with joined letters during visual word recognition?
    Roldan, Manuel
    Marcet, Ana
    Perea, Manuel
    PSICOLOGICA, 2018, 39 (02): : 279 - 291
  • [27] Serial and semantic encoding of lists of words in schizophrenia patients with visual hallucinations
    Brebion, Gildas
    Ohlsen, Ruth I.
    Pilowsky, Lyn S.
    David, Anthony S.
    PSYCHIATRY RESEARCH, 2011, 186 (01) : 5 - 10
  • [28] A LEFT VISUAL-FIELD BIAS FOR SEMANTIC ENCODING OF UNATTENDED WORDS
    LAMBERT, A
    VOOT, N
    NEUROPSYCHOLOGIA, 1993, 31 (01) : 67 - 73
  • [29] Discriminative Bag-of-Words-Based Adaptive Appearance Model for Robust Visual Tracking
    Zeng, Fanxiang
    Huang, Zhitong
    Ji, Yuefeng
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (06) : 907 - 911
  • [30] HEMISPHERIC ASYMMETRY - VERBAL AND SPATIAL ENCODING OF VISUAL STIMULI
    GEFFEN, G
    BRADSHAW, JL
    NETTLETON, NC
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1972, 95 (01): : 25 - +