Compact and Discriminative Approach for Encoding Spatial-Relationship of Visual Words

被引:3
|
作者
Pedrosa, Glauco V. [1 ]
Traina, Agma J. M. [1 ]
机构
[1] Univ Sao Paulo, ICMC, Sao Carlos, SP, Brazil
关键词
image representation; local features; bag-of-features; spatial-relationship; visual words; visual dictionaries;
D O I
10.1145/2695664.2695951
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Bag-of-Visual-Words approach has been successfully used for video and image analysis by encoding local features as visual words, and the final representation is a histogram of the visual words detected in the image. One limitation of this approach relies on its inability of encoding spatial distribution of the visual words within an image, which is important for similarity measurement between images. In this paper, we present a novel technique to incorporate spatial information, called Global Spatial Arrangement (GSA). The idea is to split the image space into quadrants using each detected point as origin. To ensure rotation invariance, we use the information of the gradient of each detected point to define each quarter of the quadrant. The final representation uses only two extra information into the final feature vector to encode the spatial arrangement of visual words, with the advantage of being invariant to rotation. We performed representative experimental evaluations using several public datasets. Compared to other techniques, such as the Spatial Pyramid (SP), the proposed method needs 90% less information to encode spatial information of visual words. The results in image retrieval and classification demonstrated that our proposed approach improved the retrieval accuracy compared to other traditional techniques, while being the most compact descriptor.
引用
收藏
页码:92 / 95
页数:4
相关论文
共 50 条
  • [41] High discriminative SIFT feature and feature pair selection to improve the bag of visual words model
    Liu, Lifeng
    Ma, Yan
    Zhang, Xiangfen
    Zhang, Yuping
    Li, Shunbao
    IET IMAGE PROCESSING, 2017, 11 (11) : 994 - 1001
  • [42] Spatial orientations of visual word pairs to improve Bag-of-Visual-Words model
    Khan, Rahat
    Barat, Cecile
    Muselet, Damien
    Ducottet, Christophe
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [43] Coherent encoding of subjective spatial position in visual cortex and hippocampus
    Aman B. Saleem
    E. Mika Diamanti
    Julien Fournier
    Kenneth D. Harris
    Matteo Carandini
    Nature, 2018, 562 : 124 - 127
  • [44] Designing information fusion for the encoding of visual-spatial information
    Waldron, Samuel M.
    Patrick, John
    Duggan, Geoffrey B.
    Banbury, Simon
    Howes, Andrew
    ERGONOMICS, 2008, 51 (06) : 775 - 797
  • [45] Temporal Encoding of Spatial Information during Active Visual Fixation
    Kuang, Xutao
    Poletti, Martina
    Victor, Jonathan D.
    Rucci, Michele
    CURRENT BIOLOGY, 2012, 22 (06) : 510 - 514
  • [46] TEMPORAL AND SPATIAL CHARACTERISTICS OF SELECTIVE ENCODING FROM VISUAL DISPLAYS
    ERIKSEN, CW
    HOFFMAN, JE
    PERCEPTION & PSYCHOPHYSICS, 1972, 12 (2B): : 201 - &
  • [47] Melanopsin Contributions to Encoding Spatial Patterns in the Mouse Visual System
    Allen, Annette
    Martial, Franck
    Lucas, Robert
    PERCEPTION, 2016, 45 (06) : 692 - 692
  • [48] Distinct Encoding of Spatial and Nonspatial Visual Information in Parietal Cortex
    Freedman, David J.
    Assad, John A.
    JOURNAL OF NEUROSCIENCE, 2009, 29 (17): : 5671 - 5680
  • [49] Coherent encoding of subjective spatial position in visual cortex and hippocampus
    Saleem, Aman B.
    Diamanti, E. Mika
    Fournier, Julien
    Harris, Kenneth D.
    Carandini, Matteo
    NATURE, 2018, 562 (7725) : 124 - +
  • [50] DSP: Discriminative Spatial Part modeling for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Dongming
    Li, Jintao
    Zhou, Jianshe
    Zhang, Shiliang
    Zhang, Yongdong
    IMAGE AND VISION COMPUTING, 2017, 63 : 24 - 37