Improvement the Bag of Words Image Representation Using Spatial Information

Cited by: 0
Authors
Farhangi, Mohammad Mehdi [1]
Soryani, Mohsen [1]
Fathy, Mahmood [1]
Affiliations
[1] Iran Univ Sci & Technol, Dept Comp Engn, Tehran, Iran
Keywords
BOW Representation; Spatial Information; N-gram Model; Spatial Pyramid Matching
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The bag of visual words (BOW) model is an effective way to represent images for classifying and detecting their contents. However, this representation does not contain any spatial information. In this paper we propose a novel image representation that adds two types of spatial information. The first type, the spatial locations of the words in the image, is added using the spatial pyramid matching approach. The second type is the spatial relation between words. To explore this information, a binary tree structure that models the is-a relationships in the vocabulary is constructed from the visual words. This approach is a simple and computationally efficient way to model the spatial relations of the visual words, and it improves visual classification performance. We evaluated our method on visual classification of two well-known data sets, namely 15 natural scenes and Caltech-101.
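As an illustration of the first type of spatial information mentioned in the abstract, the following is a minimal sketch of a spatial pyramid histogram over visual words. It is not the authors' implementation. It assumes the commonly used spatial pyramid matching level weights, feature coordinates already normalized to [0, 1], and visual word ids already assigned by some vocabulary. The names `Feature` and `spatial_pyramid_histogram` are hypothetical.

```python
from typing import List, Tuple

# Hypothetical feature type: (x, y, visual_word_id), with x and y
# assumed to be normalized to the range [0, 1].
Feature = Tuple[float, float, int]


def spatial_pyramid_histogram(
    features: List[Feature], vocab_size: int, levels: int = 2
) -> List[float]:
    """Concatenate weighted per-cell visual-word histograms over a grid pyramid."""
    descriptor: List[float] = []
    for level in range(levels + 1):
        cells = 2 ** level  # the grid has cells x cells regions at this level
        # Assumed weighting: the usual spatial pyramid matching weights,
        # which give finer levels a larger weight.
        if level == 0:
            weight = 1.0 / 2 ** levels
        else:
            weight = 1.0 / 2 ** (levels - level + 1)
        hist = [0.0] * (cells * cells * vocab_size)
        for x, y, word in features:
            # Clamp so that a coordinate of exactly 1.0 falls in the last cell.
            col = min(int(x * cells), cells - 1)
            row = min(int(y * cells), cells - 1)
            hist[(row * cells + col) * vocab_size + word] += weight
        descriptor.extend(hist)
    return descriptor


if __name__ == "__main__":
    # Three features, a two-word vocabulary, and a two-level pyramid (levels 0 and 1).
    toy = [(0.1, 0.1, 0), (0.9, 0.9, 1), (0.2, 0.8, 0)]
    print(spatial_pyramid_histogram(toy, vocab_size=2, levels=1))
```

The descriptor length is `vocab_size * sum(4 ** l for l in range(levels + 1))`, so the toy call yields 10 values. The second type of spatial information (relations between words) is not covered by this sketch, because the abstract does not give enough detail to reproduce it.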
Pages: 681-690
Number of pages: 10