Improvement the Bag of Words Image Representation Using Spatial Information

Cited by: 0
Authors
Farhangi, Mohammad Mehdi [1]
Soryani, Mohsen [1]
Fathy, Mahmood [1]
Affiliations
[1] Iran Univ Sci & Technol, Dept Comp Engn, Tehran, Iran
Keywords
BOW Representation; Spatial Information; N-gram Model; Spatial Pyramid Matching
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The bag of visual words (BOW) model is an effective way to represent images for classifying and detecting their contents. However, this representation contains no spatial information. In this paper we propose a novel image representation that adds two types of spatial information. The first type, the spatial locations of the words in the image, is added using the spatial pyramid matching approach. The second type is the spatial relations between words. To exploit this information, a binary tree structure that models the is-a relationships in the vocabulary is constructed from the visual words. This approach is a simple and computationally efficient way of modeling the spatial relations of the visual words, and it improves visual classification performance. We evaluated our method on visual classification of two well-known data sets, 15 Natural Scenes and Caltech-101.
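The first type of spatial information mentioned in the abstract, word locations encoded through a spatial pyramid, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name, its parameters, and the level weights (taken from the commonly cited spatial pyramid matching scheme) are assumptions made for illustration, and the tree-based modeling of word relations is not covered.

```python
import numpy as np


def spatial_pyramid_histogram(words, xs, ys, width, height, vocab_size, levels=2):
    """Concatenate weighted per-cell visual-word histograms over a pyramid of grids.

    words      -- visual-word index of each keypoint (0 .. vocab_size - 1)
    xs, ys     -- keypoint coordinates in pixels
    levels     -- finest pyramid level L; level l uses a 2**l x 2**l grid
    All names here are hypothetical and chosen only for this sketch.
    """
    words = np.asarray(words, dtype=np.int64)
    xs = np.asarray(xs, dtype=np.float64)
    ys = np.asarray(ys, dtype=np.float64)
    parts = []
    for level in range(levels + 1):
        cells = 2 ** level
        # Map each keypoint to a grid cell; clip so points on the far edge stay in range.
        cx = np.minimum((xs / width * cells).astype(np.int64), cells - 1)
        cy = np.minimum((ys / height * cells).astype(np.int64), cells - 1)
        # One histogram bin per (cell, word) pair, flattened into a single index.
        flat = (cy * cells + cx) * vocab_size + words
        hist = np.bincount(flat, minlength=cells * cells * vocab_size).astype(np.float64)
        # Assumed weighting: coarser levels count less than finer ones.
        if level == 0:
            weight = 1.0 / 2 ** levels
        else:
            weight = 1.0 / 2 ** (levels - level + 1)
        parts.append(weight * hist)
    return np.concatenate(parts)


# Usage: four keypoints, a three-word vocabulary, and a two-level pyramid (L = 1).
example = spatial_pyramid_histogram(
    words=[0, 1, 1, 2],
    xs=[10, 90, 90, 50],
    ys=[10, 10, 90, 50],
    width=100,
    height=100,
    vocab_size=3,
    levels=1,
)
```

With a vocabulary of size V and finest level L, the resulting vector has V * (1 + 4 + ... + 4**L) entries, so plain BOW is recovered as the special case L = 0.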
Pages: 681-690
Number of pages: 10