Improvement the Bag of Words Image Representation Using Spatial Information

被引:0
|
作者
Farhangi, Mohammad Mehdi [1 ]
Soryani, Mohsen [1 ]
Fathy, Mahmood [1 ]
机构
[1] Iran Univ Sci & Technol, Dept Comp Engn, Tehran, Iran
关键词
BOW Representation; Spatial Information; N-gram Model; Spatial Pyramid Matching;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bag of visual words (BOW) model is an effective way to represent images in order to classify and detect their contents. However, this type of representation suffers from the fact that, it does not contain any spatial information. In this paper we propose a novel image representation which adds two types of spatial information. The first type which is the spatial locations of the words in the image is added using the spatial pyramid matching approach. The second type is the spatial relation between words. To explore this information a binary tree structure which models the is-a relationships in the vocabulary is constructed from the visual words. This approach is a simple and computationally effective way for modeling the spatial relations of the visual words which shows improvement on the visual classification performance. We evaluated our method on visual classification of two known data sets, namely 15 natural scenes and Caltech-101.
引用
收藏
页码:681 / 690
页数:10
相关论文
共 50 条
  • [21] The influence of preprocessing on text classification using a bag-of-words representation
    HaCohen-Kerner, Yaakov
    Miller, Daniel
    Yigal, Yair
    PLOS ONE, 2020, 15 (05):
  • [22] Image Retrieval using Extended Bag-of-Visual-Words
    Bhattacharya, Nandita
    Sil, Jaya
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1969 - 1975
  • [23] Image Classification Model Using Visual Bag of Semantic Words
    Yali Qi
    Guoshan Zhang
    Yeli Li
    Pattern Recognition and Image Analysis, 2019, 29 : 404 - 414
  • [24] Visual Cognitive Attention based Bag-of-words Image Representation for Object Discovery
    Ma, Zhong
    Wang, Zhuping
    PROCEEDINGS OF 2018 IEEE 17TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2018), 2018, : 234 - 239
  • [25] Entropy Optimized Feature-Based Bag-of-Words Representation for Information Retrieval
    Passalis, Nikolaos
    Tefas, Anastasios
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (07) : 1664 - 1677
  • [26] EXPANDED BAG OF WORDS REPRESENTATION FOR OBJECT CLASSIFICATION
    Liu, Tinglin
    Liu, Jing
    Liu, Qinshan
    Lu, Hanqing
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 297 - 300
  • [27] Towards Visual Words to Words Text Detection with a General Bag of Words Representation
    Mehta, Rakesh
    Chum, Ondrej
    Matas, Jiri
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 641 - 645
  • [28] A "Bag" or a "Window" of Words for Information Filtering?
    Nanas, Nikolaos
    Vavalis, Manolis
    ARTIFICIAL INTELLIGENCE: THEORIES, MODELS AND APPLICATIONS, SETN 2008, 2008, 5138 : 182 - 193
  • [29] Recurrent Neural Network Language Model With Incremental Updated Context Information Generated Using Bag-of-Words Representation
    Haidar, Md. Akmal
    Kurimo, Mikko
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3504 - 3508
  • [30] A Multi-Sample, Multi-Tree Approach to Bag-of-Words Image Representation for Image Retrieval
    Wu, Zhong
    Ke, Qifa
    Sun, Jian
    Shum, Heung-Yeung
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 1992 - 1999