Improvement the Bag of Words Image Representation Using Spatial Information

被引：0

作者：

Farhangi, Mohammad Mehdi ^{[1
]}

Soryani, Mohsen ^{[1
]}

Fathy, Mahmood ^{[1
]}

机构：

[1] Iran Univ Sci & Technol, Dept Comp Engn, Tehran, Iran

来源：

ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 2 | 2013年 / 177卷

关键词：

BOW Representation; Spatial Information; N-gram Model; Spatial Pyramid Matching;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Bag of visual words (BOW) model is an effective way to represent images in order to classify and detect their contents. However, this type of representation suffers from the fact that, it does not contain any spatial information. In this paper we propose a novel image representation which adds two types of spatial information. The first type which is the spatial locations of the words in the image is added using the spatial pyramid matching approach. The second type is the spatial relation between words. To explore this information a binary tree structure which models the is-a relationships in the vocabulary is constructed from the visual words. This approach is a simple and computationally effective way for modeling the spatial relations of the visual words which shows improvement on the visual classification performance. We evaluated our method on visual classification of two known data sets, namely 15 natural scenes and Caltech-101.

引用

页码：681 / 690

页数：10

共 50 条

[41] The locally weighted bag of words framework for document representation
Lebanon, Guy
Mao, Yi
Dillon, Joshua
Journal of Machine Learning Research, 2007, 8 : 2405 - 2441
[42] A Bag of Strings Representation for Image Categorization
Ros, Julien
Laurent, Christophe
Jolion, Jean-Michel
JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2009, 35 (01) : 51 - 67
[43] A Bag of Strings Representation for Image Categorization
Julien Ros
Christophe Laurent
Jean-Michel Jolion
Journal of Mathematical Imaging and Vision, 2009, 35 : 51 - 67
[44] The locally weighted bag of words framework for document representation
Lebanon, Guy
Mao, Yi
Dillon, Joshua
JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 2405 - 2441
[45] Fuzzy bag of words for social image description
Li, Yanshan
Liu, Weiming
Huang, Qinghua
Li, Xuelong
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (03) : 1371 - 1390
[46] Fuzzy bag of words for social image description
Yanshan Li
Weiming Liu
Qinghua Huang
Xuelong Li
Multimedia Tools and Applications, 2016, 75 : 1371 - 1390
[47] Bag-of-Multimedia-Words for Image Classification
Znaidia, Amel
Shabou, Aymen
Le Borgne, Herye
Hudelot, Celine
Paragios, Nikos
2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1509 - 1512
[48] Content-Based Image (Object) Retrieval with Rotational Invariant Bag-of-Visual Words Representation
Chathurani, N. W. U. D.
Geva, S.
Chandran, V
Cynthujah, V
2015 IEEE 10TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2015, : 152 - 157
[49] Blood Cell Image Retrieval System Using Color, Shape and Bag of Words
Zare, Mohammad Reza
Seng, Woo Chaw
NEURAL INFORMATION PROCESSING, ICONIP 2014, PT III, 2014, 8836 : 218 - 225
[50] Evaluating Weighting Schemes for Adult Image Detection using Bag of Visual Words
Choi, SuGil
Han, SeungWan
2013 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2013): FUTURE CREATIVE CONVERGENCE TECHNOLOGIES FOR NEW ICT ECOSYSTEMS, 2013, : 817 - 818

← 1 2 3 4 5 →