Improvement the Bag of Words Image Representation Using Spatial Information

Cited by: 0

Authors:

Farhangi, Mohammad Mehdi [1]

Soryani, Mohsen [1]

Fathy, Mahmood [1]

Affiliation:

[1] Iran Univ Sci & Technol, Dept Comp Engn, Tehran, Iran

Source:

ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 2 | 2013 / Vol. 177

Keywords:

BOW Representation; Spatial Information; N-gram Model; Spatial Pyramid Matching

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The bag of visual words (BOW) model is an effective way to represent images for classifying and detecting their contents. However, this representation does not contain any spatial information. In this paper we propose a novel image representation that adds two types of spatial information. The first type, the spatial locations of the words in the image, is added using the spatial pyramid matching approach. The second type is the spatial relation between words. To explore this information, a binary tree structure that models the is-a relationships in the vocabulary is constructed from the visual words. This approach is a simple and computationally efficient way to model the spatial relations of the visual words, and it improves visual classification performance. We evaluated our method on visual classification of two well-known data sets, namely 15 natural scenes and Caltech-101.
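The first type of spatial information mentioned in the abstract (word locations, added through spatial pyramid matching) can be illustrated with a minimal sketch. This is not the authors' implementation: it shows only the generic idea of concatenating per-cell visual-word histograms over progressively finer grids, it omits the per-level weights used in spatial pyramid matching, and it does not cover the binary tree over the vocabulary. The function name `spatial_pyramid_histogram`, its parameters, and the assumption that keypoint coordinates are normalized to [0, 1] are all hypothetical choices made for illustration.

```python
import numpy as np

def spatial_pyramid_histogram(words, xy, vocab_size, levels=2):
    """Concatenate per-cell visual-word histograms over a spatial pyramid.

    words: (N,) visual-word indices in [0, vocab_size)   (assumed input)
    xy:    (N, 2) keypoint positions normalized to [0, 1] (assumed input)
    """
    words = np.asarray(words, dtype=int)
    xy = np.asarray(xy, dtype=float)
    parts = []
    for level in range(levels + 1):
        cells = 2 ** level  # number of grid cells per axis at this level
        # Map each keypoint to a grid cell; clip so a coordinate of 1.0 stays in range.
        col = np.minimum((xy[:, 0] * cells).astype(int), cells - 1)
        row = np.minimum((xy[:, 1] * cells).astype(int), cells - 1)
        cell_id = row * cells + col
        # Count joint (cell, word) occurrences with a single bincount call.
        flat = cell_id * vocab_size + words
        parts.append(np.bincount(flat, minlength=cells * cells * vocab_size))
    # Unweighted concatenation: level 0 is the plain BOW histogram.
    return np.concatenate(parts)

# Four keypoints, one near each image corner, with a 3-word vocabulary.
h = spatial_pyramid_histogram(
    words=[0, 1, 1, 2],
    xy=[[0.1, 0.1], [0.9, 0.1], [0.9, 0.9], [0.1, 0.9]],
    vocab_size=3,
    levels=1,
)
```

With `levels=1` the result has 3 × (1 + 4) = 15 entries: the first 3 form the ordinary BOW histogram of the whole image, and the remaining 12 record which words fall in each of the four quadrants.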


Pages: 681-690

Page count: 10
