Object Retrieval Using Image Semantic Structure Groupings

被引：0

作者：

Ahmad, Nishat ^{[1
]}

Lee, Younghun ^{[2
]}

Park, Jongan ^{[1
]}

机构：

[1] Chosun Univ, Gwangju, South Korea

[2] Hannam Univ, Daejeon, South Korea

来源：

INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING | 2013年 / 6卷 / 02期

关键词：

object recognition; semantic structures; graph theoretic;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

This paper explores basic level of semantic structure formation in the human vision inferential processes in line with Gestalt laws and proposes micro level semantic structure formations and their relational combinations. Using this approach two sets of semantic features have been derived for visual object class recognition. The first algorithm uses the hypothesis in line with Gestalt laws of proximity that; in an image, basic semantic structures are formed by line segments ( arcs also approximated and broken into smaller line segments based on pixel deviation threshold) which are in close proximity of each other. Based on the notion of proximity a transitive relation is defined, which combines basic micro level semantic structures hierarchically till such a point where semantic meanings of the structure can be extracted. The algorithm extracts line segments in an image and then forms semantic groups of these line segments based on a minimum distance threshold from each other. The line segment groups so formed can be differentiated from each other, by the number of group members and their geometrical properties. The geometrical properties of these semantic groups are used to generate rotation, translation and scale invariant histograms used as feature vectors for object class recognition tasks in a K-nearest neighbor framework. In the second approach a semantic group based on the proximity distance is clustered and modeled as a graph vertex. The line segments which are common to more than one semantic group are defined as semantic relations between the semantic groups and are modeled as edges of the graph. This way an image object is transformed into a graph using micro level structure formations. Each vertex and edge is labeled using translation, rotation and scale invariant properties of the member segments of each vertex and edge. From a set of training images, a graph model is constructed for visual object class recognition. The graph model is constructed by iteratively combining the training graphs and frequency labeling the vertices and edges. After the combining phase, all the vertices and edges whose repetition frequency is below a threshold are removed. The final graph model consists of the semantic nodes which are highly common in the training images. The recognition is based on graph matching the query image graph and the model graph. The model graph generates a vote for the query and ties are resolved by considering the node frequencies in the query and model graph. The algorithms have been applied to classify 101 object classes at one time. The results have been compared with existing state of the art approaches and are found promising. Results from above approaches show that low level image structure and other features can be used to construct different type of semantic features, which can help a model or a classifier make more intelligent decisions and work more effectively for the task compared to low level features alone. Our experimental results are comparable, or outperform other state-of-the-art approaches. We have also summarized the state-of-the-art at the time this work was finished. We conclude with a discussion about the possible future extensions.

引用

页码：103 / 112

页数：10

共 50 条

[11] Object retrieval in image databases using image composition
Philipp-Foliguet, S
Lekkat, M
[J]. XVI BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2003, : 159 - 166
[12] Using semantic commonsense resources in image retrieval
Popescu, Adrian
Grefenstette, Gregory
Moellic, Pierre-Alain
[J]. FIRST INTERNATIONAL WORKSHOP ON SEMANTIC MEDIA ADAPTATION AND PERSONALIZATION, PROCEEDINGS, 2006, : 31 - 36
[13] Improving image retrieval using semantic resources
Popescu, Adrian
Grefenstette, Gregory
Moellic, Pierre-Alain
[J]. ADVANCES IN SEMANTIC MEDIA ADAPTATION AND PERSONALIZATION, 2008, 93 : 75 - 96
[14] Unsupervised Semantic Feature Discovery for Image Object Retrieval and Tag Refinement
Kuo, Yin-Hsi
Cheng, Wen-Huang
Lin, Hsuan-Tien
Hsu, Winston H.
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (04) : 1079 - 1090
[15] Using structure for video object retrieval
Hohl, L
Souvannavong, F
Merialdo, B
Huet, B
[J]. IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2004, 3115 : 564 - 572
[16] Multilevel indexing structure for object based image retrieval
Wei, Shikui
Zhao, Yao
Zhu, Zhenfeng
[J]. 2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1215 - +
[17] ENHANCING THE PERFORMANCE OF MULTI-MODALITY ONTOLOGY SEMANTIC IMAGE RETRIEVAL USING OBJECT PROPERTIES FILTER
Sulaiman, Mohd Suffian
Nordin, Sharifalillah
Jamil, Nursuriati
[J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS, 2015, : 65 - 72
[18] Semantic Relationship-Based Image Retrieval Using KD-Tree Structure
Nguyen Thi Dinh
Thanh The Van
Thanh Manh Le
[J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT I, 2022, 13757 : 455 - 468
[19] Semantic access to a database of images:: An approach to object-related image retrieval
Martínez, A
Serra, JR
[J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 624 - 629
[20] Effective Image Object Retrieval and Tag Refinement by Augmenting Unsupervised Semantic Features
Manimegalai, M.
VanithaSivagami, S.
[J]. INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, IMAGE PROCESSING AND PATTERN RECOGNITION (ICSIPR 2013), 2013, : 234 - 237

← 1 2 3 4 5 →