Framework for Automatic Semantic Annotation of Images Based on Image's Low-Level Features and Surrounding Text

被引：1

作者：

Helmy, Tarek ^{[1
]}

Djatmiko, Fahim ^{[1
]}

机构：

[1] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Informat & Comp Sci Dept, Mail Box 413, Dhahran 31261, Saudi Arabia

来源：

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING | 2023年 / 48卷 / 02期

关键词：

Image processing; Feature extraction; Semantic annotation; MODEL;

D O I：

10.1007/s13369-022-06828-z

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Semantic annotation of images is the process of assigning metadata in the form of captions to a digital image. This is an important process for the indexing and searching of images in a big database. In this paper, we present the framework of automatic semantic annotation of images and explore the effectiveness of using it to annotate images based on both the image's low-level features and its surrounding text. In the proposed framework, the image's features have been extracted by using convolutional neural networks, while words in the surrounding text have been represented by word-embedding vectors. Both modalities are further processed using recurrent neural networks with long short-term memory cells that possess an attention mechanism to generate an annotation sentence that describes the image. Empirical evaluations of the proposed framework, acquired using a news dataset, show promising performance results and are comparable to the results of recent image annotation systems. The produced semantic annotations in free-text format can be further converted into a structured resource description framework that enables more expressive queries across a diverse source of images.

引用

页码：1991 / 2007

页数：17

共 50 条

[31] Low-level motion activity features for semantic characterization of video
Peker, KA
Alatan, AA
Akansu, AN
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 801 - 804
[32] A Direct Method for Semantic Partitioning of Low-level Image Data
Li, Zhongsheng
Huang, Tongcheng
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 108 - 111
[33] Low-Level Image Features for Stamps Detection and Classification
Forczmanski, Pawel
Markiewicz, Andrzej
PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 383 - 392
[34] Image Saliency Detection with Low-Level Features Enhancement
Zhao, Ting
Wu, Xiangqian
PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 408 - 419
[35] Automatic Semantic Annotation for Image Retrieval Based on Multiple Kernel Learning
Hou, Alin
Wu, Liang
Wang, Chongjin
Li, Fei
Guo, Junliang
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE, 2014, 101 : 647 - 651
[36] A multi-expert based framework for automatic image annotation
Bahrololoum, Abbas
Nezamabadi-pour, Hossein
PATTERN RECOGNITION, 2017, 61 : 169 - 184
[37] Soft clustering based on high- and low-level features for image smoothing
Yang, Yang
He, Tongyao
Zeng, Lanling
Zhao, Yan
Wang, Xinyu
JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
[38] Semantics-based satellite image retrieval using low-level features
Li, Y
Bretschneider, T
IGARSS 2004: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM PROCEEDINGS, VOLS 1-7: SCIENCE FOR SOCIETY: EXPLORING AND MANAGING A CHANGING PLANET, 2004, : 4406 - 4409
[39] Automatic Image Annotation by Sequentially Learning From Multi-Level Semantic Neighborhoods
Li, Houjie
Li, Wei
Zhang, Hongda
He, Xin
Zheng, Mingxiao
Song, Haiyu
IEEE ACCESS, 2021, 9 : 135742 - 135754
[40] Optimizing metrics combining low-level visual descriptors for image annotation and retrieval
Zhang, Qianni
Izquierdo, Ebroul
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1653 - 1656

← 1 2 3 4 5 →