Framework for Automatic Semantic Annotation of Images Based on Image's Low-Level Features and Surrounding Text

被引:1
|
作者
Helmy, Tarek [1 ]
Djatmiko, Fahim [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Informat & Comp Sci Dept, Mail Box 413, Dhahran 31261, Saudi Arabia
关键词
Image processing; Feature extraction; Semantic annotation; MODEL;
D O I
10.1007/s13369-022-06828-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Semantic annotation of images is the process of assigning metadata in the form of captions to a digital image. This is an important process for the indexing and searching of images in a big database. In this paper, we present the framework of automatic semantic annotation of images and explore the effectiveness of using it to annotate images based on both the image's low-level features and its surrounding text. In the proposed framework, the image's features have been extracted by using convolutional neural networks, while words in the surrounding text have been represented by word-embedding vectors. Both modalities are further processed using recurrent neural networks with long short-term memory cells that possess an attention mechanism to generate an annotation sentence that describes the image. Empirical evaluations of the proposed framework, acquired using a news dataset, show promising performance results and are comparable to the results of recent image annotation systems. The produced semantic annotations in free-text format can be further converted into a structured resource description framework that enables more expressive queries across a diverse source of images.
引用
收藏
页码:1991 / 2007
页数:17
相关论文
共 50 条
  • [31] Low-level motion activity features for semantic characterization of video
    Peker, KA
    Alatan, AA
    Akansu, AN
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 801 - 804
  • [32] A Direct Method for Semantic Partitioning of Low-level Image Data
    Li, Zhongsheng
    Huang, Tongcheng
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 108 - 111
  • [33] Low-Level Image Features for Stamps Detection and Classification
    Forczmanski, Pawel
    Markiewicz, Andrzej
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 383 - 392
  • [34] Image Saliency Detection with Low-Level Features Enhancement
    Zhao, Ting
    Wu, Xiangqian
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 408 - 419
  • [35] Automatic Semantic Annotation for Image Retrieval Based on Multiple Kernel Learning
    Hou, Alin
    Wu, Liang
    Wang, Chongjin
    Li, Fei
    Guo, Junliang
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE, 2014, 101 : 647 - 651
  • [36] A multi-expert based framework for automatic image annotation
    Bahrololoum, Abbas
    Nezamabadi-pour, Hossein
    PATTERN RECOGNITION, 2017, 61 : 169 - 184
  • [37] Soft clustering based on high- and low-level features for image smoothing
    Yang, Yang
    He, Tongyao
    Zeng, Lanling
    Zhao, Yan
    Wang, Xinyu
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [38] Semantics-based satellite image retrieval using low-level features
    Li, Y
    Bretschneider, T
    IGARSS 2004: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM PROCEEDINGS, VOLS 1-7: SCIENCE FOR SOCIETY: EXPLORING AND MANAGING A CHANGING PLANET, 2004, : 4406 - 4409
  • [39] Automatic Image Annotation by Sequentially Learning From Multi-Level Semantic Neighborhoods
    Li, Houjie
    Li, Wei
    Zhang, Hongda
    He, Xin
    Zheng, Mingxiao
    Song, Haiyu
    IEEE ACCESS, 2021, 9 : 135742 - 135754
  • [40] Optimizing metrics combining low-level visual descriptors for image annotation and retrieval
    Zhang, Qianni
    Izquierdo, Ebroul
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1653 - 1656