An Image Caption Model Based on the Scene Graph and Semantic Prior Network

被引:0
|
作者
Liu, Weifeng [1 ]
Zhang, Nan [1 ]
Wang, Yaning [2 ]
Di, Wu [3 ]
机构
[1] Shaanxi Univ Sci & Technol, Sch Elect & Control Engn, Xian, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou, Peoples R China
[3] Shaanxi Univ Chinese Med, Sch Basic Med Sci, Xianyang, Shaanxi, Peoples R China
来源
2022 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS) | 2022年
关键词
Image caption; scene graph; semantic prior; memory network;
D O I
10.1109/ICCAIS56082.2022.9990458
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose an image caption model based on scene graphs and semantic priors to address the problem of traditional image caption models that are overly dependent on training data. First, the original image features and the scene graph features are fused by embedding the scene image features into the feature space. Second, using image captions from the existing dataset, the sentence reconstruction task is used to train the memory network to retain semantic prior knowledge. The scene graph features are then combined with semantic prior information to reconstruct the new features, which are then sent into the Decoder to produce an image caption.
引用
收藏
页码:60 / 66
页数:7
相关论文
共 50 条
  • [21] Topic Scene Graph Generation by Attention Distillation from Caption
    Wang, Wenbin
    Wang, Ruiping
    Chen, Xilin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15880 - 15890
  • [22] SPICE: Semantic Propositional Image Caption Evaluation
    Anderson, Peter
    Fernando, Basura
    Johnson, Mark
    Gould, Stephen
    COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 382 - 398
  • [23] Model-based inexact graph matching on top of DNNs for semantic scene understanding
    Chopin, Jeremy
    Fasquel, Jean-Baptiste
    Mouchere, Harold
    Dahyot, Rozenn
    Bloch, Isabelle
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 235
  • [24] Natural Scene Retrieval Based on Graph Semantic Similarity for Adaptive Scene Classification
    Jamil, Nuraini
    Kang, Sanggil
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: SEMANTIC WEB, SOCIAL NETWORKS AND MULTIAGENT SYSTEMS, 2009, 5796 : 676 - 684
  • [25] An Underwater Image Color Correction Algorithm Based on Underwater Scene Prior and Residual Network
    Huang, Mengxing
    Ye, Jinjin
    Zhu, Shenghan
    Chen, Yang
    Wu, Yuanyuan
    Wu, Di
    Feng, Siling
    Shu, Feng
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT II, 2022, 13339 : 129 - 139
  • [26] Scene Emotion Detection using closed caption based on Hierarchical Attention Network
    Kwak, Chang-Uk
    Son, Jeong-Woo
    Lee, Alex
    Kim, Sun-Joong
    2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 1206 - 1208
  • [27] CGNN: Caption-assisted graph neural network for image-text retrieval
    Hu, Yongli
    Zhang, Hanfu
    Jiang, Huajie
    Bi, Yandong
    Yin, Baocai
    PATTERN RECOGNITION LETTERS, 2022, 161 : 137 - 142
  • [28] A Novel Image Caption Model Based on Transformer Structure
    Wang, Shuang
    Zhu, Yaping
    2021 IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SOFTWARE ENGINEERING (ICICSE 2021), 2021, : 144 - 148
  • [29] A Bayesian Scene-Prior-Based Deep Network Model for Face Verification
    Wang, Huafeng
    Song, Wenfeng
    Liu, Wanquan
    Song, Ning
    Wang, Yuehai
    Pan, Haixia
    SENSORS, 2018, 18 (06)
  • [30] A graph-based sensor recommendation model in semantic sensor network
    Chen, Yuanyi
    Lin, Yihao
    Yu, Peng
    Tao, Yanyun
    Zheng, Zengwei
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2022, 18 (05):