An Image Caption Model Based on the Scene Graph and Semantic Prior Network

被引:0
|
作者
Liu, Weifeng [1 ]
Zhang, Nan [1 ]
Wang, Yaning [2 ]
Di, Wu [3 ]
机构
[1] Shaanxi Univ Sci & Technol, Sch Elect & Control Engn, Xian, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou, Peoples R China
[3] Shaanxi Univ Chinese Med, Sch Basic Med Sci, Xianyang, Shaanxi, Peoples R China
来源
2022 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS) | 2022年
关键词
Image caption; scene graph; semantic prior; memory network;
D O I
10.1109/ICCAIS56082.2022.9990458
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose an image caption model based on scene graphs and semantic priors to address the problem of traditional image caption models that are overly dependent on training data. First, the original image features and the scene graph features are fused by embedding the scene image features into the feature space. Second, using image captions from the existing dataset, the sentence reconstruction task is used to train the memory network to retain semantic prior knowledge. The scene graph features are then combined with semantic prior information to reconstruct the new features, which are then sent into the Decoder to produce an image caption.
引用
收藏
页码:60 / 66
页数:7
相关论文
共 50 条
  • [31] Towards Traffic Scene Description: The Semantic Scene Graph
    Zipfl, Maximilian
    Zoellner, J. Marius
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 3748 - 3755
  • [32] Image-Caption Model Based on Fusion Feature
    Geng, Yaogang
    Mei, Hongyan
    Xue, Xiaorong
    Zhang, Xing
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [33] Image caption generation method based on an interaction mechanism and scene concept selection module
    Zhang, Liping
    Lu, Qin
    2021 IEEE 32ND INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2021), 2021, : 141 - 148
  • [34] Image Caption Generation with Local Semantic and Global Information
    Liu, Xing
    Liu, Weibin
    Xing, Weiwei
    2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 680 - 685
  • [35] Controllable Image Caption Generation Based on Encoder-decoder for Power Construction Scene
    Yang R.
    Shao J.
    Luo Y.
    Bai W.
    Dianwang Jishu/Power System Technology, 2022, 46 (07): : 2572 - 2580
  • [36] RESEARCH OF SEMANTIC-BASED SCENE IMAGE RETRIEVAL
    Jiang, Derong
    Hu, Jianfeng
    PROCEEDINGS OF THE 38TH INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2008, : 992 - 995
  • [37] Adversarial Image Caption Generator Network
    Dehaqi A.M.
    Seydi V.
    Madadi Y.
    SN Computer Science, 2021, 2 (3)
  • [38] Modeling coverage with semantic embedding for image caption generation
    Jiang, Teng
    Zhang, Zehan
    Yang, Yupu
    VISUAL COMPUTER, 2019, 35 (11): : 1655 - 1665
  • [39] Modeling coverage with semantic embedding for image caption generation
    Teng Jiang
    Zehan Zhang
    Yupu Yang
    The Visual Computer, 2019, 35 : 1655 - 1665
  • [40] Semantic Scene Graph Generation Using RDF Model and Deep Learning
    Kim, Seongyong
    Jeon, Tae Hyeon
    Rhiu, Ilsun
    Ahn, Jinhyun
    Im, Dong-Hyuk
    APPLIED SCIENCES-BASEL, 2021, 11 (02): : 1 - 12