An Image Caption Model Based on the Scene Graph and Semantic Prior Network

Cited by: 0

Authors:

Liu, Weifeng [1]

Zhang, Nan [1]

Wang, Yaning [2]

Di, Wu [3]

Affiliations:

[1] Shaanxi Univ Sci & Technol, Sch Elect & Control Engn, Xian, Peoples R China

[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou, Peoples R China

[3] Shaanxi Univ Chinese Med, Sch Basic Med Sci, Xianyang, Shaanxi, Peoples R China

Source:

2022 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS) | 2022

Keywords:

Image caption; scene graph; semantic prior; memory network

DOI:

10.1109/ICCAIS56082.2022.9990458

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

In this paper, we propose an image caption model based on scene graphs and semantic priors to address the problem of traditional image caption models that are overly dependent on training data. First, the original image features and the scene graph features are fused by embedding the scene image features into the feature space. Second, using image captions from the existing dataset, the sentence reconstruction task is used to train the memory network to retain semantic prior knowledge. The scene graph features are then combined with semantic prior information to reconstruct the new features, which are then sent into the Decoder to produce an image caption.
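
The pipeline outlined in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no architectural details, so the linear projections, the dot-product attention read, the concatenation step, every dimension, and all names (`fuse_features`, `read_memory`, `reconstruct`) are assumptions made for illustration. The memory is filled with random values here, whereas the paper trains it through a sentence reconstruction task, and the sketch queries the memory with the fused features, which is one possible reading of the abstract.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def fuse_features(image_feats, graph_feats, w_img, w_graph):
    """Project both feature sets into one shared space, then stack the rows."""
    return matmul(image_feats, w_img) + matmul(graph_feats, w_graph)

def read_memory(queries, memory):
    """Attention read: each query receives a weighted mix of memory slots."""
    scale = math.sqrt(len(memory[0]))
    keys_t = [list(col) for col in zip(*memory)]  # memory transposed
    scores = matmul(queries, keys_t)
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, memory), weights

def reconstruct(fused, memory):
    """Concatenate each fused feature with the prior vector read for it."""
    prior, weights = read_memory(fused, memory)
    return [f + p for f, p in zip(fused, prior)], weights

def random_matrix(rng, rows, cols):
    """Stand-in for real features or learned parameters (assumption)."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
image_feats = random_matrix(rng, 5, 16)  # 5 image regions, 16-d (made-up sizes)
graph_feats = random_matrix(rng, 3, 8)   # 3 scene graph nodes, 8-d (made-up sizes)
w_img = random_matrix(rng, 16, 4)        # hypothetical projection into a 4-d space
w_graph = random_matrix(rng, 8, 4)
memory = random_matrix(rng, 6, 4)        # 6 slots; random here, trained in the paper

fused = fuse_features(image_feats, graph_feats, w_img, w_graph)
decoder_input, weights = reconstruct(fused, memory)
print(len(fused), len(fused[0]), len(decoder_input[0]))  # prints: 8 4 8
```

In this sketch `decoder_input` stands in for the reconstructed features that would be passed to a caption decoder; the decoder itself is omitted because the record does not describe it.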

Pages: 60-66

Page count: 7
