Research on the Image Description Algorithm of Double-Layer LSTM Based on Adaptive Attention Mechanism

Authors
Qin, Cifeng [1]
Gong, Wenyin [1]
Li, Xiang [1]
Affiliations
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430078, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Attention mechanisms; Double layers; Image descriptions; Image texts; Multi-modal data; Non visuals; Original model; Processing problems; Research focus; Visual word
DOI
10.1155/2022/2315341
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Image text description is a multimodal data processing problem that draws on both computer vision and natural language processing. Current research on the task centers on deep learning methods. This paper addresses the imprecise description of visual words and nonvisual words in generated image descriptions. It proposes an adaptive attention double-layer LSTM (long short-term memory) model built on the coding-decoding (encoder-decoder) framework. Compared with the original adaptive attention algorithm on the same coding-decoding framework, BLEU-1 improves by 1.21%, METEOR by 0.75%, and CIDEr by 0.55%. BLEU-4 and ROUGE-L fall short of the original model, though the difference is small. Although the proposed model does not surpass the original on every metric, it describes visual and nonvisual words more accurately in actual image text descriptions.
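The abstract does not spell out how the adaptive attention decides between visual and nonvisual words. As a rough illustration only, the sketch below assumes the commonly used "sentinel" formulation, in which an extra non-image vector competes with the image regions inside one softmax. The function names, the toy features, and the scores are all hypothetical and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(x - m) for x in scores]
    total = sum(exps)
    return [e / total for e in exps]


def adaptive_context(region_feats, region_scores, sentinel, sentinel_score):
    """Mix image-region features with a non-visual sentinel vector.

    Assumption: the sentinel is scored alongside the regions in a single
    softmax. Its weight (beta) is the share of the context that comes from
    the language side rather than from the image.
    """
    weights = softmax(region_scores + [sentinel_score])
    beta = weights[-1]
    candidates = region_feats + [sentinel]
    dim = len(sentinel)
    context = [
        sum(w * vec[d] for w, vec in zip(weights, candidates))
        for d in range(dim)
    ]
    return context, beta


# Toy inputs: two image regions and one sentinel, each a 2-dimensional vector.
regions = [[1.0, 0.0], [0.0, 1.0]]
sentinel = [0.5, 0.5]

# A "visual" word: one region scores high, the sentinel scores low.
ctx_visual, beta_visual = adaptive_context(regions, [2.0, 0.0], sentinel, -2.0)

# A "nonvisual" word: the sentinel outscores every region.
ctx_nonvisual, beta_nonvisual = adaptive_context(regions, [0.0, 0.0], sentinel, 3.0)
```

In this toy setup `beta_visual` stays close to 0 and `beta_nonvisual` close to 1, which is the behavior such a gate is meant to produce. In a double-layer LSTM decoder, the scores would typically be computed from the hidden state of one of the layers, but which layer plays that role here is not stated in this record.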
Pages: 9