Visual Attention Based on Long-Short Term Memory Model for Image Caption Generation

被引:0
|
作者
Qu, Shiru [1 ]
Xi, Yuling [1 ]
Ding, Songtao [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China
关键词
Image Caption; RNN; LSTM; CNN; Attention Mechanism;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image caption generation becomes a raising topic in computer vision and artificial intelligence. In order to solve the problem of stiff description, we intend to extract richer features using convolutional neural network (CNN). A neural and probabilistic framework has been proposed consequently which combines CNN with a special form of recurrent neural network (RNN) to produce an end-to-end image captioning. We use a model that takes advantage of word to vector to encode the variable length input into a fixed dimensional vector. Considering the description of the object in an image is not specific enough, we introduce an attention mechanism through visualization to show how the model is able to fix its gaze on salient objects. We validate our model on three benchmark datasets and get great performance by using standard evaluation metrics.
引用
收藏
页码:4789 / 4794
页数:6
相关论文
共 50 条
  • [41] Temporal Convolution-Based Long-Short Term Memory Network With Attention Mechanism for Remaining Useful Life Prediction
    Hsu, Chia-Yu
    Lu, Yi-Wei
    Yan, Jia-Hong
    [J]. IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2022, 35 (02) : 220 - 228
  • [42] Method of Rain Attenuation Prediction Based on Long-Short Term Memory Network
    Cornejo, Andres
    Landeros-Ayala, Salvador
    Matias, Jose M.
    Ortiz-Gomez, Flor
    Martinez, Ramon
    Salas-Natera, Miguel
    [J]. NEURAL PROCESSING LETTERS, 2022, 54 (04) : 2959 - 2995
  • [43] A BCI System with Motor Imagery Based on Bidirectional Long-Short Term Memory
    Lin, Jzau-Sheng
    She, Bing-Hong
    [J]. 3RD ANNUAL INTERNATIONAL CONFERENCE ON CLOUD TECHNOLOGY AND COMMUNICATION ENGINEERING, 2020, 719
  • [44] Heterogeneous Data Feature Extraction Technology based on Long-Short Term Memory
    Wang, Jiye
    Liang, Yundan
    Gao, Lingchao
    Pi, Zhixian
    Yang, Xiao
    Zhang, Huaixun
    Sun, Jiasong
    [J]. PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 141 - 144
  • [45] Prediction of Electricity Consumption Demand Based on Long-Short Term Memory Network
    Khan, Amanullah
    Maharum, Siti Marwangi Mohamad
    Harun, Faezah
    Shah, Jawad Ali
    [J]. Lecture Notes in Electrical Engineering, 2024, 1142 : 165 - 177
  • [46] A visual long-short-term memory based integrated CNN model for fabric defect image classification
    Zhao, Yudi
    Hao, Kuangrong
    He, Haibo
    Tang, Xuesong
    Wei, Bing
    [J]. NEUROCOMPUTING, 2020, 380 : 259 - 270
  • [47] Image caption generation method based on adaptive attention mechanism
    Jin, Huazhong
    Wu, Yu
    Wan, Fang
    Hu, Man
    Li, Qingqing
    [J]. MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
  • [48] CONVOLUTIONAL LONG-SHORT TERM MEMORY NETWORKS MODEL FOR LONG DURATION EEG SIGNAL CLASSIFICATION
    Baloglu, Ulas Baran
    Yildirim, Ozal
    [J]. JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2019, 19 (01)
  • [49] Spatial and long-short temporal attention correlation filters for visual tracking
    Zhao, Jianwei
    Wei, Fuyuan
    Chen, NingNing
    Zhou, Zhenghua
    [J]. IET IMAGE PROCESSING, 2022, 16 (11) : 3011 - 3024
  • [50] Multi Long-Short Term Memory Models for Short Term Traffic Flow Prediction
    Xue, Zelong
    Xue, Yang
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (12): : 3272 - 3275