Visual Attention Based on Long-Short Term Memory Model for Image Caption Generation

Cited by: 0
Authors
Qu, Shiru [1]
Xi, Yuling [1]
Ding, Songtao [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China
Keywords
Image Caption; RNN; LSTM; CNN; Attention Mechanism
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Image caption generation has become a rising topic in computer vision and artificial intelligence. To address the problem of stiff descriptions, we extract richer features using a convolutional neural network (CNN). We therefore propose a neural and probabilistic framework that combines a CNN with a special form of recurrent neural network (RNN) to produce an end-to-end image captioning model. The model uses word-to-vector embeddings to encode variable-length input into a fixed-dimensional vector. Because descriptions of objects in an image are often not specific enough, we introduce an attention mechanism and use visualization to show how the model fixes its gaze on salient objects. We validate the model on three benchmark datasets and achieve strong performance on standard evaluation metrics.
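The abstract does not state how the attention weights are computed, so the following is only a minimal sketch of generic soft attention under stated assumptions, not the authors' formulation. It assumes dot-product scoring between per-region CNN features and the decoder (LSTM) hidden state; the names `soft_attention`, `region_features`, and `hidden_state` are hypothetical and chosen for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_attention(region_features, hidden_state):
    """Weight image regions by dot-product relevance to the decoder state.

    region_features: list of per-region feature vectors (assumed to come
        from a CNN feature map; hypothetical input for this sketch).
    hidden_state: current decoder hidden state with the same dimensionality.
    Returns (weights, context); context is the weighted sum of the regions.
    """
    # Assumed scoring function: plain dot product (not confirmed by the source).
    scores = [sum(f * h for f, h in zip(region, hidden_state))
              for region in region_features]
    weights = softmax(scores)
    dim = len(hidden_state)
    context = [sum(w * region[d] for w, region in zip(weights, region_features))
               for d in range(dim)]
    return weights, context


# Toy example: three regions with 2-dimensional features.
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
state = [2.0, 0.0]
weights, context = soft_attention(regions, state)
```

In this toy case the first and third regions align with the decoder state and receive equal, larger weights than the second; plotting such weights over the image grid is one common way to visualize where a captioning model "looks" at each word.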
Pages: 4789-4794
Number of pages: 6