Image Describing Based on Bidirectional LSTM and Improved Sequence Sampling

被引:0
|
作者
Li, Ji [1 ]
Shen, Yongfei [1 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
关键词
Deep Learning; Image Describing; Scheduled Sampling; Bi-LSTM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Motivated by great performance gained by Recurrent neural network applied on machine translation, people began to pay attention to image describing with related deep learning methods. Recurrent neural network can not remember long term information but Long-Short Term Memory(LSTM) can handle this well. However, the LSTM applied on image describing to predict sentences in previous literature [1] can only train and inference in the single direction. In fact, the words in a sentence not only relates to the context before but also later. In the paper, we propose a Bidirectional LSTM, it can generate sentences in both forward and backward direction with more richer information. Besides, we also improved sampling sentences. We conducted experiment on three datasets: Flickr8K, Flickr30K and MSCOCO datasets and our proposed models outperform related models.
引用
收藏
页码:735 / 739
页数:5
相关论文
共 50 条
  • [1] Describing Video With Attention-Based Bidirectional LSTM
    Bin, Yi
    Yang, Yang
    Shen, Fumin
    Xie, Ning
    Shen, Heng Tao
    Li, Xuelong
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (07) : 2631 - 2641
  • [2] Sequence-Based Recommendation with Bidirectional LSTM Network
    Fu, Hailin
    Li, Jianguo
    Chen, Jiemin
    Tang, Yong
    Zhu, Jia
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 428 - 438
  • [3] Improvement of image description using bidirectional LSTM
    Vahid Chahkandi
    Mohammad Javad Fadaeieslam
    Farzin Yaghmaee
    International Journal of Multimedia Information Retrieval, 2018, 7 : 147 - 155
  • [4] Improvement of image description using bidirectional LSTM
    Chahkandi, Vahid
    Fadaeieslam, Mohammad Javad
    Yaghmaee, Farzin
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2018, 7 (03) : 147 - 155
  • [5] An Improved Attention-based Bidirectional LSTM Model for Cyanobacterial Bloom Prediction
    Jianjun Ni
    Ruping Liu
    Guangyi Tang
    Yingjuan Xie
    International Journal of Control, Automation and Systems, 2022, 20 : 3445 - 3455
  • [6] Improved Dota2 Lineup Recommendation Model Based on a Bidirectional LSTM
    Lei Zhang
    Chenbo Xu
    Yihua Gao
    Yi Han
    Xiaojiang Du
    Zhihong Tian
    TsinghuaScienceandTechnology, 2020, 25 (06) : 712 - 720
  • [7] An Improved Attention-based Bidirectional LSTM Model for Cyanobacterial Bloom Prediction
    Ni, Jianjun
    Liu, Ruping
    Tang, Guangyi
    Xie, Yingjuan
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (10) : 3445 - 3455
  • [8] Improved Dota2 Lineup Recommendation Model Based on a Bidirectional LSTM
    Zhang, Lei
    Xu, Chenbo
    Gao, Yihua
    Han, Yi
    Du, Xiaojiang
    Tian, Zhihong
    TSINGHUA SCIENCE AND TECHNOLOGY, 2020, 25 (06) : 712 - 720
  • [9] Context-based Bidirectional-LSTM Model for Sequence Labeling in Clinical Reports
    Zhu, Henghui
    Paschalidis, Ioannis Ch.
    Tahmasebi, Amir M.
    MEDICAL IMAGING 2019: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS, 2019, 10954
  • [10] The Bidirectional Information Fusion Using an Improved LSTM Model
    Zheng, Tianwei
    Wang, Mei
    Guo, Yuan
    Wang, Zheng
    MOBILE INFORMATION SYSTEMS, 2021, 2021