Image Describing Based on Bidirectional LSTM and Improved Sequence Sampling

Cited by: 0
Authors
Li, Ji [1]
Shen, Yongfei [1]
Affiliation
[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
Source
2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA) | 2017
Keywords
Deep Learning; Image Describing; Scheduled Sampling; Bi-LSTM;
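DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract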
Motivated by the strong performance of recurrent neural networks (RNNs) in machine translation, researchers have begun to apply related deep learning methods to image describing. A plain RNN struggles to retain long-term information, whereas Long Short-Term Memory (LSTM) handles it well. However, the LSTM used to predict sentences for image describing in previous literature [1] can only be trained and run inference in a single direction. In fact, each word in a sentence relates not only to the context before it but also to the context after it. In this paper, we propose a Bidirectional LSTM that can generate sentences in both the forward and backward directions, drawing on richer information. In addition, we improve how sentences are sampled. We conducted experiments on three datasets, Flickr8K, Flickr30K, and MSCOCO, and our proposed models outperform related models.
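The keywords list Scheduled Sampling, but the abstract does not say how the authors' improved variant works. As a rough illustration only, the sketch below shows the standard form of scheduled sampling with an inverse-sigmoid decay schedule, not the paper's method. The names `teacher_forcing_prob` and `mix_inputs`, the decay constant `k`, and the toy token lists are all assumptions made for this example.

```python
import math
import random


def teacher_forcing_prob(step: int, k: float = 100.0) -> float:
    """Inverse-sigmoid decay: probability of feeding the ground-truth token.

    Starts near 1 and decays toward 0 as training progresses.
    The exponent is capped so math.exp cannot overflow for large steps.
    """
    exponent = min(step / k, 700.0)
    return k / (k + math.exp(exponent))


def mix_inputs(gold, predicted, eps, rng):
    """For each position, keep the gold token with probability eps,
    otherwise substitute the model's own previous prediction."""
    return [g if rng.random() < eps else p for g, p in zip(gold, predicted)]


# Toy data (hypothetical): a reference caption and a model's guesses.
rng = random.Random(0)
gold = ["a", "dog", "runs", "on", "grass"]
predicted = ["a", "cat", "sits", "on", "sand"]

# Early in training the decoder mostly sees gold tokens;
# late in training it mostly sees its own predictions.
early = mix_inputs(gold, predicted, teacher_forcing_prob(0), rng)
late = mix_inputs(gold, predicted, teacher_forcing_prob(2000), rng)
```

The point of the schedule is to narrow the gap between training, where the decoder is fed reference words, and inference, where it must consume its own outputs. In a bidirectional captioning model, the same mixing could plausibly be applied to the forward and backward decoders separately, though the abstract does not confirm this.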
Pages: 735 - 739
Page count: 5