Image Describing Based on Bidirectional LSTM and Improved Sequence Sampling

被引:0
|
作者
Li, Ji [1 ]
Shen, Yongfei [1 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
来源
2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA) | 2017年
关键词
Deep Learning; Image Describing; Scheduled Sampling; Bi-LSTM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Motivated by great performance gained by Recurrent neural network applied on machine translation, people began to pay attention to image describing with related deep learning methods. Recurrent neural network can not remember long term information but Long-Short Term Memory(LSTM) can handle this well. However, the LSTM applied on image describing to predict sentences in previous literature [1] can only train and inference in the single direction. In fact, the words in a sentence not only relates to the context before but also later. In the paper, we propose a Bidirectional LSTM, it can generate sentences in both forward and backward direction with more richer information. Besides, we also improved sampling sentences. We conducted experiment on three datasets: Flickr8K, Flickr30K and MSCOCO datasets and our proposed models outperform related models.
引用
收藏
页码:735 / 739
页数:5
相关论文
共 50 条
  • [21] Attention-Based Convolutional LSTM for Describing Video
    Liu, Zhongyu
    Chen, Tian
    Ding, Enjie
    Liu, Yafeng
    Yu, Wanli
    IEEE ACCESS, 2020, 8 : 133713 - 133724
  • [22] Sequence SAR Image Classification Based on Bidirectional Convolution-Recurrent Network
    Bai, Xueru
    Xue, Ruihang
    Wang, Li
    Zhou, Feng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11): : 9223 - 9235
  • [23] Application of Image Classification Based on Improved LSTM in Internet Reading Therapy Platform
    Xiong, Jianxin
    Yin, Hui
    Pan, Meisen
    IEEE ACCESS, 2024, 12 : 1660 - 1671
  • [24] Deep belief improved bidirectional LSTM for multivariate time series forecasting
    Jiang, Keruo
    Huang, Zhen
    Zhou, Xinyan
    Tong, Chudong
    Zhu, Minjie
    Wang, Heshan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 16596 - 16627
  • [25] Application load forecasting method based on multi-layer bidirectional LSTM and improved PSO algorithm
    Cai L.
    Zhou H.-C.
    Bai H.
    Cai Z.-G.
    Yin K.-T.
    Bei Y.-J.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (12): : 2414 - 2422
  • [26] Enhancing THAI Image Captioning Performance using CNN and Bidirectional LSTM
    Tieancho, Witchaphon
    Phumeechanya, Sopon
    2024 21ST INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, ECTI-CON 2024, 2024,
  • [27] Investigating emotional design of the intelligent cockpit based on visual sequence data and improved LSTM
    Wang, Nanyi
    Shi, Di
    Li, Zengrui
    Chen, Pingting
    Ren, Xipei
    ADVANCED ENGINEERING INFORMATICS, 2024, 61
  • [28] Bidirectional-Convolutional LSTM Based Spectral-Spatial Feature Learning for Hyperspectral Image Classification
    Liu, Qingshan
    Zhou, Feng
    Hang, Renlong
    Yuan, Xiaotong
    REMOTE SENSING, 2017, 9 (12)
  • [29] A bidirectional LSTM-based morphological analyzer for Gujarati
    Baxi, Jatayu
    Bhatt, Brijesh
    NATURAL LANGUAGE PROCESSING, 2025, 31 (02): : 198 - 214
  • [30] Early Warning Model for Learning based on Bidirectional LSTM
    Li, Yufan
    Zhang, Huifu
    2021 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROLS (ISCSIC 2021), 2021, : 237 - 241