Attention-Based Bidirectional Recurrent Neural Networks for Description Generation of Videos

Cited by: 0
Authors
Du, Xiaotong [1]
Yuan, Jiabin [1]
Liu, Hu [1]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
Source
CLOUD COMPUTING AND SECURITY, PT VI | 2018 / Vol. 11068
Keywords
Video description; Convolutional Neural Networks; Bidirectional Recurrent Neural Networks; Attention mechanism
DOI
10.1007/978-3-030-00021-9_40
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Describing videos in natural language is important for many applications, such as managing large online video collections and providing descriptive video service (DVS) for blind people. To improve on existing video description frameworks, this paper presents an end-to-end deep learning model that combines Convolutional Neural Networks (CNNs) and Bidirectional Recurrent Neural Networks (BiRNNs) with a multimodal attention mechanism. First, the model produces richer video representations than comparable work, covering image, motion, and audio features. Second, a BiRNN encodes these features in both the forward and backward directions. Finally, an attention-based decoder translates the encoder's sequential outputs into a sequence of words. The model is evaluated on the Microsoft Research Video Description Corpus (MSVD). The results demonstrate the necessity of combining BiRNNs with a multimodal attention mechanism and show that the model outperforms other state-of-the-art methods evaluated on this dataset.
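The encode-then-attend pipeline summarized in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the recurrent cell, the attention scoring function, or how the modalities are fused, so the sketch assumes a plain tanh RNN, additive (Bahdanau-style) attention, and a single already-fused feature vector per time step. All function names, dimensions, and random weights are hypothetical.

```python
import math
import random


def rnn_pass(features, w_in, w_rec):
    """Run a plain tanh RNN over the sequence and return every hidden state."""
    hidden = [0.0] * len(w_rec)
    states = []
    for x in features:
        hidden = [
            math.tanh(
                sum(w * xi for w, xi in zip(w_in[j], x))
                + sum(w * hi for w, hi in zip(w_rec[j], hidden))
            )
            for j in range(len(w_rec))
        ]
        states.append(hidden)
    return states


def bidirectional_encode(features, fwd_params, bwd_params):
    """Encode in both directions and concatenate the states per time step."""
    forward = rnn_pass(features, *fwd_params)
    # Run the backward RNN on the reversed sequence, then restore time order.
    backward = rnn_pass(features[::-1], *bwd_params)[::-1]
    return [f + b for f, b in zip(forward, backward)]


def attend(enc_states, dec_state, w_enc, w_dec, v):
    """Additive attention (an assumption): softmax weights and context vector."""
    scores = []
    for h in enc_states:
        proj = [
            math.tanh(
                sum(w * hi for w, hi in zip(w_enc[k], h))
                + sum(w * si for w, si in zip(w_dec[k], dec_state))
            )
            for k in range(len(v))
        ]
        scores.append(sum(vk * pk for vk, pk in zip(v, proj)))
    peak = max(scores)  # subtract the max for a numerically stable softmax
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(enc_states[0])
    context = [sum(w * h[d] for w, h in zip(weights, enc_states)) for d in range(dim)]
    return weights, context


def rand_matrix(rows, cols, rng):
    """Hypothetical random weights standing in for trained parameters."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
feat_dim, hid_dim, att_dim, dec_dim, n_steps = 6, 4, 3, 5, 8

# Stand-in for fused per-step video features (image, motion, audio).
frames = [[rng.uniform(-1, 1) for _ in range(feat_dim)] for _ in range(n_steps)]
fwd = (rand_matrix(hid_dim, feat_dim, rng), rand_matrix(hid_dim, hid_dim, rng))
bwd = (rand_matrix(hid_dim, feat_dim, rng), rand_matrix(hid_dim, hid_dim, rng))

enc = bidirectional_encode(frames, fwd, bwd)
dec_state = [0.0] * dec_dim  # initial decoder state
weights, context = attend(
    enc,
    dec_state,
    rand_matrix(att_dim, 2 * hid_dim, rng),
    rand_matrix(att_dim, dec_dim, rng),
    [rng.uniform(-0.5, 0.5) for _ in range(att_dim)],
)
```

In a full decoder, `context` would be recomputed at every output step from the current decoder state and fed into the word prediction. A multimodal variant would need separate attention over each feature stream, which this sketch does not attempt.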
Pages: 440-451
Page count: 12
Related papers
50 in total
  • [21] Dual-Stage Attention-Based Recurrent Neural Networks for Market Microstructure
    Chung, Chaeshick
    Park, Sukjin
    2021 IEEE ASIA-PACIFIC CONFERENCE ON COMPUTER SCIENCE AND DATA ENGINEERING (CSDE), 2021
  • [22] Attention-based graph neural networks: a survey
    Sun, Chengcheng
    Li, Chenhao
    Lin, Xiang
    Zheng, Tianji
    Meng, Fanrong
    Rui, Xiaobin
    Wang, Zhixiao
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 2) : 2263 - 2310
  • [24] Outcome-Oriented Predictive Process Monitoring with Attention-based Bidirectional LSTM Neural Networks
    Wang, Jiaojiao
    Yu, Dongjin
    Liu, Chengfei
    Sun, Xiaoxiao
    2019 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2019), 2019, : 360 - 367
  • [25] Attention-Based Recurrent Neural Network for Multicriteria Recommendations
    Bougteb, Yahya
    Frikh, Bouchra
    Ouhbi, Brahim
    Zemmouri, El Moukhtar
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 2, INTELLISYS 2023, 2024, 823 : 264 - 274
  • [26] Attention-Based Recurrent Neural Network for Sequence Labeling
    Li, Bofang
    Liu, Tao
    Zhao, Zhe
    Du, Xiaoyong
    WEB AND BIG DATA (APWEB-WAIM 2018), PT I, 2018, 10987 : 340 - 348
  • [27] Attention-based Recurrent Neural Network for Location Recommendation
    Xia, Bin
    Li, Yun
    Li, Qianmu
    Li, Tao
    2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017
  • [28] Wind Power Forecasting Using Attention-Based Recurrent Neural Networks: A Comparative Study
    Huang, Bin
    Liang, Yuying
    Qiu, Xiaolin
    IEEE ACCESS, 2021, 9 : 40432 - 40444
  • [29] Selecting Features from Time Series Using Attention-Based Recurrent Neural Networks
    Myller, Michal
    Kawulok, Michal
    Nalepa, Jakub
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2020, 2021, 12644 : 87 - 97
  • [30] Automated Labeling of Bugs and Tickets Using Attention-Based Mechanisms in Recurrent Neural Networks
    Lyubinets, Volodymyr
    Nicholas, Deon
    Boiko, Taras
    2018 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2018, : 271 - 275