Video Description Using Bidirectional Recurrent Neural Networks

Cited by: 18
Authors
Peris, Alvaro [1]
Bolanos, Marc [2, 3]
Radeva, Petia [2, 3]
Casacuberta, Francisco [1]
Affiliations
[1] Univ Politecn Valencia, PRHLT Res Ctr, Valencia, Spain
[2] Univ Barcelona, Barcelona, Spain
[3] Comp Vision Ctr, Bellaterra, Spain
Keywords
Video description; Neural Machine Translation; Bidirectional Recurrent Neural Networks; LSTM; Convolutional Neural Networks
DOI
10.1007/978-3-319-44781-0_1
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Although traditionally used in machine translation, the encoder-decoder framework has recently been applied to the generation of video and image descriptions. Models that combine Convolutional and Recurrent Neural Networks have been shown to outperform the previous state of the art, producing more accurate video descriptions. In this work we propose to push this model further by introducing two contributions into the encoding stage: first, producing richer image representations by combining object and location information from Convolutional Neural Networks; and second, introducing Bidirectional Recurrent Neural Networks to capture both forward and backward temporal relationships in the input frames.
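The encoding stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that "combining" object and location information means concatenating two per-frame feature vectors, it swaps the LSTM for a plain tanh recurrent cell to stay short, and it uses random, untrained weights. All function names, sizes, and input values are hypothetical.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Small random weight matrix (illustrative initialisation, not trained)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def rnn_pass(inputs, w_in, w_rec, hidden_size):
    """Run a plain tanh RNN over `inputs`; return one hidden state per step.

    Assumption: a simple recurrent cell stands in for the LSTM named in the
    keywords, purely to keep the sketch short.
    """
    h = [0.0] * hidden_size
    states = []
    for x in inputs:
        a = matvec(w_in, x)
        b = matvec(w_rec, h)
        h = [math.tanh(ai + bi) for ai, bi in zip(a, b)]
        states.append(h)
    return states


def bidirectional_encode(object_feats, location_feats, hidden_size=4, seed=0):
    """Encode a frame sequence with a forward and a backward recurrent pass."""
    rng = random.Random(seed)
    # Assumption: per-frame object and location features are concatenated.
    frames = [o + l for o, l in zip(object_feats, location_feats)]
    input_size = len(frames[0])
    fwd_in = make_matrix(hidden_size, input_size, rng)
    fwd_rec = make_matrix(hidden_size, hidden_size, rng)
    bwd_in = make_matrix(hidden_size, input_size, rng)
    bwd_rec = make_matrix(hidden_size, hidden_size, rng)
    forward = rnn_pass(frames, fwd_in, fwd_rec, hidden_size)
    # The backward pass reads the frames last-to-first; its states are then
    # reversed again so that index t lines up with frame t.
    backward = rnn_pass(frames[::-1], bwd_in, bwd_rec, hidden_size)[::-1]
    # Each frame is represented by its forward and backward states together.
    return [f + b for f, b in zip(forward, backward)]


# Hypothetical toy input: 3 frames, 3 object features and 2 location features.
object_feats = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6], [0.7, 0.8, 0.9]]
location_feats = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
encoded = bidirectional_encode(object_feats, location_feats)
print(len(encoded), len(encoded[0]))  # prints: 3 8
```

In this sketch the first half of each output vector summarises the frames seen so far, and the second half summarises the frames still to come, which is what lets a decoder draw on both forward and backward temporal context.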
Pages: 3-11
Number of pages: 9