Video to Text Study using an Encoder-Decoder Networks Approach

Cited by: 0
Authors
Ismael Orozco, Carlos [1]
Elena Buemi, Maria [2]
Jacobo Berlles, Julio [2]
Affiliations
[1] Univ Nacl Salta, FCE, Dept Informat, Salta, Argentina
[2] Univ Buenos Aires, FCEyN, Dept Comp, Buenos Aires, DF, Argentina
Keywords
Video Summarization; Long Short-Term Memory; Deep Learning; Natural Language Processing
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
The automatic generation of video descriptions is currently a topic of interest in computer vision because of applications such as web indexing and video description for people with visual disabilities. In this work we present an encoder-decoder neural network architecture. First, a 3D Convolutional Neural Network extracts features from the input video. Then, a Long Short-Term Memory network decodes the resulting feature vector to automatically generate a description of the video. For training and testing we use the Microsoft Video Description Corpus (MSVD) data set. We evaluate the performance of our system using the evaluation of the COCO Image Captioning Challenge. We obtain 0.3984, 0.2941 and 0.5052 for the BLEU, METEOR and CIDEr metrics, respectively. These results are competitive with those reported in the literature.
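The abstract reports a BLEU score without saying how it is computed. As a rough illustration only, the following Python sketch shows sentence-level BLEU built from clipped n-gram precision and a brevity penalty. It is not the COCO evaluation toolkit used by the authors, which aggregates counts over the whole corpus; the function names and the example sentences are hypothetical, and no smoothing is applied.

```python
from collections import Counter
import math


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, references, n):
    """Clipped n-gram precision: a candidate n-gram is credited at most as
    many times as it occurs in the single reference where it is most frequent."""
    cand_counts = ngrams(candidate, n)
    if not cand_counts:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def brevity_penalty(candidate, references):
    """Penalize candidates shorter than the closest-length reference."""
    c = len(candidate)
    if c == 0:
        return 0.0
    # Closest reference length; ties are resolved toward the shorter reference.
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    return 1.0 if c > r else math.exp(1 - r / c)


def sentence_bleu(candidate, references, max_n=4):
    """Geometric mean of the 1..max_n clipped precisions times the brevity penalty.
    Unsmoothed: returns 0.0 as soon as any n-gram order has no match."""
    precisions = [modified_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0
    log_avg = sum(math.log(p) for p in precisions) / max_n
    return brevity_penalty(candidate, references) * math.exp(log_avg)


# Made-up example: one generated caption scored against two reference captions.
candidate = "a man is playing a guitar".split()
references = ["a man is playing the guitar".split(),
              "someone plays a guitar".split()]
unigram_precision = modified_precision(candidate, references, 1)
bigram_precision = modified_precision(candidate, references, 2)
```

In this example the second "a" in the candidate is not credited, because no single reference contains "a" twice; that clipping is what keeps a caption from scoring well by repeating common words.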
Pages: 5