Video to Text Study using an Encoder-Decoder Networks Approach

被引：0

作者：

Ismael Orozco, Carlos ^{[1
]}

Elena Buemi, Maria ^{[2
]}

Jacobo Berlles, Julio ^{[2
]}

机构：

[1] Univ Nacl Salta, FCE, Dept Informat, Salta, Argentina

[2] Univ Buenos Aires, FCEyN, Dept Comp, Buenos Aires, DF, Argentina

来源：

2018 37TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC) | 2018年

关键词：

Video Summarization; Long Short-Term Memory; Deep Learning; Natural Language Processing;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The automatic generation of video description is currently a topic of interest in computer vision due to applications such as web indexation, video description for people with visual disabilities, among others. In this work we present a Neural Network architecture Encoder-Decoder. First, a Convolutional Neural Network 3D extracts the features of the input video. Then, an Long Short-Term Memory decodes the vector to automatically generate the description of the video. To perform the training and testing we use the Microsoft Video Description Corpus data set (MSVD). Evaluate the performance of our system using the challenge of COCO Image Captioning Challenge. We obtain as results 0.3984, 0.2941 and 0.5052 for the BLEU, METEOR and CIDEr metrics respectively. Competitive results compared with certificates in the bibliography.

引用

页数：5

共 50 条

[41] Weakly-Supervised Video Summarization Using Variational Encoder-Decoder and Web Prior
Cai, Sijia
Zuo, Wangmeng
Davis, Larry S.
Zhang, Lei
[J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 193 - 210
[42] NDE Data Correlation Using Encoder-Decoder Networks with Wavelet Scalogram Images
Dargahi, Mozhgan Momtaz
Lattanzi, David
Azari, Hoda
[J]. JOURNAL OF NONDESTRUCTIVE EVALUATION, 2022, 41 (04)
[43] Effective Video Summarization Using Channel Attention-Assisted Encoder-Decoder Framework
Alharbi, Faisal
Habib, Shabana
Albattah, Waleed
Jan, Zahoor
Alanazi, Meshari D.
Islam, Muhammad
[J]. SYMMETRY-BASEL, 2024, 16 (06):
[44] Dynamic video summarisation using stacked encoder-decoder architecture with residual learning network
Dhanushree, M.
Priya, R.
Aruna, P.
Bhavani, R.
[J]. INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2024, 12 (01) : 27 - 59
[45] Variational Memory Encoder-Decoder
Hung Le
Truyen Tran
Thin Nguyen
Venkatesh, Svetha
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[46] Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition
Cui, Mengmeng
Wang, Wei
Zhang, Jinjin
Wang, Liang
[J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 156 - 170
[47] Enhanced Attention-Based Encoder-Decoder Framework for Text Recognition
Prabu, S.
Sundar, K. Joseph Abraham
[J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 35 (02): : 2071 - 2086
[48] Forecasting Stock Market Using Machine Learning Approach Encoder-Decoder ConvLSTM
Iqbal, Khurum
Hassan, Ali
Ul Hassan, Syed Shah Mir
Iqbal, Shuaib
Aslam, Faheem
Mughal, Khurrum S.
[J]. 2021 INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT 2021), 2021, : 43 - 48
[49] Are all shortcuts in encoder-decoder networks beneficial for CT denoising?
Chen, Junhua
Zhang, Chong
Wee, Leonard
Dekker, Andre
Bermejo, Inigo
[J]. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (01): : 59 - 66
[50] Graph Regularized Encoder-Decoder Networks for Image Representation Learning
Yang, Shijie
Li, Liang
Wang, Shuhui
Zhang, Weigang
Huang, Qingming
Tian, Qi
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3124 - 3136

← 1 2 3 4 5 →