Attention-Based Multi-Layered Encoder-Decoder Model for Summarizing Non-Interactive User-Based Videos

被引：0

作者：

Tiwari, Vasudha ^{[1
]}

Bhatnagar, Charul ^{[1
]}

机构：

[1] GLA Univ, Dept CEA, Mathura, India

来源：

JOURNAL OF ELECTRICAL SYSTEMS | 2024年 / 20卷 / 02期

关键词：

Multi-layered encoder-decoder; video summarization; attention; BiLSTM; LSTM;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Video summarization extracts the relevant contents from a video and presents the entire content of the video in a compact and summarized form. User based video summarization, can summarize a video as per the requirement of the user. In this work, a non interactive and a perception-based video summarization technique is proposed that makes use of attention mechanism to capture user's interest and extract relevant keyshots in temporal sequence from the video content. Here, video summarization has been articulated as a sequence-to-sequence learning problem and a supervised method has been proposed for summarization of the video. Adding layers to the existing network makes it deeper, enables higher level of abstraction and facilitates better feature extraction. Therefore, the proposed model uses a multi-layered, deep summarization encoder-decoder network (MLAVS), with attention mechanism to select final keyshots from the video. The contextual information of the video frames is encoded using a multi-layered Bidirectional Long Short-Term Memory network (BiLSTM) as the encoder. To decode, a multi-layered attention-based Long Short-Term memory (LSTM) using a multiplicative score function is employed. The experiments are performed on the benchmark TVSum dataset and the results obtained are compared with recent works. The results show considerable improvement and clearly demonstrate the efficacy of this methodology against most of the other available state-of-art methods.

引用

下载

页码：1 / 13

页数：13

共 50 条

[31] Lane-Level Heterogeneous Traffic Flow Prediction: A Spatiotemporal Attention-Based Encoder-Decoder Model
Zheng, Yan
Li, Wenquan
Zheng, Wen
Dong, Chunjiao
Wang, Shengyou
Chen, Qian
IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2023, 15 (03) : 51 - 67
[32] An attention-based row-column encoder-decoder model for text recognition in Japanese historical documents
Ly, Nam Tuan
Nguyen, Cuong Tuan
Nakagawa, Masaki
PATTERN RECOGNITION LETTERS, 2020, 136 : 134 - 141
[33] Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer
Chen, Zhengyang
Han, Bing
Wang, Shuai
Qian, Yanmin
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1636 - 1649
[34] Code generation from a graphical user interface via attention-based encoder–decoder model
Wen-Yin Chen
Pavol Podstreleny
Wen-Huang Cheng
Yung-Yao Chen
Kai-Lung Hua
Multimedia Systems, 2022, 28 (1) : 121 - 130
[35] Attention-based Encoder-Decoder Recurrent Neural Networks for HTTP Payload Anomaly Detection
Wu, Shang
Wang, Yijie
19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1452 - 1459
[36] Understanding attention-based encoder-decoder networks: a case study with chess scoresheet recognition
Hayashi, Sergio Y.
Hirata, Nina S. T.
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1586 - 1592
[37] Hybrid Attention-Based Encoder-Decoder Fully Convolutional Network for PolSAR Image Classification
Fang, Zheng
Zhang, Gong
Dai, Qijun
Xue, Biao
Wang, Peng
REMOTE SENSING, 2023, 15 (02)
[38] 3D Skeleton-Based Non-Autoregressive Human Motion Prediction Using Encoder-Decoder Attention-Based Model
Lovanshi, Mayank
Tiwari, Vivek
Ingle, Rajesh
Jain, Swati
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
[39] Automatic Generation of Chinese Couplets with Attention Based Encoder-Decoder Model
Yuan, Shengqiong
Zhong, Luo
Li, Lin
Zhang, Rui
2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 65 - 70
[40] Audio signal quality enhancement using multi-layered convolutional neural network based auto encoder-decoder
Raj, Shivangi
Prakasam, P.
Gupta, Shubham
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (02) : 425 - 437

← 1 2 3 4 5 →