Attention-Based Multi-Layered Encoder-Decoder Model for Summarizing Non-Interactive User-Based Videos

Cited by: 0
Authors
Tiwari, Vasudha [1]
Bhatnagar, Charul [1]
Affiliations
[1] GLA Univ, Dept CEA, Mathura, India
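Keywords
Multi-layered encoder-decoder; video summarization; attention; BiLSTM; LSTM
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract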
Video summarization extracts the relevant content from a video and presents it in a compact form. User-based video summarization tailors the summary to the requirements of the user. This work proposes a non-interactive, perception-based video summarization technique that uses an attention mechanism to capture the user's interest and extract relevant keyshots from the video in temporal order. Video summarization is formulated as a sequence-to-sequence learning problem, and a supervised method is proposed. Adding layers to an existing network makes it deeper, enables a higher level of abstraction, and facilitates better feature extraction. The proposed model therefore uses a multi-layered, deep summarization encoder-decoder network (MLAVS) with an attention mechanism to select the final keyshots from the video. The contextual information of the video frames is encoded using a multi-layered Bidirectional Long Short-Term Memory (BiLSTM) network as the encoder. For decoding, a multi-layered attention-based Long Short-Term Memory (LSTM) network with a multiplicative score function is employed. Experiments are performed on the benchmark TVSum dataset, and the results are compared with recent works. The results show considerable improvement and demonstrate the efficacy of this methodology against most of the other available state-of-the-art methods.
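The abstract names a multiplicative score function for the decoder's attention but does not give its exact form. As a minimal sketch, assuming the common "general" multiplicative form score(s, h_i) = s^T W h_i, one decoder step could look like the following. The function names, the weight matrix `W`, and the toy vectors are all hypothetical illustrations and are not taken from the paper.

```python
import math


def matvec(W, v):
    """Multiply matrix W (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def multiplicative_attention(decoder_state, encoder_states, W):
    """One attention step with an assumed score(s, h_i) = s^T W h_i.

    decoder_state: decoder (LSTM) hidden state, length d_s
    encoder_states: per-frame encoder (BiLSTM) outputs, each of length d_h
    W: learned d_s x d_h matrix (hypothetical; fixed here for illustration)
    Returns the attention weights and the weighted context vector.
    """
    scores = [dot(decoder_state, matvec(W, h)) for h in encoder_states]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    context = [
        sum(w * h[j] for w, h in zip(weights, encoder_states))
        for j in range(dim)
    ]
    return weights, context


# Toy example: three encoded frames, identity W, 2-dimensional states.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_state = [1.0, 0.0]
W = [[1.0, 0.0], [0.0, 1.0]]
weights, context = multiplicative_attention(decoder_state, encoder_states, W)
```

In this toy case the first and third frames align equally well with the decoder state, so they receive equal weight and the second frame receives less. In a trained model, `W` would be learned and the states would come from the multi-layered BiLSTM encoder and LSTM decoder.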
Pages: 1 / 13
Page count: 13