Attention-Based Multi-Layered Encoder-Decoder Model for Summarizing Non-Interactive User-Based Videos

被引:0
|
作者
Tiwari, Vasudha [1 ]
Bhatnagar, Charul [1 ]
机构
[1] GLA Univ, Dept CEA, Mathura, India
关键词
Multi-layered encoder-decoder; video summarization; attention; BiLSTM; LSTM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video summarization extracts the relevant contents from a video and presents the entire content of the video in a compact and summarized form. User based video summarization, can summarize a video as per the requirement of the user. In this work, a non interactive and a perception-based video summarization technique is proposed that makes use of attention mechanism to capture user's interest and extract relevant keyshots in temporal sequence from the video content. Here, video summarization has been articulated as a sequence-to-sequence learning problem and a supervised method has been proposed for summarization of the video. Adding layers to the existing network makes it deeper, enables higher level of abstraction and facilitates better feature extraction. Therefore, the proposed model uses a multi-layered, deep summarization encoder-decoder network (MLAVS), with attention mechanism to select final keyshots from the video. The contextual information of the video frames is encoded using a multi-layered Bidirectional Long Short-Term Memory network (BiLSTM) as the encoder. To decode, a multi-layered attention-based Long Short-Term memory (LSTM) using a multiplicative score function is employed. The experiments are performed on the benchmark TVSum dataset and the results obtained are compared with recent works. The results show considerable improvement and clearly demonstrate the efficacy of this methodology against most of the other available state-of-art methods.
引用
下载
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [1] Modeling User Session and Intent with an Attention-based Encoder-Decoder Architecture
    Loyola, Pablo
    Liu, Chen
    Hirate, Yu
    PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, : 147 - 151
  • [2] Code generation from a graphical user interface via attention-based encoder-decoder model
    Chen, Wen Yin
    Podstreleny, Pavol
    Cheng, Wen-Huang
    Chen, Yung-Yao
    Hua, Kai-Lung
    MULTIMEDIA SYSTEMS, 2022, 28 (01) : 121 - 130
  • [3] Attention-based encoder-decoder networks for workflow recognition
    Min Zhang
    Haiyang Hu
    Zhongjin Li
    Jie Chen
    Multimedia Tools and Applications, 2021, 80 : 34973 - 34995
  • [4] Video Summarization With Attention-Based Encoder-Decoder Networks
    Ji, Zhong
    Xiong, Kailin
    Pang, Yanwei
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (06) : 1709 - 1717
  • [5] Attention-based encoder-decoder networks for workflow recognition
    Zhang, Min
    Hu, Haiyang
    Li, Zhongjin
    Chen, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (28-29) : 34973 - 34995
  • [6] Arabic Machine Transliteration using an Attention-based Encoder-decoder Model
    Ameur, Mohamed Seghir Hadj
    Meziane, Farid
    Guessoum, Ahmed
    ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 : 287 - 297
  • [7] Attention-Based Encoder-Decoder Model for Photovoltaic Power Generation Prediction
    Zhu, Xiang
    Hu, Juntao
    Song, Liangcai
    Suo, Guilong
    Zhan, Yong
    5TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2020), 2020, 1575
  • [8] Attention-Based Personalized Encoder-Decoder Model for Local Citation Recommendation
    Yang, Libin
    Zhang, Zeqing
    Cai, Xiaoyan
    Dai, Tao
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [9] Attention-based encoder-decoder model for answer selection in question answering
    Yuan-ping Nie
    Yi Han
    Jiu-ming Huang
    Bo Jiao
    Ai-ping Li
    Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 535 - 544
  • [10] Attention-based encoder-decoder model for answer selection in question answering
    Nie, Yuan-ping
    Han, Yi
    Huang, Jiu-ming
    Jiao, Bo
    Li, Ai-ping
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (04) : 535 - 544