A multi-stage deep adversarial network for video summarization with knowledge distillation

Authors
M. U. Sreeja
Binsu C. Kovoor
Affiliation
[1] Cochin University of Science and Technology, Division of Information Technology
Keywords
GAN; Static summaries; Dynamic summaries; Knowledge distillation; Adversarial learning; Key frame; Key segment
Abstract
Video summarization is the process of automatically identifying and extracting the content of a video that best represents the video as a whole. The proposed model implements a video summarization framework that uses a generative adversarial network (GAN) for feature extraction and knowledge distillation for key frame or key segment selection. The ideal characteristics of a video summary are diversity and representativeness. The first stage of the proposed model, based on adversarial learning, ensures that the extracted features contain diverse and representative elements from the video. The generator is a convolutional recurrent autoencoder that learns the hidden representation of the video through the reconstruction loss. The generator is followed by a discriminator that improves the generator by trying to discriminate between the original and reconstructed video samples. The adversarial network is followed by a knowledge distillation phase, which acts as a key frame or key segment selector by employing a simple network whose input data is retrieved from the preceding GAN model. Comprehensive evaluations conducted on public and custom datasets substantiate the relevance of GANs and the knowledge distillation phase for video summarization. Quantitative and qualitative evaluations further prove that the proposed model produces remarkable results, with summaries that are diverse, representative and concise.
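The knowledge distillation and selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the loss function or the selection rule, so everything below is an assumption: the code uses the generic temperature-softened distillation loss (KL divergence between teacher and student distributions) and a simple top-k rule over per-frame scores. The names `softmax`, `distillation_loss`, `select_key_frames`, `temperature` and `budget` are hypothetical and are not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over logits divided by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    Assumption: this is the generic distillation objective, not
    necessarily the loss used by the authors.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def select_key_frames(frame_scores, budget):
    """Return the indices of the `budget` highest-scoring frames,
    sorted back into temporal order (hypothetical selection rule)."""
    ranked = sorted(range(len(frame_scores)),
                    key=lambda i: frame_scores[i], reverse=True)
    return sorted(ranked[:budget])


if __name__ == "__main__":
    # Toy per-frame importance scores standing in for the student network output.
    scores = [0.1, 0.9, 0.4, 0.8]
    print(select_key_frames(scores, budget=2))  # prints [1, 3]
    # A student that matches the teacher exactly incurs zero loss.
    print(distillation_loss([1.0, 2.0, 3.0], [1.0, 2.0, 3.0]))
```

In this sketch the teacher logits stand in for whatever signal is retrieved from the trained GAN, and the student is the "simple network" mentioned in the abstract; how the two are actually connected in the paper is not stated in this record.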
Pages: 9823-9838
Page count: 15