A multi-stage deep adversarial network for video summarization with knowledge distillation

被引:0
|
作者
M. U. Sreeja
Binsu C. Kovoor
机构
[1] Cochin University of Science and Technology,Division of Information Technology
关键词
GAN; Static summaries; Dynamic summaries; Knowledge distillation; Adversarial learning; Key frame; Key segment;
D O I
暂无
中图分类号
学科分类号
摘要
Video summarization is defined as the process of automatically identifying and extracting the relevant contents from a video that can best represent the contents of the video. The proposed model implements a video summarization framework based on generative adversarial network (GAN) for feature extraction and knowledge distillation for key frame or segment selection. The ideal characteristics of a video summary is diversity and representativeness. The primary stage of the proposed model based on adversarial learning ensures that the extracted features contain diverse and representative elements from the video. The generator is a convolutional recurrent autoencoder that learns the hidden representation of the video through the reconstruction loss. The generator model is followed by a discriminator that aims at improving the efficiency of the generator model by trying to discriminate between the original and reconstructed video samples. The adversarial network is followed by a knowledge distillation phase which acts as a key frame or segment selector by employing a simple network whose input data is retrieved from the preceding GAN model. Comprehensive evaluations conducted on public and custom datasets substantiate the relevance of GANs and knowledge distillation phase for video summarization. Quantitative and qualitative evaluations further prove that the proposed model produces remarkable results with summaries that are diverse, representative and concise.
引用
下载
收藏
页码:9823 / 9838
页数:15
相关论文
共 50 条
  • [1] A multi-stage deep adversarial network for video summarization with knowledge distillation
    Sreeja, M. U.
    Kovoor, Binsu C.
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 14 (8) : 9823 - 9838
  • [2] Online Knowledge Distillation Based on Multi-stage Multi-generative Adversarial Network
    Huang, Zhonghao
    Zhou, Yimin
    Yang, Xingyao
    IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,
  • [3] Multi-stage knowledge distillation for sequential recommendation with interest knowledge
    Du, Yongping
    Niu, Jinyu
    Wang, Yuxin
    Jin, Xingnan
    INFORMATION SCIENCES, 2024, 654
  • [4] Video Summarization Using Knowledge Distillation-Based Attentive Network
    Qin, Jialin
    Yu, Hui
    Liang, Wei
    Ding, Derui
    COGNITIVE COMPUTATION, 2024, 16 (03) : 1022 - 1031
  • [5] Multi-stage Knowledge Propagation Network for Recommendation
    Xue, Feng
    Zhou, Wenjie
    Hong, Zikun
    Liu, Kang
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE GRAPH EMPOWERS NEW INFRASTRUCTURE CONSTRUCTION, 2021, 1466 : 253 - 264
  • [6] Multi-Stage Raw Video Denoising with Adversarial Loss and Gradient Mask
    Paliwal, Avinash
    Zeng, Libing
    Kalantari, Nima Khademi
    2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL PHOTOGRAPHY (ICCP), 2021,
  • [7] Video summarization with a convolutional attentive adversarial network
    Liang, Guoqiang
    Lv, Yanbing
    Li, Shucheng
    Zhang, Shizhou
    Zhang, Yanning
    PATTERN RECOGNITION, 2022, 131
  • [8] Multi-stage Tag Guidance Network in Video Caption
    Wang, Lanxiao
    Shang, Chao
    Qiu, Heqian
    Zhao, Taijin
    Qiu, Benliu
    Li, Hongliang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4610 - 4614
  • [9] Temporal Coherency based Criteria for Predicting Video Frames using Deep Multi-stage Generative Adversarial Networks
    Bhattacharjee, Prateep
    Das, Sukhendu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [10] User-Ranking Video Summarization With Multi-Stage Spatio-Temporal Representation
    Huang, Siyu
    Li, Xi
    Zhang, Zhongfei
    Wu, Fei
    Han, Junwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) : 2654 - 2664