Unsupervised Video Summarization with Attentive Conditional Generative Adversarial Networks

被引:51
|
作者
He, Xufeng [1 ]
Hua, Yang [2 ]
Song, Tao [1 ]
Zhang, Zongpu [1 ]
Xue, Zhengui [1 ]
Ma, Ruhui [1 ]
Robertson, Neil [2 ]
Guan, Haibing [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Queens Univ Belfast, Belfast, Antrim, North Ireland
关键词
video summarization; generative adversarial networks; video analysis; deep learning;
D O I
10.1145/3343031.3351056
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the rapid growth of video data, video summarization technique plays a key role in reducing people's efforts to explore the content of videos by generating concise but informative summaries. Though supervised video summarization approaches have been well studied and achieved state-of-the-art performance, unsupervised methods are still highly demanded due to the intrinsic difficulty of obtaining high-quality annotations. In this paper, we propose a novel yet simple unsupervised video summarization method with attentive conditional Generative Adversarial Networks (GANs). Firstly, we build our framework upon Generative Adversarial Networks in an unsupervised manner. Specifically, the generator produces high-level weighted frame features and predicts frame-level importance scores, while the discriminator tries to distinguish between weighted frame features and raw frame features. Furthermore, we utilize a conditional feature selector to guide GAN model to focus on more important temporal regions of the whole video frames. Secondly, we are the first to introduce the frame-level multi-head self-attention for video summarization, which learns long-range temporal dependencies along the whole video sequence and overcomes the local constraints of recurrent units, e.g., LSTMs. Extensive evaluations on two datasets, SumMe and TVSum, show that our proposed framework surpasses state-of-the-art unsupervised methods by a large margin, and even outperforms most of the supervised methods. Additionally, we also conduct the ablation study to unveil the influence of each component and parameter settings in our framework.
引用
收藏
页码:2296 / 2304
页数:9
相关论文
共 50 条
  • [21] Unsupervised Feature Propagation for Fast Video Object Detection Using Generative Adversarial Networks
    Zhang, Xuan
    Han, Guangxing
    He, Wenduo
    MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 617 - 627
  • [22] Shadow Detection with Conditional Generative Adversarial Networks
    Vu Nguyen
    Vicente, Tomas F. Yago
    Zhao, Maozheng
    Hoai, Minh
    Samaras, Dimitris
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4520 - 4528
  • [23] TOPOLOGY DESIGN WITH CONDITIONAL GENERATIVE ADVERSARIAL NETWORKS
    Sharpe, Conner
    Seepersad, Carolyn Conner
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2019, VOL 2A, 2020,
  • [24] Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization
    Yuan, Li
    Tay, Francis E. H.
    Li, Ping
    Zhou, Li
    Feng, Jiashi
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9143 - 9150
  • [25] Generative Attribute Controller with Conditional Filtered Generative Adversarial Networks
    Kaneko, Takuhiro
    Hiramatsu, Kaoru
    Kashino, Kunio
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7006 - 7015
  • [26] Automatic Video Colorization Using 3D Conditional Generative Adversarial Networks
    Kouzouglidis, Panagiotis
    Sfikas, Giorgos
    Nikou, Christophoros
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 209 - 218
  • [27] PathGAN: Local path planning with attentive generative adversarial networks
    Choi, Dooseop
    Han, Seung-Jun
    Min, Kyoung-Wook
    Choi, Jeongdan
    ETRI JOURNAL, 2022, 44 (06) : 1004 - 1019
  • [28] Unsupervised video summarization using deep Non-Local video summarization networks
    Zang, Sha-Sha
    Yu, Hui
    Song, Yan
    Zeng, Ru
    NEUROCOMPUTING, 2023, 519 : 26 - 35
  • [29] Unsupervised video summarization with adversarial graph-based attention network
    Gunuganti, Jeshmitha
    Yeh, Zhi-Ting
    Wang, Jenq-Haur
    Norouzi, Mehdi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 102
  • [30] Unsupervised Image Generation with Infinite Generative Adversarial Networks
    Ying, Hui
    Wang, He
    Shao, Tianjia
    Yang, Yin
    Zhou, Kun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14264 - 14273