Discriminative Feature Learning for Unsupervised Video Summarization

被引:0
|
作者
Jung, Yunjae [1 ]
Cho, Donghyeon [1 ]
Kim, Dahun [1 ]
Woo, Sanghyun [1 ]
Kweon, In So [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we address the problem of unsupervised video summarization that automatically extracts key-shots from an input video. Specifically, we tackle two critical issues based on our empirical observations: (i) Ineffective feature learning due to flat distributions of output importance scores for each frame, and (ii) training difficulty when dealing with long-length video inputs. To alleviate the first problem, we propose a simple yet effective regularization loss term called variance loss. The proposed variance loss allows a network to predict output scores for each frame with high discrepancy which enables effective feature learning and significantly improves model performance. For the second problem, we design a novel two-stream network named Chunk and Stride Network (CSNet) that utilizes local (chunk) and global (stride) temporal view on the video features. Our CSNet gives better summarization results for long-length videos compared to the existing methods. In addition, we introduce an attention mechanism to handle the dynamic information in videos. We demonstrate the effectiveness of the proposed methods by conducting extensive ablation studies and show that our final model achieves new state-of-the-art results on two benchmark datasets.
引用
收藏
页码:8537 / 8544
页数:8
相关论文
共 50 条
  • [1] Endoscopy Video Summarization based on Unsupervised Learning and Feature Discrimination
    Ben Ismail, M. Maher
    Bchir, Ouiem
    Emam, Ahmed Z.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
  • [2] Unsupervised feature learning with discriminative encoder
    Pandey, Gaurav
    Dukkipati, Ambedkar
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 367 - 376
  • [3] Unsupervised Feature Learning Through Divergent Discriminative Feature Accumulation
    Szerlip, Paul A.
    Morse, Gregory
    Pugh, Justin K.
    Stanley, Kenneth O.
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2979 - 2985
  • [4] Unsupervised Video Summarization Based on the Diffusion Model of Feature Fusion
    Yu, Qinghao
    Yu, Hui
    Sun, Ying
    Ding, Derui
    Jian, Muwei
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, : 6010 - 6021
  • [5] Joint Reinforcement and Contrastive Learning for Unsupervised Video Summarization
    Zhang, Yunzuo
    Liu, Yameng
    Zhu, Pengfei
    Kang, Weili
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2587 - 2591
  • [6] Unsupervised learning of visual and semantic features for video summarization
    Huang, Yansen
    Zhong, Rui
    Yao, Wenjin
    Wang, Rui
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [7] Unsupervised Reinforcement Learning For Video Summarization Reward Function
    Wang, Lei
    Zhu, Yaping
    Pan, Hong
    [J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 40 - 44
  • [8] Local and Global Discriminative Learning for Unsupervised Feature Selection
    Du, Liang
    Shen, Zhiyong
    Li, Xuan
    Zhou, Peng
    Shen, Yi-Dong
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 131 - 140
  • [9] Discriminative Unsupervised Feature Learning with Convolutional Neural Networks
    Dosovitskiy, Alexey
    Springenberg, Jost Tobias
    Riedmiller, Martin
    Brox, Thomas
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [10] Unsupervised geographically discriminative feature learning for landmark tagging
    Zhang, Xiaoming
    Zhao, Zhonghua
    Zhang, Haijun
    Wang, Senzhang
    Li, Zhoujun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 149 : 143 - 154