Detecting shot boundary with sparse coding for video summarization

被引:26
|
作者
Li, Jiatong [1 ]
Yao, Ting [2 ]
Ling, Qiang [1 ]
Mei, Tao [2 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Video summarization; Shot boundary detection; Keyframe selection; Sparse coding; Dictionary learning; KEY FRAME EXTRACTION; ALGORITHM;
D O I
10.1016/j.neucom.2017.04.065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyframe selection is a common way to summarize video contents. However, delimiting shot boundaries to extract a representative keyframe from each shot is not trivial as most shot boundary techniques are heuristic and sensitive to the types of video transitions. This paper proposes a new shot boundary detection algorithm, that learns a dictionary from the given video using sparse coding and updates atoms in the dictionary, following the philosophy that different shots cannot be reconstructed using the learned dictionary. Technically, our algorithm conducts the learning by simultaneously minimizing the reconstruction loss, restricting the sparsity of the reconstruction matrix, and preserving the structure across patches and frames. Once shot boundaries are determined, one representative keyframe is selected from each shot and then a video summary is constructed by concatenating the representative keyframes through a post process. On two standard video datasets across various genres, i.e., VSUMM and YouTube datasets, our method is shown to be powerful for video summarization with superior performance over several state-of-the-art techniques. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:66 / 78
页数:13
相关论文
共 50 条
  • [31] Video shot boundary detection algorithm
    Ko, Kyong-Cheol
    Cheon, Young-Min
    Kim, Gye-Young
    Choi, Hyung-Il
    Shin, Seong-Yoon
    Rhee, Yang-Won
    COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2006, 4338 : 388 - +
  • [32] Cooperative Shot Boundary Detection for Video
    Teng, Shaohua
    Tan, Wenwei
    Zhang, Wei
    COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN IV, 2008, 5236 : 99 - 110
  • [33] Video Shot Boundary Detection: A Review
    SenGupta, Ananya
    Thounaojam, Dalton Meitei
    Singh, Kh. Manglem
    Roy, Sudipta
    2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES, 2015,
  • [34] Shot Boundary Detection in Video Retrieval
    Wu, Zhonglan
    Xu, Pin
    2013 IEEE 4TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC), 2014, : 86 - 89
  • [35] Detecting shot transitions for video indexing with FAM
    Jang, SW
    Kim, GY
    Choi, HI
    ARTIFICIAL NEURAL NETWORKS-ICANN 2001, PROCEEDINGS, 2001, 2130 : 1020 - 1025
  • [36] Detecting shot transitions based on video content
    Clua, E.
    Fonseca, M. S.
    Conci, A.
    Montenegro, A.
    PROCEEDINGS OF IWSSIP 2008: 15TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, 2008, : 339 - 342
  • [37] Similarity Based Block Sparse Subset Selection for Video Summarization
    Ma, Mingyang
    Mei, Shaohui
    Wan, Shuai
    Wang, Zhiyong
    Feng, David Dagan
    Bennamoun, Mohammed
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) : 3967 - 3980
  • [38] User-Generated-Video Summarization using Sparse Modelling
    Liu, Yulong
    Liu, Huaping
    Liu, Yunhui
    Sun, Fuchun
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3909 - 3915
  • [39] Unleashing the Power of Contrastive Learning for Zero-Shot Video Summarization
    Pang, Zongshang
    Nakashima, Yuta
    Otani, Mayu
    Nagahara, Hajime
    JOURNAL OF IMAGING, 2024, 10 (09)
  • [40] Efficient Video Shot Summarization Using an Enhanced Spectral Clustering Approach
    Chasanis, Vasilcios
    Likas, Aristidis
    Galatsanos, Nikolaos
    ARTIFICIAL NEURAL NETWORKS - ICANN 2008, PT I, 2008, 5163 : 847 - 856