Detecting shot boundary with sparse coding for video summarization

被引:26
|
作者
Li, Jiatong [1 ]
Yao, Ting [2 ]
Ling, Qiang [1 ]
Mei, Tao [2 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Video summarization; Shot boundary detection; Keyframe selection; Sparse coding; Dictionary learning; KEY FRAME EXTRACTION; ALGORITHM;
D O I
10.1016/j.neucom.2017.04.065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyframe selection is a common way to summarize video contents. However, delimiting shot boundaries to extract a representative keyframe from each shot is not trivial as most shot boundary techniques are heuristic and sensitive to the types of video transitions. This paper proposes a new shot boundary detection algorithm, that learns a dictionary from the given video using sparse coding and updates atoms in the dictionary, following the philosophy that different shots cannot be reconstructed using the learned dictionary. Technically, our algorithm conducts the learning by simultaneously minimizing the reconstruction loss, restricting the sparsity of the reconstruction matrix, and preserving the structure across patches and frames. Once shot boundaries are determined, one representative keyframe is selected from each shot and then a video summary is constructed by concatenating the representative keyframes through a post process. On two standard video datasets across various genres, i.e., VSUMM and YouTube datasets, our method is shown to be powerful for video summarization with superior performance over several state-of-the-art techniques. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:66 / 78
页数:13
相关论文
共 50 条
  • [21] SPARSE UNSUPERVISED CLUSTERING WITH MIXTURE OBSERVATIONS FOR VIDEO SUMMARIZATION
    Xiang, Xiang
    Tran, Dung N.
    Tran, Trac D.
    2017 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2017,
  • [22] Video summarization via block sparse dictionary selection
    Ma, Mingyang
    Mei, Shaohui
    Wan, Shuai
    Hou, Junhui
    Wang, Zhiyong
    Feng, David Dagan
    NEUROCOMPUTING, 2020, 378 : 197 - 209
  • [23] Video Summarization Based on SVD and Sparse Subspace Clustering
    Hao, Xue
    Peng, Guohua
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2017, 29 (03): : 485 - 492
  • [24] SPARSE MODELING FOR TOPIC-ORIENTED VIDEO SUMMARIZATION
    Panda, Rameswar
    Roy-Chowdhury, Amit K.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1388 - 1392
  • [25] Video Summarization via Nonlinear Sparse Dictionary Selection
    Ma, Mingyang
    Mei, Shaohui
    Wan, Shuai
    Wang, Zhiyong
    Feng, Dagan
    IEEE ACCESS, 2019, 7 : 11763 - 11774
  • [26] Nonlinear Block Sparse Dictionary Selection for Video Summarization
    Ma M.
    Mei S.
    Wan S.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2019, 53 (05): : 142 - 148
  • [27] Patch Based Video Summarization With Block Sparse Representation
    Mei, Shaohui
    Ma, Mingyang
    Wan, Shuai
    Hou, Junhui
    Wang, Zhiyong
    Feng, David Dagan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 732 - 747
  • [28] Home video summarization by shot characteristics and user's feedback
    Cooharojananone, N
    Auethavekiat, S
    Aizawa, K
    STORAGE AND RETRIEVAL METHODS AND APPLICATIONS FOR MULTIMEDIA 2004, 2004, 5307 : 388 - 395
  • [29] Video Summarization Using Shot Segmentation And Local Motion Estimation
    Sabbar, Wafae
    Chergui, Adil
    Bekkhoucha, Abdelkrim
    2012 SECOND INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2012, : 190 - 193
  • [30] Video Shot Boundary Detection: A Review
    Pal, Gautam
    Rudrapaul, Dwijen
    Acharjee, Suvojit
    Ray, Ruben
    Chakraborty, Sayan
    Dey, Nilanjan
    EMERGING ICT FOR BRIDGING THE FUTURE, VOL 2, 2015, 338 : 119 - 127