Detecting shot boundary with sparse coding for video summarization

Cited by: 26
Authors
Li, Jiatong [1]
Yao, Ting [2]
Ling, Qiang [1]
Mei, Tao [2]
Affiliations
[1] University of Science and Technology of China, Department of Automation, Hefei 230027, Anhui, People's Republic of China
[2] Microsoft Research Asia, Beijing 100080, People's Republic of China
Keywords
Video summarization; Shot boundary detection; Keyframe selection; Sparse coding; Dictionary learning; Key frame extraction; Algorithm
DOI
10.1016/j.neucom.2017.04.065
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Keyframe selection is a common way to summarize video content. However, delimiting shot boundaries in order to extract a representative keyframe from each shot is not trivial, as most shot boundary detection techniques are heuristic and sensitive to the type of video transition. This paper proposes a new shot boundary detection algorithm that learns a dictionary from the given video using sparse coding and updates the atoms in the dictionary, following the philosophy that a different shot cannot be reconstructed from the learned dictionary. Technically, the algorithm conducts the learning by simultaneously minimizing the reconstruction loss, restricting the sparsity of the reconstruction matrix, and preserving the structure across patches and frames. Once shot boundaries are determined, one representative keyframe is selected from each shot, and a video summary is then constructed by concatenating the representative keyframes in a post-processing step. On two standard video datasets spanning various genres, VSUMM and YouTube, the method outperforms several state-of-the-art video summarization techniques. © 2017 Elsevier B.V. All rights reserved.
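The core idea in the abstract (a frame that the current dictionary cannot reconstruct sparsely signals a new shot) can be illustrated with a much-simplified sketch. This is not the authors' algorithm: it omits the structure-preserving term and the learned atom updates, and simply uses recent frame descriptors as dictionary atoms. The function names, the ISTA solver, the threshold, and all parameter values are assumptions chosen for illustration.

```python
import numpy as np


def ista_sparse_code(D, x, lam=0.05, n_iter=100):
    """Approximately solve min_a 0.5*||x - D a||^2 + lam*||a||_1 using ISTA."""
    # Step size is 1/L, where L is the squared spectral norm of D.
    L = np.linalg.norm(D, 2) ** 2 + 1e-12
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ a - x)          # gradient of the quadratic term
        z = a - grad / L                  # gradient step
        a = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)  # soft threshold
    return a


def detect_shot_boundaries(features, threshold=0.5, max_atoms=10, lam=0.05):
    """Return indices of frames that start a new shot (hypothetical helper).

    features: (n_frames, dim) array of per-frame descriptors.
    A frame is flagged when its sparse reconstruction error against the
    current dictionary exceeds `threshold` (an assumed, untuned value).
    """
    norms = np.linalg.norm(features, axis=1, keepdims=True) + 1e-12
    feats = features / norms              # unit-normalize each descriptor
    atoms = [feats[0]]                    # dictionary starts with the first frame
    boundaries = []
    for t in range(1, len(feats)):
        D = np.stack(atoms, axis=1)       # shape (dim, n_atoms)
        a = ista_sparse_code(D, feats[t], lam=lam)
        err = np.linalg.norm(feats[t] - D @ a)
        if err > threshold:
            boundaries.append(t)          # poor reconstruction: new shot
            atoms = [feats[t]]            # restart the dictionary for the new shot
        else:
            atoms.append(feats[t])        # same shot: add the frame as an atom
            atoms = atoms[-max_atoms:]    # keep only the most recent atoms
    return boundaries
```

Calling `detect_shot_boundaries` on an array of per-frame descriptors (for example, color histograms) returns the frame indices where the reconstruction error jumps. A keyframe could then be picked from each resulting segment, for instance the frame closest to the segment mean, which is again only one possible choice.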
Pages: 66-78
Number of pages: 13