A Bag-of-Importance Model With Locality-Constrained Coding Based Feature Learning for Video Summarization

被引:62
|
作者
Lu, Shiyang [1 ]
Wang, Zhiyong [1 ]
Mei, Tao [2 ]
Guan, Genliang [1 ]
Feng, David Dagan [1 ]
机构
[1] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
[2] Microsoft Res, Beijing 100080, Peoples R China
关键词
Locality-constrained linear coding; sparse coding; video summarization; FRAMEWORK; SELECTION;
D O I
10.1109/TMM.2014.2319778
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video summarization helps users obtain quick comprehension of video content. Recently, some studies have utilized local features to represent each video frame and formulate video summarization as a coverage problem of local features. However, the importance of individual local features has not been exploited. In this paper, we propose a novel Bag-of-Importance (BoI) model for static video summarization by identifying the frames with important local features as keyframes, which is one of the first studies formulating video summarization at local feature level, instead of at global feature level. That is, by representing each frame with local features, a video is characterized with a bag of local features weighted with individual importance scores and the frames with more important local features are more representative, where the representativeness of each frame is the aggregation of the weighted importance of the local features contained in the frame. In addition, we propose to learn a transformation from a raw local feature to a more powerful sparse nonlinear representation for deriving the importance score of each local feature, rather than directly utilize the hand-crafted visual features like most of the existing approaches. Specifically, we first employ locality-constrained linear coding (LCC) to project each local feature into a sparse transformed space. LCC is able to take advantage of the manifold geometric structure of the high dimensional feature space and form the manifold of the low dimensional transformed space with the coordinates of a set of anchor points. Then we calculate the norm of each anchor point as the importance score of each local feature which is projected to the anchor point. Finally, the distribution of the importance scores of all the local features in a video is obtained as the BoI representation of the video. We further differentiate the importance of local features with a spatial weighting template by taking the perceptual difference among spatial regions of a frame into account. As a result, our proposed video summarization approach is able to exploit both the inter-frame and intra-frame properties of feature representations and identify keyframes capturing both the dominant content and discriminative details within a video. Experimental results on three video datasets across various genres demonstrate that the proposed approach clearly outperforms several state-of-the-art methods.
引用
收藏
页码:1497 / 1509
页数:13
相关论文
共 50 条
  • [1] A BAG-OF-IMPORTANCE MODEL FOR VIDEO SUMMARIZATION
    Lu, Shiyang
    Wang, Zhiyong
    Song, Yuan
    Mei, Tao
    Feng, David Dagan
    ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,
  • [2] Locality-Constrained Discriminative Learning and Coding
    Wang, Shuyang
    Fu, Yun
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [3] Spatio-temporal Video Representation with Locality-Constrained Linear Coding
    Al Ghamdi, Manal
    Al Harbi, Nouf
    Gotoh, Yoshihiko
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 101 - 110
  • [4] SEMI-SUPERVISED LEARNING WITH KERNEL LOCALITY-CONSTRAINED LINEAR CODING
    Chang, Yao-Jen
    Chen, Tsuhan
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [5] CSIFT based locality-constrained linear coding for image classification
    Chen, Junzhou
    Li, Qing
    Peng, Qiang
    Wong, Kin Hong
    PATTERN ANALYSIS AND APPLICATIONS, 2015, 18 (02) : 441 - 450
  • [6] Human action recognition based on locality-constrained linear coding
    School of Instrumentation Science and Opto-electronics Engineering, Beijing University of Aeronautics and Astronautics, Beijing
    100191, China
    Beijing Hangkong Hangtian Daxue Xuebao, 6 (1122-1127):
  • [7] Learning Locality-Constrained Sparse Coding for Spectral Enhancement of Multispectral Imagery
    Hong, Danfeng
    Wu, Xin
    Gao, Lianru
    Zhang, Bing
    Chanussot, Jocelyn
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [8] CSIFT based locality-constrained linear coding for image classification
    Junzhou Chen
    Qing Li
    Qiang Peng
    Kin Hong Wong
    Pattern Analysis and Applications, 2015, 18 : 441 - 450
  • [9] Robust Visual Tracking via a Collaborative Model Based on Locality-Constrained Sparse Coding
    Hu, Jia
    Fan, Xiaoping
    IEEE ACCESS, 2020, 8 : 76737 - 76751
  • [10] WCE polyp detection based on novel feature descriptor with normalized variance locality-constrained linear coding
    Yang, Jianjun
    Chang, Liping
    Li, Sheng
    He, Xiongxiong
    Zhu, Tingwei
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2020, 15 (08) : 1291 - 1302