A Bag-of-Importance Model With Locality-Constrained Coding Based Feature Learning for Video Summarization

被引:62
|
作者
Lu, Shiyang [1 ]
Wang, Zhiyong [1 ]
Mei, Tao [2 ]
Guan, Genliang [1 ]
Feng, David Dagan [1 ]
机构
[1] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
[2] Microsoft Res, Beijing 100080, Peoples R China
关键词
Locality-constrained linear coding; sparse coding; video summarization; FRAMEWORK; SELECTION;
D O I
10.1109/TMM.2014.2319778
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video summarization helps users obtain quick comprehension of video content. Recently, some studies have utilized local features to represent each video frame and formulate video summarization as a coverage problem of local features. However, the importance of individual local features has not been exploited. In this paper, we propose a novel Bag-of-Importance (BoI) model for static video summarization by identifying the frames with important local features as keyframes, which is one of the first studies formulating video summarization at local feature level, instead of at global feature level. That is, by representing each frame with local features, a video is characterized with a bag of local features weighted with individual importance scores and the frames with more important local features are more representative, where the representativeness of each frame is the aggregation of the weighted importance of the local features contained in the frame. In addition, we propose to learn a transformation from a raw local feature to a more powerful sparse nonlinear representation for deriving the importance score of each local feature, rather than directly utilize the hand-crafted visual features like most of the existing approaches. Specifically, we first employ locality-constrained linear coding (LCC) to project each local feature into a sparse transformed space. LCC is able to take advantage of the manifold geometric structure of the high dimensional feature space and form the manifold of the low dimensional transformed space with the coordinates of a set of anchor points. Then we calculate the norm of each anchor point as the importance score of each local feature which is projected to the anchor point. Finally, the distribution of the importance scores of all the local features in a video is obtained as the BoI representation of the video. We further differentiate the importance of local features with a spatial weighting template by taking the perceptual difference among spatial regions of a frame into account. As a result, our proposed video summarization approach is able to exploit both the inter-frame and intra-frame properties of feature representations and identify keyframes capturing both the dominant content and discriminative details within a video. Experimental results on three video datasets across various genres demonstrate that the proposed approach clearly outperforms several state-of-the-art methods.
引用
收藏
页码:1497 / 1509
页数:13
相关论文
共 50 条
  • [31] A Locality-Constrained Linear Coding-Based Ensemble Learning Framework for Predicting Potentially Disease-Associated MiRNAs
    Shen, Yi
    Gao, Ying-Lian
    Li, Shu-Zhen
    Guan, Boxin
    Liu, Jin-Xing
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2022, 2022, 13760 : 295 - 302
  • [32] Locality-constrained Linear Coding based Fused Visual Features for Robust Acoustic Event Classification
    Mulimani, Manjunath
    Koolagudi, Shashidhar G.
    INTERSPEECH 2019, 2019, : 2558 - 2562
  • [33] Locality-constrained linear coding based bi-layer model for multi-view facial expression recognition
    Wu, Jianlong
    Lin, Zhouchen
    Zheng, Wenming
    Zha, Hongbin
    NEUROCOMPUTING, 2017, 239 : 143 - 152
  • [34] Locality-constrained feature space learning for cross-resolution sketch-photo face recognition
    Guangwei Gao
    Yannan Wang
    Pu Huang
    Heyou Chang
    Huimin Lu
    Dong Yue
    Multimedia Tools and Applications, 2020, 79 : 14903 - 14917
  • [35] Locality-constrained feature space learning for cross-resolution sketch-photo face recognition
    Gao, Guangwei
    Wang, Yannan
    Huang, Pu
    Chang, Heyou
    Lu, Huimin
    Yue, Dong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14903 - 14917
  • [36] Locality-constrained Linear Coding Based on Principal Components of Visual Vocabulary for Visual Object Categorization
    Wang, Hongxia
    Zeng, Long
    Peng, Dewei
    Geng, Feng
    14TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS, ENGINEERING AND SCIENCE (DCABES 2015), 2015, : 356 - 359
  • [37] Smartphone-based Human Activity Recognition in Buildings Using Locality-constrained Linear Coding
    Zhu, Qingchang
    Chen, Zhenghua
    Soh, Yeng Chai
    PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015, : 214 - 219
  • [38] LLCMDA: A Novel Method for Predicting miRNA Gene and Disease Relationship Based on Locality-Constrained Linear Coding
    Qu, Yu
    Zhang, Huaxiang
    Lyu, Chen
    Liang, Cheng
    FRONTIERS IN GENETICS, 2018, 9
  • [39] Computing object-based saliency via locality-constrained linear coding and conditional random fields
    Zhen Yang
    Huilin Xiong
    The Visual Computer, 2017, 33 : 1403 - 1413
  • [40] 3D Human Action Recognition Using a Single Depth Feature and Locality-Constrained Affine Subspace Coding
    Liang, Chengwu
    Qi, Lin
    He, Yifeng
    Guan, Ling
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2920 - 2932