Summarizing egocentric videos using deep features and optimal clustering

Cited by: 11
Authors
Sahu, Abhimanyu [1]
Chowdhury, Ananda S. [1]
Affiliation
[1] Jadavpur Univ, Dept Elect & Telecommun Engn, Kolkata 700032, India
Keywords
Egocentric video summarization; Deep features; Center-surround model; Integer Knapsack; FRAMEWORK;
DOI
10.1016/j.neucom.2020.02.099
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we address the problem of summarizing egocentric videos using deep features and an optimal clustering approach. Each frame in an egocentric video is represented by deep features extracted with an augmented pre-trained convolutional neural network (CNN). An optimal clustering algorithm, based on a center-surround model (CSM) and an Integer Knapsack (IK) type formulation for K-means, termed CSMIK K-means, is then applied to obtain the summary. In the center-surround model, we compute the differences in entropy and in optical flow values between the central region and the surrounding region of each frame. In the integer knapsack formulation, each cluster is treated as an item whose cost is assigned from the center-surround model. A potential set of clusters in CSMIK K-means is obtained from the chi-square distance between color histograms of successive frames. CSMIK K-means evaluates different cluster formations and simultaneously determines the optimal number of clusters and the corresponding summary. Experimental evaluation on four well-known benchmark datasets clearly indicates the superiority of the proposed method over several state-of-the-art approaches. (C) 2020 Elsevier B.V. All rights reserved.
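Two of the quantities named in the abstract, the chi-square distance between color histograms of successive frames and the center-surround entropy difference, can be illustrated with a short sketch. This is not the authors' implementation: the function names, the histogram bin counts, and the choice of a central region covering half of each frame dimension are all assumptions made for illustration, and the optical flow term and the knapsack step are omitted because the abstract does not specify them.

```python
import numpy as np


def color_histogram(frame, bins=8):
    """Normalized per-channel histogram of an H x W x 3 uint8 frame.

    The bin count is an assumption; the abstract does not state it.
    """
    hists = [
        np.histogram(frame[..., c], bins=bins, range=(0, 256))[0]
        for c in range(3)
    ]
    h = np.concatenate(hists).astype(np.float64)
    return h / h.sum()


def chi_square_distance(h1, h2, eps=1e-12):
    """Symmetric chi-square distance between two normalized histograms."""
    return 0.5 * float(np.sum((h1 - h2) ** 2 / (h1 + h2 + eps)))


def entropy(values, bins=32):
    """Shannon entropy (in bits) of the intensity distribution of `values`."""
    counts = np.histogram(values, bins=bins, range=(0, 256))[0]
    p = counts[counts > 0] / counts.sum()
    return float(-np.sum(p * np.log2(p)))


def center_surround_entropy_difference(gray, frac=0.5):
    """Absolute entropy difference between a central window and the rest.

    `frac` (the window size as a fraction of each dimension) is a
    hypothetical parameter chosen for this sketch.
    """
    h, w = gray.shape
    ch, cw = int(h * frac), int(w * frac)
    top, left = (h - ch) // 2, (w - cw) // 2
    mask = np.zeros((h, w), dtype=bool)
    mask[top:top + ch, left:left + cw] = True
    return abs(entropy(gray[mask]) - entropy(gray[~mask]))
```

Under these assumptions, a large chi-square distance between the histograms of frames t and t+1 would suggest a candidate cluster boundary, and a large center-surround difference would mark a frame whose central region differs from its periphery.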
Pages: 209-221
Number of pages: 13