Computational approaches to temporal sampling of video sequences

被引:27
|
作者
Liu, Tiecheng [1 ]
Kender, John R.
机构
[1] Univ S Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] Columbia Univ, New York, NY 10027 USA
关键词
algorithms; video summarization; key frame selection; video content analysis; ubiquitous media access; temporal video sampling;
D O I
10.1145/1230812.1230813
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video key frame extraction is one of the most important research problems for video summarization, indexing, and retrieval. For a variety of applications such as ubiquitous media access and video streaming, the temporal boundaries between video key frames are required for synchronizing visual content with audio. In this article, we define temporal video sampling as a unified process of extracting video key frames and computing their temporal boundaries, and formulate it as an optimization problem. We first provide an optimal approach that minimizes temporal video sampling error using a dynamic programming process. The optimal approach retrieves a key frame hierarchy and all temporal boundaries in 0(n(4)) time and O(n(2)) space. To further reduce computational complexity, we also provide a suboptimal greedy algorithm that exploits the data structure of a binary heap and uses a novel "look-ahead" computational technique, enabling all levels of key frames to be extracted with an average-case computational time of O(n log n) and memory usage of 0 (n). Both the optimal and the greedy methods are free of parameters, thus avoiding the threshold-selection problem that exists in other approaches. We empirically compare the proposed optimal and greedy methods with several existing methods in terms of video sampling error, computational cost, and subjective quality. An evaluation of eight videos of different genres shows that the greedy approach achieves performance very close to that of the optimal approach while drastically reducing computational cost, making it suitable for processing long video sequences in large video databases.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Temporal registration of video sequences
    Cheng, H
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING SIGNAL, PROCESSING EDUCATION, 2003, : 489 - 492
  • [2] On temporal span scalability of video sequences
    Tang, S
    Bigdeli, A
    Porat, M
    Salcic, Z
    [J]. 2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 881 - 884
  • [3] INVESTIGATION OF TEMPORAL INTEGRATION BY VIDEO SAMPLING
    WILLIAMS, R
    [J]. PERCEPTION, 1973, 2 (04) : 441 - 490
  • [4] Spatio-temporal Sampling for Video
    Shankar, Mohan
    Pitsiauis, Nikos P.
    Brady, David
    [J]. IMAGE RECONSTRUCTION FROM INCOMPLETE DATA V, 2008, 7076
  • [5] DETECTION OF TEMPORAL INTERPOLATION IN VIDEO SEQUENCES
    Bestagini, P.
    Battaglia, S.
    Milani, S.
    Tagliasacchi, M.
    Tubaro, S.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3033 - 3037
  • [6] Learning Temporal Regularity in Video Sequences
    Hasan, Mahmudul
    Choi, Jonghyun
    Neumann, Jan
    Roy-Chowdhury, Amit K.
    Davis, Larry S.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 733 - 742
  • [7] Temporal Segmentation of Facial Expressions in Video Sequences
    Xue, Yu
    Mei, Xue
    Bian, Jiali
    Wu, Liang
    Ding, Yao
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 10789 - 10794
  • [8] Finding the optimal temporal partitioning of video sequences
    Truong, BT
    Venkatesh, S
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 1183 - 1186
  • [9] Temporal Segmentation of Human Actions in Video Sequences
    Maria Carmona, Josep
    Climent, Joan
    [J]. PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 786 - 790
  • [10] Temporal alignment of video sequences for watermarking systems
    Delannay, D
    de Roover, C
    Macq, B
    [J]. SECURITY AND WATERMARKING OF MULTIMEDIA CONTENTS V, 2003, 5020 : 481 - 492