Computational approaches to temporal sampling of video sequences

被引：27

作者：

Liu, Tiecheng ^{[1
]}

Kender, John R.

机构：

[1] Univ S Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA

[2] Columbia Univ, New York, NY 10027 USA

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2007年 / 3卷 / 02期

关键词：

algorithms; video summarization; key frame selection; video content analysis; ubiquitous media access; temporal video sampling;

D O I：

10.1145/1230812.1230813

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video key frame extraction is one of the most important research problems for video summarization, indexing, and retrieval. For a variety of applications such as ubiquitous media access and video streaming, the temporal boundaries between video key frames are required for synchronizing visual content with audio. In this article, we define temporal video sampling as a unified process of extracting video key frames and computing their temporal boundaries, and formulate it as an optimization problem. We first provide an optimal approach that minimizes temporal video sampling error using a dynamic programming process. The optimal approach retrieves a key frame hierarchy and all temporal boundaries in 0(n(4)) time and O(n(2)) space. To further reduce computational complexity, we also provide a suboptimal greedy algorithm that exploits the data structure of a binary heap and uses a novel "look-ahead" computational technique, enabling all levels of key frames to be extracted with an average-case computational time of O(n log n) and memory usage of 0 (n). Both the optimal and the greedy methods are free of parameters, thus avoiding the threshold-selection problem that exists in other approaches. We empirically compare the proposed optimal and greedy methods with several existing methods in terms of video sampling error, computational cost, and subjective quality. An evaluation of eight videos of different genres shows that the greedy approach achieves performance very close to that of the optimal approach while drastically reducing computational cost, making it suitable for processing long video sequences in large video databases.

引用

页数：23

共 50 条

[1] Temporal registration of video sequences
Cheng, H
[J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING SIGNAL, PROCESSING EDUCATION, 2003, : 489 - 492
[2] On temporal span scalability of video sequences
Tang, S
Bigdeli, A
Porat, M
Salcic, Z
[J]. 2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 881 - 884
[3] INVESTIGATION OF TEMPORAL INTEGRATION BY VIDEO SAMPLING
WILLIAMS, R
[J]. PERCEPTION, 1973, 2 (04) : 441 - 490
[4] Spatio-temporal Sampling for Video
Shankar, Mohan
Pitsiauis, Nikos P.
Brady, David
[J]. IMAGE RECONSTRUCTION FROM INCOMPLETE DATA V, 2008, 7076
[5] DETECTION OF TEMPORAL INTERPOLATION IN VIDEO SEQUENCES
Bestagini, P.
Battaglia, S.
Milani, S.
Tagliasacchi, M.
Tubaro, S.
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3033 - 3037
[6] Learning Temporal Regularity in Video Sequences
Hasan, Mahmudul
Choi, Jonghyun
Neumann, Jan
Roy-Chowdhury, Amit K.
Davis, Larry S.
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 733 - 742
[7] Temporal Segmentation of Facial Expressions in Video Sequences
Xue, Yu
Mei, Xue
Bian, Jiali
Wu, Liang
Ding, Yao
[J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 10789 - 10794
[8] Finding the optimal temporal partitioning of video sequences
Truong, BT
Venkatesh, S
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 1183 - 1186
[9] Temporal Segmentation of Human Actions in Video Sequences
Maria Carmona, Josep
Climent, Joan
[J]. PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 786 - 790
[10] Temporal alignment of video sequences for watermarking systems
Delannay, D
de Roover, C
Macq, B
[J]. SECURITY AND WATERMARKING OF MULTIMEDIA CONTENTS V, 2003, 5020 : 481 - 492

← 1 2 3 4 5 →