Synchronization of Multiple Camera Videos Using Audio-Visual Features

被引：31

作者：

Shrestha, Prarthana ^{[1
]}

Barbieri, Mauro ^{[1
]}

Weda, Hans ^{[1
]}

Sekulovski, Dragan ^{[1
]}

机构：

[1] Philips Res Europe, NL-5656 AE Eindhoven, Netherlands

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2010年 / 12卷 / 01期

关键词：

Content analysis and synthesis; feature extraction and representation; joint media and multimodal processing;

D O I：

10.1109/TMM.2009.2036285

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Digital video capturing is getting popular with the decreasing price of camcorders and the increasing availability of devices with embedded video cameras such as digital-still cameras, mobile phones and PDAs. While a raw home video is considered as visually non-appealing, having multiple recordings of the same event provides the opportunity to combine audio and video segments from different cameras for improving quality and aesthetics. Mixing content from different recordings requires precise synchronization among the recordings. In most present applications, synchronization is done manually and considered as a very tedious task. In this paper, we propose a novel automated synchronization approach based on detecting and matching audio and video features extracted from the recorded content. We assess experimentally three realizations of this approach on a common data set and make recommendations on the usability of the different realizations in practical use cases. The realizations have no limitations on the number and movement of the cameras. Moreover, they are robust against various ambient noises and audio-visual artifacts occurring during the recordings.

引用

页码：79 / 92

页数：14

共 50 条

[1] VIDEO CAMERA IDENTIFICATION USING AUDIO-VISUAL FEATURES
Milani, S.
Cuccovillo, L.
Tagliasacchi, M.
Tubaro, S.
Aichroth, P.
[J]. 2014 5TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP 2014), 2014,
[2] Fast Seriation of Multiple Homogeneous-content Videos Using Audio-visual Features
Zeng, Yi-Chong
Chang, Wen-Tsung
[J]. INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 1157 - 1166
[3] Audio-visual events for multi-camera synchronization
Casanovas, Anna Llagostera
Cavallaro, Andrea
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (04) : 1317 - 1340
[4] Audio-visual events for multi-camera synchronization
Anna Llagostera Casanovas
Andrea Cavallaro
[J]. Multimedia Tools and Applications, 2015, 74 : 1317 - 1340
[5] Multimodal framework based on audio-visual features for summarisation of cricket videos
Javed, Ali
Irtaza, Aun
Malik, Hafiz
Mahmood, Muhammad Tariq
Adnan, Syed
[J]. IET IMAGE PROCESSING, 2019, 13 (04) : 615 - 622
[6] Summarization of Multiple News Videos Considering the Consistency of Audio-Visual Contents
Zhang, Ye
Tanishige, Ryunosuke
Ide, Ichiro
Doman, Keisuke
Kawanishi, Yasutomo
Deguchi, Daisuke
Murase, Hiroshi
[J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2019, 13 (01) : 135 - 155
[7] Multiple camera in car audio-visual speech recognition using phonetic and visemic information
Biswas, Astik
Sahu, P. K.
Chandra, Mahesh
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 2015, 47 : 35 - 50
[8] Audio-Visual Event Localization in Unconstrained Videos
Tian, Yapeng
Shi, Jing
Li, Bochen
Duan, Zhiyao
Xu, Chenliang
[J]. COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 : 252 - 268
[9] Audio-visual speech recognition using MPEGA compliant visual features
Aleksic, PS
Williams, JJ
Wu, ZL
Katsaggelos, AK
[J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2002, 2002 (11) : 1213 - 1227
[10] AUDIO-VISUAL SYNCHRONIZATION RECOVERY IN MULTIMEDIA CONTENT
Lee, Jong-Seok
Ebrahimi, Touradj
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2280 - 2283

← 1 2 3 4 5 →