Synchronization of Multiple Camera Videos Using Audio-Visual Features

被引:31
|
作者
Shrestha, Prarthana [1 ]
Barbieri, Mauro [1 ]
Weda, Hans [1 ]
Sekulovski, Dragan [1 ]
机构
[1] Philips Res Europe, NL-5656 AE Eindhoven, Netherlands
关键词
Content analysis and synthesis; feature extraction and representation; joint media and multimodal processing;
D O I
10.1109/TMM.2009.2036285
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Digital video capturing is getting popular with the decreasing price of camcorders and the increasing availability of devices with embedded video cameras such as digital-still cameras, mobile phones and PDAs. While a raw home video is considered as visually non-appealing, having multiple recordings of the same event provides the opportunity to combine audio and video segments from different cameras for improving quality and aesthetics. Mixing content from different recordings requires precise synchronization among the recordings. In most present applications, synchronization is done manually and considered as a very tedious task. In this paper, we propose a novel automated synchronization approach based on detecting and matching audio and video features extracted from the recorded content. We assess experimentally three realizations of this approach on a common data set and make recommendations on the usability of the different realizations in practical use cases. The realizations have no limitations on the number and movement of the cameras. Moreover, they are robust against various ambient noises and audio-visual artifacts occurring during the recordings.
引用
收藏
页码:79 / 92
页数:14
相关论文
共 50 条
  • [1] VIDEO CAMERA IDENTIFICATION USING AUDIO-VISUAL FEATURES
    Milani, S.
    Cuccovillo, L.
    Tagliasacchi, M.
    Tubaro, S.
    Aichroth, P.
    [J]. 2014 5TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP 2014), 2014,
  • [2] Fast Seriation of Multiple Homogeneous-content Videos Using Audio-visual Features
    Zeng, Yi-Chong
    Chang, Wen-Tsung
    [J]. INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 1157 - 1166
  • [3] Audio-visual events for multi-camera synchronization
    Casanovas, Anna Llagostera
    Cavallaro, Andrea
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (04) : 1317 - 1340
  • [4] Audio-visual events for multi-camera synchronization
    Anna Llagostera Casanovas
    Andrea Cavallaro
    [J]. Multimedia Tools and Applications, 2015, 74 : 1317 - 1340
  • [5] Multimodal framework based on audio-visual features for summarisation of cricket videos
    Javed, Ali
    Irtaza, Aun
    Malik, Hafiz
    Mahmood, Muhammad Tariq
    Adnan, Syed
    [J]. IET IMAGE PROCESSING, 2019, 13 (04) : 615 - 622
  • [6] Summarization of Multiple News Videos Considering the Consistency of Audio-Visual Contents
    Zhang, Ye
    Tanishige, Ryunosuke
    Ide, Ichiro
    Doman, Keisuke
    Kawanishi, Yasutomo
    Deguchi, Daisuke
    Murase, Hiroshi
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2019, 13 (01) : 135 - 155
  • [7] Multiple camera in car audio-visual speech recognition using phonetic and visemic information
    Biswas, Astik
    Sahu, P. K.
    Chandra, Mahesh
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2015, 47 : 35 - 50
  • [8] Audio-Visual Event Localization in Unconstrained Videos
    Tian, Yapeng
    Shi, Jing
    Li, Bochen
    Duan, Zhiyao
    Xu, Chenliang
    [J]. COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 : 252 - 268
  • [9] Audio-visual speech recognition using MPEGA compliant visual features
    Aleksic, PS
    Williams, JJ
    Wu, ZL
    Katsaggelos, AK
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2002, 2002 (11) : 1213 - 1227
  • [10] AUDIO-VISUAL SYNCHRONIZATION RECOVERY IN MULTIMEDIA CONTENT
    Lee, Jong-Seok
    Ebrahimi, Touradj
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2280 - 2283