Cost-Effective Solution to Synchronized Audio-Visual Capture using Multiple Sensors

被引:10
|
作者
Lichtenauer, Jeroen [1 ]
Valstar, Michel [1 ]
Shen, Jie [1 ]
Pantic, Maja [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Comp, London SW7 2AZ, England
关键词
Video recording; Audio recording; Multisensor systems; Synchronization;
D O I
10.1109/AVSS.2009.92
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Applications such as surveillance and human motion capture require high-bandwidth recording from multiple cameras. Furthermore, the recent increase in research on sensor fusion has raised the demand on synchronization accuracy between video, audio and other sensor modalities. Previously, capturing synchronized, high resolution video from multiple cameras required complex, inflexible and expensive solutions. Our experiments show that a single PC, built from contemporary low-cost computer hardware, could currently handle up to 470MB/s of input data. This allows capturing from 18 cameras of 780x580pixels at 60fps each, or 36 cameras at 30fps. Furthermore, we achieve accurate synchronization between audio, video and additional sensors, by recording audio together with sensor trigger- or timestamp signals, using a multi-channel audio input. In this way, each sensor modality can be captured with separate software and hardware, allowing maximal flexibility with minimal cost.
引用
收藏
页码:324 / 329
页数:6
相关论文
共 50 条
  • [21] COST-EFFECTIVE SENSORS IN ROBOTIC APPLICATION
    RABIE, AM
    NAGI, F
    ROBOTS 13: CONFERENCE PROCEEDINGS, 1989, : G27 - G33
  • [22] Multimodal Learning Using 3D Audio-Visual Data or Audio-Visual Speech Recognition
    Su, Rongfeng
    Wang, Lan
    Liu, Xunying
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 40 - 43
  • [23] COST-EFFECTIVE METHOD USING FORCE SENSORS FOR CHIROPRACTIC TEACHING
    Shah, Iti
    Butler, Carolyn
    Salman, Muhamad
    PROCEEDINGS OF ASME 2023 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2023, VOL 5, 2023,
  • [24] Toward More Effective Utilization of Audio-Visual Materials and Devices
    Witt, Paul W. F.
    TEACHERS COLLEGE RECORD, 1947, 49 (02): : 108 - 118
  • [25] Accumulation and decay of visual capture and the ventriloquism aftereffect caused by brief audio-visual disparities
    Adam K. Bosen
    Justin T. Fleming
    Paul D. Allen
    William E. O‘Neill
    Gary D. Paige
    Experimental Brain Research, 2017, 235 : 585 - 595
  • [26] Accumulation and decay of visual capture and the ventriloquism aftereffect caused by brief audio-visual disparities
    Bosen, Adam K.
    Fleming, Justin T.
    Allen, Paul D.
    O'Neill, William E.
    Paige, Gary D.
    EXPERIMENTAL BRAIN RESEARCH, 2017, 235 (02) : 585 - 595
  • [27] Visual Analysis of Simulation Uncertainty Using Cost-Effective Sampling
    Preston, Annie
    Li, Yiran
    Sauer, Franz
    Ma, Kwan-Liu
    2018 IEEE 8TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2018, : 1 - 11
  • [29] Noisy audio feature enhancement using audio-visual speech data
    Goecke, R
    Potamianos, G
    Neti, C
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2025 - 2028
  • [30] Audio-vision: Using audio-visual synchrony to locate sounds
    Hershey, J
    Movellan, J
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 813 - 819