Audio Assisted Robust Visual Tracking With Adaptive Particle Filtering

被引:57
|
作者
Kilic, Volkan [1 ]
Barnard, Mark [1 ]
Wang, Wenwu [1 ]
Kittler, Josef [1 ]
机构
[1] Univ Surrey, Dept Elect Engn, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
基金
英国工程与自然科学研究理事会;
关键词
Adaptive particle filter; audio-visual speaker tracking; particle filter; MULTIPLE SPEAKERS; LOCALIZATION; SEGMENTATION; MICROPHONE; FUSION;
D O I
10.1109/TMM.2014.2377515
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of tracking multiple moving speakers in indoor environments has received much attention. Earlier techniques were based purely on a single modality, e.g., vision. Recently, the fusion of multi-modal information has been shown to be instrumental in improving tracking performance, as well as robustness in the case of challenging situations like occlusions (by the limited field of view of cameras or by other speakers). However, data fusion algorithms often suffer from noise corrupting the sensor measurements which cause non-negligible detection errors. Here, a novel approach to combining audio and visual data is proposed. We employ the direction of arrival angles of the audio sources to reshape the typical Gaussian noise distribution of particles in the propagation step and to weight the observation model in the measurement step. This approach is further improved by solving a typical problem associated with the PF, whose efficiency and accuracy usually depend on the number of particles and noise variance used in state estimation and particle propagation. Both parameters are specified beforehand and kept fixed in the regular PF implementation which makes the tracker unstable in practice. To address these problems, we design an algorithm which adapts both the number of particles and noise variance based on tracking error and the area occupied by the particles in the image. Experiments on the AV16.3 dataset show the advantage of our proposed methods over the baseline PF method and an existing adaptive PF algorithm for tracking occluded speakers with a significantly reduced number of particles.
引用
收藏
页码:186 / 200
页数:15
相关论文
共 50 条
  • [31] Integrating the Projective Transform with Particle Filtering for Visual Tracking
    Bouttefroy, P. L. M.
    Bouzerdoum, A.
    Phung, S. L.
    Beghdadi, A.
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2011,
  • [32] Visual Tracking via Particle Filtering on the Affine Group
    Kwon, Junghyun
    Park, Frank C.
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2010, 29 (2-3): : 198 - 217
  • [33] A particle filtering framework with indirect measurements for visual tracking
    Zhang, HH
    Huang, WM
    Huang, ZY
    Zhang, BL
    [J]. 2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 723 - 728
  • [34] Adaptive Part Mining for Robust Visual Tracking
    Ma, Yinchao
    He, Jianfeng
    Yang, Dawei
    Zhang, Tianzhu
    Wu, Feng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11443 - 11457
  • [35] Learning Adaptive Metric for Robust Visual Tracking
    Jiang, Nan
    Liu, Wenyu
    Wu, Ying
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (08) : 2288 - 2300
  • [36] Siamese Visual Tracking with Robust Adaptive Learning
    Zhang, Wancheng
    Chen, Zhi
    Liu, Peizhong
    Deng, Jianhua
    [J]. PROCEEDINGS OF 2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (IEEE-ASID'2019), 2019, : 153 - 157
  • [37] Adaptive Keypoints Grouping for Robust Visual Tracking
    Lao, Mingjie
    Tang, Yazhe
    Kevin, Ang Zong Yao
    Lin, Feng
    [J]. PROCEEDINGS OF THE ION 2017 PACIFIC PNT MEETING, 2017, : 155 - 161
  • [38] Robust Visual Tracking via Adaptive Forest
    Cao, Song
    Xue, Wenfang
    [J]. PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 30 - 34
  • [39] An adaptive spatiotemporal correlation filtering visual tracking method
    Liu, Yuhan
    Yan, He
    Zhang, Wei
    Li, Mengxue
    Liu, Lingkun
    [J]. PLOS ONE, 2023, 18 (01):
  • [40] Noise Estimation and Adaptive Filtering During Visual Tracking
    Ndiour, Ibrahima J.
    Vela, Patricio A.
    [J]. 2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 4365 - 4368