Microphone Array Speaker Localizers Using Spatial-Temporal Information

被引:0
|
作者
Sharon Gannot
Tsvi Gregory Dvorkind
机构
[1] Bar-Ilan University,School of Engineering
[2] Technion –Israel Institute of Technology,Department of Electrical Engineering
关键词
Azimuth; Kalman Filter; Speech Signal; Temporal Information; Extended Kalman Filter;
D O I
暂无
中图分类号
学科分类号
摘要
A dual-step approach for speaker localization based on a microphone array is addressed in this paper. In the first stage, which is not the main concern of this paper, the time difference between arrivals of the speech signal at each pair of microphones is estimated. These readings are combined in the second stage to obtain the source location. In this paper, we focus on the second stage of the localization task. In this contribution, we propose to exploit the speaker's smooth trajectory for improving the current position estimate. Three localization schemes, which use the temporal information, are presented. The first is a recursive form of the Gauss method. The other two are extensions of the Kalman filter to the nonlinear problem at hand, namely, the extended Kalman filter and the unscented Kalman filter. These methods are compared with other algorithms, which do not make use of the temporal information. An extensive experimental study demonstrates the advantage of using the spatial-temporal methods. To gain some insight on the obtainable performance of the localization algorithm, an approximate analytical evaluation, verified by an experimental study, is conducted. This study shows that in common TDOA-based localization scenarios—where the microphone array has small interelement spread relative to the source position—the elevation and azimuth angles can be accurately estimated, whereas the Cartesian coordinates as well as the range are poorly estimated.
引用
收藏
相关论文
共 50 条
  • [1] Microphone array speaker localizers using spatial-temporal information
    Gannot, Sharon
    Dvorkind, Tsvi Gregory
    [J]. Eurasip Journal on Applied Signal Processing, 2006, 2006
  • [2] Microphone array speaker localizers using spatial-temporal information
    Gannot, Sharon
    Dvorkind, Tsvi Gregory
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
  • [3] Performance of speaker localization using microphone array
    Visalakshi, R.
    Dhanalakshmi, P.
    Palanivel, S.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 467 - 483
  • [4] An improved spatial-temporal equalization scheme using array antennas
    Xu, B
    Vu, TB
    [J]. IEEE ANTENNAS AND PROPAGATION SOCIETY INTERNATIONAL SYMPOSIUM - ANTENNAS: GATEWAYS TO THE GLOBAL NETWORK, VOLS 1-4, 1998, : 388 - 391
  • [5] Speaker localization using microphone array in a reverberant room
    Zou, QY
    Rahardja, S
    Cai, ZB
    [J]. 2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 354 - 357
  • [6] A framework for speaker tracking using microphone array and camera
    Chen, JF
    Jiang, LJ
    Ser, W
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1384 - 1387
  • [7] The Spatial-Temporal Channel for Information Fusion
    Li Weihua
    Fu Xiaodong
    [J]. 8TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY WORKSHOPS: CIT WORKSHOPS 2008, PROCEEDINGS, 2008, : 483 - 487
  • [8] Fuzzy clustering with spatial-temporal information
    D'Urso, Pierpaolo
    De Giovanni, Livia
    Disegna, Marta
    Massari, Riccardo
    [J]. SPATIAL STATISTICS, 2019, 30 : 71 - 102
  • [9] Spatial-temporal active wave computing using infrared proximity array
    Koller, Miklos
    Cserey, Gyoergy
    [J]. INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2012, 40 (12) : 1209 - 1218
  • [10] Microphone array response to speaker movements
    Grenier, Y
    Affes, S
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 247 - 250