Microphone Array Speaker Localizers Using Spatial-Temporal Information

被引:0
|
作者
Sharon Gannot
Tsvi Gregory Dvorkind
机构
[1] Bar-Ilan University,School of Engineering
[2] Technion –Israel Institute of Technology,Department of Electrical Engineering
关键词
Azimuth; Kalman Filter; Speech Signal; Temporal Information; Extended Kalman Filter;
D O I
暂无
中图分类号
学科分类号
摘要
A dual-step approach for speaker localization based on a microphone array is addressed in this paper. In the first stage, which is not the main concern of this paper, the time difference between arrivals of the speech signal at each pair of microphones is estimated. These readings are combined in the second stage to obtain the source location. In this paper, we focus on the second stage of the localization task. In this contribution, we propose to exploit the speaker's smooth trajectory for improving the current position estimate. Three localization schemes, which use the temporal information, are presented. The first is a recursive form of the Gauss method. The other two are extensions of the Kalman filter to the nonlinear problem at hand, namely, the extended Kalman filter and the unscented Kalman filter. These methods are compared with other algorithms, which do not make use of the temporal information. An extensive experimental study demonstrates the advantage of using the spatial-temporal methods. To gain some insight on the obtainable performance of the localization algorithm, an approximate analytical evaluation, verified by an experimental study, is conducted. This study shows that in common TDOA-based localization scenarios—where the microphone array has small interelement spread relative to the source position—the elevation and azimuth angles can be accurately estimated, whereas the Cartesian coordinates as well as the range are poorly estimated.
引用
收藏
相关论文
共 50 条
  • [31] Steering of Camera by Stepper Motor Towards Active Speaker Using Microphone Array
    Mukul, Manoj Kumar
    Prasad, Rajkishore
    Choudhary, M. M.
    Matsuno, Fumitoshi
    [J]. 2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 19 - +
  • [32] Speaker attention system for mobile robots using microphone array and face tracking
    Song, Kai-Tai
    Hu, Jwu-Sheng
    Tsai, Chi-Yi
    Chou, Chung-Min
    Cheng, Chieh- Cheng
    Liu, Wei-Han
    Yang, Chia-Hsing
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, : 3624 - +
  • [33] Improved microphone array design with statistical speaker verification
    Demir, Kadir Erdem
    Eskil, M. Taner
    [J]. APPLIED ACOUSTICS, 2021, 175
  • [34] Robust speech recognition with speaker localization by a microphone array
    Yamada, T
    Nakamura, S
    Shikano, K
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1317 - 1320
  • [35] Fire Detection Using Spatial-temporal Analysis
    Chen, Liang-Hua
    Huang, Wei-Cheng
    [J]. WORLD CONGRESS ON ENGINEERING - WCE 2013, VOL III, 2013, : 2222 - 2225
  • [36] Loitering Detection Using Spatial-Temporal Information for Intelligent Surveillance Systems on a Vision Sensor
    Wahyono
    Harjoko, Agus
    Dharmawan, Andi
    Adhinata, Faisal Dharma
    Kosala, Gamma
    Jo, Kang-Hyun
    [J]. JOURNAL OF SENSOR AND ACTUATOR NETWORKS, 2023, 12 (01)
  • [37] Headlight recognition for night-time traffic surveillance using spatial-temporal information
    Sooksatra, Sorn
    Kondo, Toshiaki
    Bunnun, Pished
    Yoshitaka, Atsuo
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2020, 14 (01) : 107 - 114
  • [38] High performance spatial-temporal de-interlacing technique using interfield information
    Hsu, CT
    Chen, MJ
    Huang, CH
    [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 2, PROCEEDINGS, 2004, : 213 - 216
  • [39] Partial volume correction for arterial spin labeling data using spatial-temporal information
    Liu, Yang
    Li, Baojuan
    Zhang, Xi
    Zhang, Linchuan
    Liang, Zhengrong
    Lu, Hongbing
    [J]. MEDICAL IMAGING 2015: IMAGE PROCESSING, 2015, 9413
  • [40] Spatial-temporal analysis of mortality using splines
    vanderLinde, A
    Witzko, KH
    Jockel, KH
    [J]. BIOMETRICS, 1995, 51 (04) : 1352 - 1360