Speaker Orientation Estimation based on Hybridation of GCC-PHAT and HLBR

被引:0
|
作者
Segura, Carlos [1 ]
Abad, Alberto [1 ]
Hernando, Javier [1 ]
Nadeu, Climent [1 ]
机构
[1] Univ Politecn Cataluna, TALP Res Ctr, Barcelona, Spain
关键词
Head orientation; Speaker orientation; Speaker localization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel approach to speaker orientation estimation in a Smart Room environment equipped with multiple microphones. The ratio between the high and low band energies (HLBR) received at each microphone has been shown in our previous work to be a potentially approach to estimate the direction of the voice produced by a speaker. In this work, for each microphone pair, a smoothed CPS phase is obtained by a proper windowing of the main peak of the cross-correlation sequence estimated with the GCC-PHAT method, and a HLBR is computed from the processed CPS. The proposed method keeps the computational simplicity of the HLBR algorithm while adding the robustness offered by the GCC-PHAT technique. Experimental preliminary results were conducted over a database recorded purposely in the UPC Smart room, and over the CLEAR head pose database. The proposed method performs consistently better than other state-of-the-art techniques with both databases.
引用
收藏
页码:1325 / 1328
页数:4
相关论文
共 50 条
  • [1] GCC-PHAT based Head Orientation Estimation
    Segura, Carlos
    Hernando, Javier
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1738 - 1741
  • [2] Time Delay Estimation for Speaker Localization Using CNN-Based Parametrized GCC-PHAT Features
    Salvati, Daniele
    Drioli, Carlo
    Foresti, Gian Luca
    INTERSPEECH 2021, 2021, : 1479 - 1483
  • [3] Water leakage detection and localisation based on GCC-PHAT algorithm
    Liu Y.
    Tao B.
    Jiang G.
    Li G.
    Zhang X.
    Chen D.
    International Journal of Wireless and Mobile Computing, 2020, 19 (01): : 55 - 61
  • [4] Analysis of the GCC-PHAT technique for multiple sources
    Kwon, Byoungho
    Park, Youngjin
    Park, Youn-sik
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 2070 - 2073
  • [5] Optimization Algorithm for Delay Estimation Based on Singular Value Decomposition and Improved GCC-PHAT Weighting
    Wang, Shizhe
    Li, Zongji
    Wang, Pingbo
    Chen, Huadong
    SENSORS, 2022, 22 (19)
  • [6] ON PRE-FILTERING STRATEGIES FOR THE GCC-PHAT ALGORITHM
    Kang, Hong-Goo
    Graczyk, Michael
    Skoglund, Jan
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [7] Dynamic Adjustment of Weighted GCC-PHAT for Position Estimation in an Ultrasonic Local Positioning System
    Manuel Villadangos, Jose
    Urena, Jesus
    Jesus Garcia-Dominguez, Juan
    Jimenez-Martin, Ana
    Hernandez, Alvaro
    Perez-Rubio, Ma Carmen
    SENSORS, 2021, 21 (21)
  • [8] SIGNAL-INFORMED DNN-BASED DOA ESTIMATION COMBINING AN EXTERNAL MICROPHONE AND GCC-PHAT FEATURES
    Kowalk, Ulrik
    Doclo, Simon
    Bitzer, Joerg
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [9] Extending GCC-PHAT using Shift Equivariant Neural Networks
    Berg, Axel
    O'Connor, Mark
    Astrom, Kalle
    Oskarsson, Magnus
    INTERSPEECH 2022, 2022, : 1791 - 1795
  • [10] Sound Source Localization Based on GCC-PHAT With Diffuseness Mask in Noisy and Reverberant Environments
    Lee, Ran
    Kang, Min-Seok
    Kim, Bo-Hyun
    Park, Kang-Ho
    Lee, Sung Q.
    Park, Hyung-Min
    IEEE ACCESS, 2020, 8 : 7373 - 7382