QUATERNION NEURAL NETWORKS FOR 3D SOUND SOURCE LOCALIZATION IN REVERBERANT ENVIRONMENTS

被引:8
|
作者
Celsi, Michela Ricciardi [1 ]
Scardapane, Simone [1 ]
Comminiello, Danilo [1 ]
机构
[1] Sapienza Univ Rome, Dept Informat Engn Elect & Telecommun, Rome, Italy
关键词
Quaternion neural networks; Hypercomplex-valued neural networks; 3D audio; Source localization; Convolutional recurrent neural networks;
D O I
10.1109/mlsp49062.2020.9231809
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Localization of sound sources in 3D sound fields is an extremely challenging task, especially when the environments are reverberant and involve multiple sources. In this work, we propose a deep neural network to analyze audio signals recorded by 3D microphones and localize sound sources in a spatial sound field. In particular, we consider first-order Ambisonics microphones to capture 3D acoustic signals and represent them by spherical harmonic decomposition in the quaternion domain. Moreover, to improve the localization performance, we use quaternion input features derived from the acoustic intensity, which is strictly related to the direction of arrival (DOA) of a sound source. The proposed network architecture involves both quaternion-valued convolutional and recurrent layers. Results show that the proposed method is able to exploit both the quaternion-valued representation of ambisonic signals and to improve the localization performance with respect to existing methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] QUATERNION CONVOLUTIONAL NEURAL NETWORKS FOR DETECTION AND LOCALIZATION OF 3D SOUND EVENTS
    Comminiello, Danilo
    Lella, Marco
    Scardapane, Simone
    Uncini, Aurelio
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8533 - 8537
  • [2] Sound Source Localization through Optimal Peak Association in Reverberant Environments
    Zhu, Hongyan
    Li, Zelin
    Cheng, Qi
    [J]. 2017 20TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2017, : 467 - 472
  • [3] Improved TDOA Disambiguation Techniques for Sound Source Localization in Reverberant Environments
    Zannini, Cecilia Maria
    Cirillo, Albenzio
    Parisi, Raffaele
    Uncini, Aurelio
    [J]. 2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 2666 - 2669
  • [4] Sound source localization in reverberant environments using an outlier elimination algorithm
    Jan, EE
    Flanagan, J
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1321 - 1324
  • [5] Sound Source Localization Based on Robust Least Squares in Reverberant Environments
    Zhu, Hongyan
    Dang, Xudong
    Li, Zelin
    Ge, Quanbo
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2029 - 2035
  • [6] Localization of Sound Source in 3D Space
    Hinich, Petar
    Hinich, Sasa
    Milutinovich, Svetko
    [J]. ISSPIT: 8TH IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2008, : 219 - +
  • [7] Analysis of source localization in reverberant environments
    Peterson, J. Michael
    Kyriakakis, Chris
    [J]. 2006 IEEE SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP PROCEEDINGS, VOLS 1 AND 2, 2006, : 672 - +
  • [8] Robust MUSIC-Based Sound Source Localization in Reverberant and Echoic Environments
    Sewtz, Marco
    Bodenmueller, Tim
    Triebel, Rudolph
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 2474 - 2480
  • [9] Sound Source Localization in Reverberant Environments Based on Structural Sparse Bayesian Learning
    Liu, Yanshan
    Wang, Lu
    Zeng, Xiangyang
    Wang, Haitao
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2018, 104 (03) : 528 - 541
  • [10] Neural Coding of Sound Envelope in Reverberant Environments
    Slama, Michael C. C.
    Delgutte, Bertrand
    [J]. JOURNAL OF NEUROSCIENCE, 2015, 35 (10): : 4452 - 4468