QUATERNION NEURAL NETWORKS FOR 3D SOUND SOURCE LOCALIZATION IN REVERBERANT ENVIRONMENTS

被引:8
|
作者
Celsi, Michela Ricciardi [1 ]
Scardapane, Simone [1 ]
Comminiello, Danilo [1 ]
机构
[1] Sapienza Univ Rome, Dept Informat Engn Elect & Telecommun, Rome, Italy
关键词
Quaternion neural networks; Hypercomplex-valued neural networks; 3D audio; Source localization; Convolutional recurrent neural networks;
D O I
10.1109/mlsp49062.2020.9231809
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Localization of sound sources in 3D sound fields is an extremely challenging task, especially when the environments are reverberant and involve multiple sources. In this work, we propose a deep neural network to analyze audio signals recorded by 3D microphones and localize sound sources in a spatial sound field. In particular, we consider first-order Ambisonics microphones to capture 3D acoustic signals and represent them by spherical harmonic decomposition in the quaternion domain. Moreover, to improve the localization performance, we use quaternion input features derived from the acoustic intensity, which is strictly related to the direction of arrival (DOA) of a sound source. The proposed network architecture involves both quaternion-valued convolutional and recurrent layers. Results show that the proposed method is able to exploit both the quaternion-valued representation of ambisonic signals and to improve the localization performance with respect to existing methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Inhibiting the inhibition: A neuronal network for sound localization in reverberant environments
    Pecka, Michael
    Zahn, Thomas P.
    Saunier-Rebori, Bernadette
    Siveke, Ida
    Felmy, Felix
    Wiegrebe, Lutz
    Klug, Achim
    Pollak, George D.
    Grothe, Benedikt
    [J]. JOURNAL OF NEUROSCIENCE, 2007, 27 (07): : 1782 - 1790
  • [32] 3D Moving Sound Source Localization via Conventional Microphones
    Catalbas, Mehmet Cem
    Dobrisek, Simon
    [J]. ELEKTRONIKA IR ELEKTROTECHNIKA, 2017, 23 (04) : 63 - 69
  • [33] Sound Source Localization in Reverberant Environment using Visual information
    Lee, Byoung-gi
    Choi, JongSuk
    Kim, Daijin
    Kim, Munsang
    [J]. IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 3542 - 3547
  • [34] Effect of source spectrum on sound localization in an everyday reverberant room
    Ihlefeld, Antje
    Shinn-Cunningham, Barbara G.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (01): : 324 - 333
  • [35] Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
    Diaz-Guerra, David
    Miguel, Antonio
    Beltran, Jose R.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 300 - 311
  • [36] A Maximum a Posteriori Sound Source Localization in Reverberant and Noisy Conditions
    Choi, Jinho
    Yoo, Chang D.
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2770 - 2773
  • [37] Dual input neural networks for positional sound source localization
    Eric Grinstein
    Vincent W. Neo
    Patrick A. Naylor
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [38] Dual input neural networks for positional sound source localization
    Grinstein, Eric
    Neo, Vincent W.
    Naylor, Patrick A.
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [39] Acoustic eyes, a novel sound source localization and monitoring technique with 3D sound probes
    Basten, T. G. H.
    de Bree, H. E.
    Sadasivan, S.
    [J]. PROCEEDINGS OF ISMA 2008: INTERNATIONAL CONFERENCE ON NOISE AND VIBRATION ENGINEERING, VOLS. 1-8, 2008, : 3113 - +
  • [40] Source localization in reverberant environments: Performance bounds and ML estimation
    Gustafsson, T
    Rao, BD
    Trivedi, M
    [J]. CONFERENCE RECORD OF THE THIRTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1 AND 2, 2001, : 1583 - 1587