QUATERNION CONVOLUTIONAL NEURAL NETWORKS FOR DETECTION AND LOCALIZATION OF 3D SOUND EVENTS

Cited by: 0
Authors
Comminiello, Danilo [1 ]
Lella, Marco [1 ]
Scardapane, Simone [1 ]
Uncini, Aurelio [1 ]
Affiliations
[1] Sapienza Univ Rome, DIET Dept, Via Eudossiana 18, I-00184 Rome, Italy
Keywords
Quaternion neural networks; Hypercomplex machine learning; 3D audio; Ambisonics
DOI
10.1109/icassp.2019.8682711
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Learning from data in the quaternion domain enables us to exploit the internal dependencies of 4D signals and to treat them as a single entity. One class of data well suited to quaternion-valued processing is 3D acoustic signals in their spherical harmonic decomposition. In this paper, we address the problem of localizing and detecting sound events in the spatial sound field by using quaternion-valued data processing. In particular, we consider the spherical harmonic components of the signals captured by a first-order ambisonic microphone and process them with a quaternion convolutional neural network. Experimental results show that the proposed approach exploits the correlated nature of the ambisonic signals, thus improving accuracy in 3D sound event detection and localization.
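As a rough illustration of the idea in the abstract, the sketch below treats the four channels of a first-order ambisonic frame as one quaternion and filters a sequence of such frames with quaternion-valued weights combined through the Hamilton product. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `hamilton_product` and `quaternion_conv1d`, the one-dimensional valid-mode layout, and the mapping of the four ambisonic channels onto the (real, i, j, k) components are all hypothetical choices made for illustration.

```python
from typing import List, Tuple

# A quaternion stored as (real, i, j, k). Assumption: one first-order
# ambisonic frame (four channels) is mapped onto these four components.
Quaternion = Tuple[float, float, float, float]


def hamilton_product(p: Quaternion, q: Quaternion) -> Quaternion:
    """Return the Hamilton product p * q (non-commutative)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (
        a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,  # real part
        a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,  # i part
        a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,  # j part
        a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2,  # k part
    )


def quaternion_conv1d(
    signal: List[Quaternion], kernel: List[Quaternion]
) -> List[Quaternion]:
    """Valid-mode sliding filter: out[t] = sum_k kernel[k] * signal[t + k].

    Each product is a Hamilton product, so every output component mixes
    all four input components through the same shared quaternion weight.
    """
    out: List[Quaternion] = []
    for t in range(len(signal) - len(kernel) + 1):
        acc = (0.0, 0.0, 0.0, 0.0)
        for k, weight in enumerate(kernel):
            prod = hamilton_product(weight, signal[t + k])
            acc = tuple(x + y for x, y in zip(acc, prod))
        out.append(acc)
    return out


if __name__ == "__main__":
    # Two hypothetical four-channel frames and a two-tap quaternion kernel.
    frames = [(1.0, 2.0, 3.0, 4.0), (5.0, 6.0, 7.0, 8.0)]
    taps = [(1.0, 0.0, 0.0, 0.0), (0.0, 1.0, 0.0, 0.0)]
    print(quaternion_conv1d(frames, taps))
```

Because a single quaternion weight couples all four components, a layer of this kind uses one set of four real parameters where an unconstrained real-valued layer mapping four channels to four channels would use sixteen, which is one way such models can reflect the correlation among the ambisonic channels.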
Pages: 8533-8537
Page count: 5