Detection Sound Source Direction in 3D Space Using Convolutional Neural Networks

被引:0
|
作者
Yue, Xiao [1 ]
Qu, Guangzhi [1 ]
Liu, Bo [2 ]
Liu, Anyi [1 ]
机构
[1] Oakland Univ, Comp Sci & Engn Dept, Rochester, MI 48063 USA
[2] Beijing Univ Technol, Sch Software Engn, Fac Informat Technol, Beijing, Peoples R China
关键词
Sound Source Direction Detection; GCC-PHAT; Convolutional Neural Network; Room impulse simulation;
D O I
10.1109/ai4i.2018.00027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sound source detection and localization have a lot of practical uses in many industrial settings. Most of sound source direction detection algorithms in literature are designed to identify the angle of sound source in a 2D space. In this work, we propose to use convolutional neural networks to detect the sound source direction in a 3D space. This algorithm is based on the generalized cross correlation method with phase transform (GCC-PHAT) [1] to derive time delay of arrival (TDOA). By using a convolutional neural network model, this algorithm can be applied and deployed. In addition, by modifying GCC-PHAT formula, this approach also works of multiple sound sources detection. Simulation experimental results on single sound source and multiple sound sources detection show the proposed system could work in most situations.
引用
收藏
页码:81 / 84
页数:4
相关论文
共 50 条
  • [1] QUATERNION CONVOLUTIONAL NEURAL NETWORKS FOR DETECTION AND LOCALIZATION OF 3D SOUND EVENTS
    Comminiello, Danilo
    Lella, Marco
    Scardapane, Simone
    Uncini, Aurelio
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8533 - 8537
  • [2] Violence Detection using 3D Convolutional Neural Networks
    Su, Jiayi
    Her, Paris
    Clemens, Erik
    Yaz, Edwin
    Schneider, Susan
    Medeiros, Henry
    [J]. 2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022), 2022,
  • [3] Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
    Diaz-Guerra, David
    Miguel, Antonio
    Beltran, Jose R.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 300 - 311
  • [4] Lung Cancer Detection using 3D Convolutional Neural Networks
    Pradhan, Adarsh
    Sarma, Bhaskarjyothi
    Dey, Bhiman Kr
    [J]. 2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020), 2020, : 765 - 770
  • [5] Efficient Violence Detection Using 3D Convolutional Neural Networks
    Li, Ji
    Jiang, Xinghao
    Sun, Tanfeng
    Xu, Ke
    [J]. 2019 16TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2019,
  • [6] Violence Detection in Video by Using 3D Convolutional Neural Networks
    Ding, Chunhui
    Fan, Shouke
    Zhu, Ming
    Feng, Weiguo
    Jia, Baozhi
    [J]. ADVANCES IN VISUAL COMPUTING (ISVC 2014), PT II, 2014, 8888 : 551 - 558
  • [7] Detection of Vertebral Fractures in CT Using 3D Convolutional Neural Networks
    Nicolaes, Joeri
    Raeymaeckers, Steven
    Robben, David
    Wilms, Guido
    Vandermeulen, Dirk
    Libanati, Cesar
    Debois, Marc
    [J]. COMPUTATIONAL METHODS AND CLINICAL APPLICATIONS FOR SPINE IMAGING, CSI 2019, 2020, 11963 : 3 - 14
  • [8] LUNG NODULE DETECTION IN CT USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Huang, Xiaojie
    Shan, Junjie
    Vaidya, Vivek
    [J]. 2017 IEEE 14TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2017), 2017, : 379 - 383
  • [9] Smoke Detection on Video Sequences Using 3D Convolutional Neural Networks
    Lin, Gaohua
    Zhang, Yongming
    Xu, Gao
    Zhang, Qixing
    [J]. FIRE TECHNOLOGY, 2019, 55 (05) : 1827 - 1847
  • [10] Smoke Detection on Video Sequences Using 3D Convolutional Neural Networks
    Gaohua Lin
    Yongming Zhang
    Gao Xu
    Qixing Zhang
    [J]. Fire Technology, 2019, 55 : 1827 - 1847