Localization of Steady Sound Source and Direction Detection of Moving Sound Source using CNN

被引:0
|
作者
Mane, Shubham S. [1 ]
Mali, Swapnil G. [1 ]
Mahajan, S. P. [1 ]
机构
[1] Coll Engn Pune, Dept Elect & Telecommun, Pune, Maharashtra, India
关键词
Sound Source Localization; Direction detection of moving source; Convolutional Neural Network; Supervised learning; DOA Estimation; TIME-DELAY ESTIMATION;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a convolutional neural network (CNN) based classification method for broadband direction of arrival (DOA) estimation of steady sound source in noisy conditions and also in reverberation conditions using a uniform linear array (ULA) of microphones. In addition, we also find out the direction of moving sound source (left or right). The input to the CNN is given as the Short-Time Fourier Transform (STFT) coefficients of the phase components obtained from the ULA of microphones. The CNN then learns the features required for training. Here we have used the room impulse response (RIR) of each angle and White Gaussian Noise of different variances to generate training database. Also we add synthesized noise signal to training data set to generate the actual speech signal of that particular room, so that the CNN can classify speech source according to the DOA during the demonstration.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Localization of sound source direction in real time
    Kornatowski, Eugeniusz
    [J]. Advances in Intelligent and Soft Computing, 2010, 80 : 39 - 47
  • [2] Moving sound source localization in large areas
    Pertilä, P
    Parviainen, M
    Korhonen, T
    Visa, A
    [J]. ISPACS 2005: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, 2005, : 745 - 748
  • [3] Optimal Prediction of Moving Sound Source Direction in the Owl
    Cox, Weston
    Fischer, Brian J.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (07)
  • [4] Moving sound source localization based on triangulation method
    Miao, Feng
    Yang, Diange
    Wen, Junjie
    Lian, Xiaomin
    [J]. JOURNAL OF SOUND AND VIBRATION, 2016, 385 : 93 - 103
  • [5] Sound source localization
    Risoud, M.
    Hanson, J. -N.
    Gauvrit, F.
    Renard, C.
    Lemesre, P. -E.
    Bonne, N. -X.
    Vincent, C.
    [J]. EUROPEAN ANNALS OF OTORHINOLARYNGOLOGY-HEAD AND NECK DISEASES, 2018, 135 (04) : 259 - 264
  • [6] THE SOUND FIELD OF A MOVING SOURCE
    ZATZKIS, H
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1953, 25 (05): : 897 - 898
  • [7] SOUND FIELD OF A MOVING SOURCE
    KRASILNIKOV, VA
    PAVLOV, VI
    [J]. IZVESTIYA VYSSHIKH UCHEBNYKH ZAVEDENII RADIOFIZIKA, 1981, 24 (05): : 609 - 620
  • [8] Multiple sound source localization using gammatone auditory filtering and direct sound componence detection
    Chen, Huaiyu
    Cao, Li
    [J]. 3RD INTERNATIONAL CONFERENCE ON ADVANCES IN ENERGY, ENVIRONMENT AND CHEMICAL ENGINEERING, 2017, 69
  • [9] Using sound source localization in a home environment
    Bian, XH
    Abowd, GD
    Rehg, JM
    [J]. PERVASIVE COMPUTING, PROCEEDINGS, 2005, 3468 : 19 - 36
  • [10] Sound Source Localization using Stochastic Computing
    Schober, Peter
    Estiri, Seyedeh Newsha
    Aygun, Sercan
    TaheriNejad, Nima
    Najafi, M. Hassan
    [J]. 2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,