Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking

被引:17
|
作者
Zhang, Wangyou [1 ]
Zhou, Ying [1 ]
Qian, Yanmin [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, MoE Key Lab Artificial Intelligence, SpeechLab, Shanghai, Peoples R China
来源
关键词
source localization; direction-of-arrival estimation; convolutional neural networks; time-frequency masking; multi-task learning;
D O I
10.21437/Interspeech.2019-3158
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In the scenario with noise and reverberation, the performance of current methods for direction of arrival (DOA) estimation usually degrades significantly. Inspired by the success of time-frequency masking in speech enhancement and speech separation, this paper proposes new methods to better utilize timefrequency masking in convolution neural network to improve the robustness of localization. First a mask estimation network is developed to assist DOA estimation by either appending or multiplying the estimated masks to the original input feature. Then we further propose a multi-task learning architecture to optimize the mask and DOA estimation networks jointly, and two modes are designed and compared. Experiments show that all the proposed methods have better robustness and generalization in noisy and reverberant conditions compared to the conventional methods, and the multi-task methods have the best performance among all approaches.
引用
收藏
页码:2703 / 2707
页数:5
相关论文
共 50 条
  • [1] Robust TDOA Estimation Based on Time-Frequency Masking and Deep Neural Networks
    Wang, Zhong-Qiu
    Zhang, Xueliang
    Wang, DeLiang
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 322 - 326
  • [2] Underdetermined DOA Estimation via Independent Component Analysis and Time-Frequency Masking
    Jancovic, Peter
    Zou, Xin
    Kokuer, Munevver
    [J]. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2010, 2010
  • [3] DOA Estimation Based on Spatial Time-frequency Analysis
    Liu, B. S.
    Song, H. Y.
    Shi, J.
    Diao, M.
    Yang, C. Y.
    [J]. MECHANICAL, CONTROL, ELECTRIC, MECHATRONICS, INFORMATION AND COMPUTER, 2016, : 80 - 85
  • [4] NEURAL NETWORK BASED TIME-FREQUENCY MASKING AND STEERING VECTOR ESTIMATION FOR TWO-CHANNEL MVDR BEAMFORMING
    Liu, Yuzhou
    Ganguly, Anshuman
    Kamath, Krishna
    Kristjansson, Trausti
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6717 - 6721
  • [5] Frequency hopping modulation recognition of convolutional neural network based on time-frequency characteristics
    Li, Hong-Guang
    Guo, Ying
    Sui, Ping
    Qi, Zi-Sen
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (10): : 1945 - 1954
  • [6] Time-Frequency Representation and Convolutional Neural Network-Based Emotion Recognition
    Khare, Smith K.
    Bajaj, Varun
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (07) : 2901 - 2909
  • [7] Detection of microseismic events based on time-frequency analysis and convolutional neural network
    Sheng, Li
    Xu, Xilong
    Wang, Weibo
    Gao, Ming
    [J]. Zhongguo Shiyou Daxue Xuebao (Ziran Kexue Ban)/Journal of China University of Petroleum (Edition of Natural Science), 2021, 45 (05): : 54 - 63
  • [8] DOA Estimation Method Based on Improved Deep Convolutional Neural Network
    Zhao, Fangzheng
    Hu, Guoping
    Zhan, Chenghong
    Zhang, Yule
    [J]. SENSORS, 2022, 22 (04)
  • [9] Leakage Detection in Water Distribution Systems Based on Time-Frequency Convolutional Neural Network
    Guo, Guancheng
    Yu, Xipeng
    Liu, Shuming
    Ma, Ziqing
    Wu, Yipeng
    Xu, Xiyan
    Wang, Xiaoting
    Smith, Kate
    Wu, Xue
    [J]. JOURNAL OF WATER RESOURCES PLANNING AND MANAGEMENT, 2021, 147 (02)
  • [10] Radar Emitter Identification Based on Novel Time-Frequency Spectrum and Convolutional Neural Network
    Xiao, Zhiling
    Yan, Zhenya
    [J]. IEEE COMMUNICATIONS LETTERS, 2021, 25 (08) : 2634 - 2638