Design of a robust MVDR beamforming method with Low-Latency by reconstructing covariance matrix for speech enhancement

Cited by: 1
Authors
Zhou, Jing [1]
Bao, Changchun [1]
Zhang, Xu [1]
Xiong, Wenmeng [1]
Jia, Maoshen [1]
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Speech enhancement; Microphone array; Minimum variance distortionless response; Beamformer; Direction of arrival; STEERING VECTOR ESTIMATION; PERFORMANCE; ARRAY;
DOI
10.1016/j.apacoust.2023.109464
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
To address the problems of the conventional minimum variance distortionless response (MVDR) beamformer in practical applications, such as its sensitivity to steering vector mismatch and beam pattern distortion, this paper proposes a robust, low-latency broadband MVDR beamforming method based on covariance matrix reconstruction and applies it to speech enhancement with a linear microphone array. Several important steps are optimized, and the main contribution is the treatment of the correlation terms generated by the low latency. First, the direction of arrival (DOA) is corrected and the steering vector is estimated based on the sparsity of the DOAs corresponding to the sound sources, which improves robustness to steering vector mismatch. Second, the correlation terms between the sound sources and the noise are estimated and eliminated using the Capon power within the eigen-subspace, and the indirect dominant method is used to eliminate the correlation terms between the sound sources, so that the covariance matrix is reconstructed to obtain a more robust MVDR beamformer. Third, the problem of white noise amplification at low-frequency bins is analyzed, and a white noise gain (WNG) modification method is proposed to obtain a compromise between interference suppression and WNG. In the experiments, the TIMIT corpus is used to generate the multi-channel speech data set, and the performance of the proposed method is evaluated with different DOAs and input signal-to-interference-plus-noise ratios (SINRs). The experimental results show that the proposed method can effectively suppress the interferences and reduce the noise with strong robustness. © 2023 Elsevier Ltd. All rights reserved.
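For orientation, the quantities named in the abstract (MVDR weights, Capon power, WNG) can be sketched for a single frequency bin as follows. This is a minimal illustration of the conventional MVDR beamformer, not the proposed method: the array geometry, the cosine DOA convention, the diagonal loading, the model-based covariance, and all function names (`ula_steering_vector`, `mvdr_weights`, `capon_power`, `white_noise_gain`) are assumptions made for this example.

```python
import numpy as np

def ula_steering_vector(num_mics, spacing_m, freq_hz, doa_deg, c=343.0):
    """Far-field steering vector of a uniform linear array (assumed geometry)."""
    m = np.arange(num_mics)
    delay = m * spacing_m * np.cos(np.deg2rad(doa_deg)) / c
    return np.exp(-2j * np.pi * freq_hz * delay)

def mvdr_weights(R, a, loading=1e-3):
    """w = R^{-1} a / (a^H R^{-1} a), with diagonal loading (an assumption here)."""
    n = R.shape[0]
    R_loaded = R + loading * np.real(np.trace(R)) / n * np.eye(n)
    Ri_a = np.linalg.solve(R_loaded, a)
    return Ri_a / (a.conj() @ Ri_a)

def capon_power(R, a):
    """Capon spatial power estimate 1 / (a^H R^{-1} a)."""
    return 1.0 / np.real(a.conj() @ np.linalg.solve(R, a))

def white_noise_gain(w, a):
    """WNG = |w^H a|^2 / (w^H w); it cannot exceed the number of microphones."""
    return np.abs(w.conj() @ a) ** 2 / np.real(w.conj() @ w)

# Hypothetical scenario: 6 microphones, 4 cm spacing, one 2 kHz bin.
num_mics, spacing_m, freq_hz = 6, 0.04, 2000.0
a_target = ula_steering_vector(num_mics, spacing_m, freq_hz, doa_deg=90.0)
a_interf = ula_steering_vector(num_mics, spacing_m, freq_hz, doa_deg=40.0)

# Interference-plus-noise covariance built from a model instead of noisy data,
# a much-simplified stand-in for the idea of covariance reconstruction.
interf_power, noise_power = 10.0, 0.01
R_in = interf_power * np.outer(a_interf, a_interf.conj()) + noise_power * np.eye(num_mics)

w = mvdr_weights(R_in, a_target)
print("target response:    ", abs(w.conj() @ a_target))
print("interferer response:", abs(w.conj() @ a_interf))
print("Capon power at the interferer DOA:", capon_power(R_in, a_interf))
print("white noise gain:   ", white_noise_gain(w, a_target))
```

The distortionless constraint keeps the response toward the target at unity while the interferer direction is attenuated; the WNG value shows how much of the array's uncorrelated-noise robustness is traded for that suppression.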
Pages: 16