Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter

被引:0
|
作者
Wang, Dujuan [1 ]
Bao, Changchun [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
beamforming; speech enhancement; residual neural network; real and imaginary masks; postfilter;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural network (DNN) based ideal ratio mask (IRM) estimation methods have yielded good performance in monaural speech enhancement. Meanwhile, these methods have also shown considerable potential for beamforming and multichannel speech enhancement. It is crucial for minimum variance distortionless response (MVDR) beamformer to estimate the covariance matrix of the speech and noise accurately. The accurate estimation of time-frequency (T-F) mask has significant impact on the estimation of the covariance matrices. So, in this paper, a complex real and imaginary ratio mask (CRIRM) based MVDR beamformer for speech enhancement using residual network is proposed. First, the real and imaginary masks of speech and noise are estimated by taking advantage of a residual neural network. After that, the estimations of speech and noise are obtained by using the estimated masks. Finally, the covariance matrices of speech and noise are estimated, and applied into the MVDR beamformer. In addition, in order to further reduce residual noise interference, the output of the MVDR beamformer is further processed by an end-to-end monaural speech enhancement module. Experiments show that, the proposed method can better improve the quality and intelligibility of the enhanced speech.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Factorized MVDR Deep Beamforming for Multi-Channel Speech Enhancement
    Kim, Hansol
    Kang, Kyeongmuk
    Shin, Jong Won
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1898 - 1902
  • [2] Three-stage hybrid neural beamformer for multi-channel speech enhancement
    Kuang, Kelan
    Yang, Feiran
    Li, Junfeng
    Yang, Jun
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (06): : 3378 - 3389
  • [3] ADAPTATION MODE CONTROL WITH RESIDUAL NOISE ESTIMATION FOR BEAMFORMER-BASED MULTI-CHANNEL SPEECH ENHANCEMENT
    Kim, Seon Man
    Kim, Hong Kook
    Lee, Sung Joo
    Lee, Yun Keun
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 301 - 304
  • [4] A New Neural Beamformer for Multi-channel Speech Separation
    Liu, Ruqiao
    Zhou, Yi
    Liu, Hongqing
    Xu, Xinmeng
    Jia, Jie
    Chen, Binbin
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (10): : 977 - 987
  • [5] A New Neural Beamformer for Multi-channel Speech Separation
    Ruqiao Liu
    Yi Zhou
    Hongqing Liu
    Xinmeng Xu
    Jie Jia
    Binbin Chen
    [J]. Journal of Signal Processing Systems, 2022, 94 : 977 - 987
  • [6] Steering vector correction in MVDR beamformer for speech enhancement
    Bu, Suliang
    Zhao, Yunxin
    Zhao, Tuo
    [J]. INTERSPEECH 2022, 2022, : 5468 - 5472
  • [7] Multi-Channel Speech Enhancement using a Minimum Variance Distortionless Response Beamformer based on Graph Convolutional Network
    Nguyen Huu Binh
    Duong Van Hai
    Bui Tien Dat
    Hoang Ngoc Chau
    Nguyen Quoc Cuong
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (10) : 739 - 747
  • [8] Influence of MVDR beamformer on a Speech Enhancement based Smartphone application for Hearing Aids
    Shankar, Nikhil
    Kucuk, Abdullah
    Reddy, Chandan K. A.
    Bhat, Gautam S.
    Panahi, Issa M. S.
    [J]. 2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 417 - 420
  • [9] Integration of a Priori and Estimated Constraints Into an MVDR Beamformer for Speech Enhancement
    Ali, Randall
    Van Waterschoot, Toon
    Moonen, Marc
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2288 - 2300
  • [10] A MULTI-CHANNEL POSTFILTER BASED ON THE DIFFUSE NOISE SOUND FIELD
    Pfeifenberger, Lukas
    Pernkopf, Franz
    [J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 686 - 690