Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter

被引：0

作者：

Wang, Dujuan ^{[1
]}

Bao, Changchun ^{[1
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing, Peoples R China

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020) | 2020年

基金：

中国国家自然科学基金;

关键词：

beamforming; speech enhancement; residual neural network; real and imaginary masks; postfilter;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep neural network (DNN) based ideal ratio mask (IRM) estimation methods have yielded good performance in monaural speech enhancement. Meanwhile, these methods have also shown considerable potential for beamforming and multichannel speech enhancement. It is crucial for minimum variance distortionless response (MVDR) beamformer to estimate the covariance matrix of the speech and noise accurately. The accurate estimation of time-frequency (T-F) mask has significant impact on the estimation of the covariance matrices. So, in this paper, a complex real and imaginary ratio mask (CRIRM) based MVDR beamformer for speech enhancement using residual network is proposed. First, the real and imaginary masks of speech and noise are estimated by taking advantage of a residual neural network. After that, the estimations of speech and noise are obtained by using the estimated masks. Finally, the covariance matrices of speech and noise are estimated, and applied into the MVDR beamformer. In addition, in order to further reduce residual noise interference, the output of the MVDR beamformer is further processed by an end-to-end monaural speech enhancement module. Experiments show that, the proposed method can better improve the quality and intelligibility of the enhanced speech.

引用

页数：5

共 50 条

[1] Factorized MVDR Deep Beamforming for Multi-Channel Speech Enhancement
Kim, Hansol
Kang, Kyeongmuk
Shin, Jong Won
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1898 - 1902
[2] Three-stage hybrid neural beamformer for multi-channel speech enhancement
Kuang, Kelan
Yang, Feiran
Li, Junfeng
Yang, Jun
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (06): : 3378 - 3389
[3] LEARNING-BASED MULTI-CHANNEL SPEECH PRESENCE PROBABILITY ESTIMATION USING A LOW-PARAMETER MODEL AND INTEGRATION WITH MVDR BEAMFORMING FOR MULTI-CHANNEL SPEECH ENHANCEMENT
Tao, Shuai
Mowlaee, Pejman
Jensen, Jesper Rindom
Christensen, Mads Graesboll
2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 100 - 104
[4] ADAPTATION MODE CONTROL WITH RESIDUAL NOISE ESTIMATION FOR BEAMFORMER-BASED MULTI-CHANNEL SPEECH ENHANCEMENT
Kim, Seon Man
Kim, Hong Kook
Lee, Sung Joo
Lee, Yun Keun
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 301 - 304
[5] A New Neural Beamformer for Multi-channel Speech Separation
Liu, Ruqiao
Zhou, Yi
Liu, Hongqing
Xu, Xinmeng
Jia, Jie
Chen, Binbin
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (10): : 977 - 987
[6] A New Neural Beamformer for Multi-channel Speech Separation
Ruqiao Liu
Yi Zhou
Hongqing Liu
Xinmeng Xu
Jie Jia
Binbin Chen
Journal of Signal Processing Systems, 2022, 94 : 977 - 987
[7] Steering vector correction in MVDR beamformer for speech enhancement
Bu, Suliang
Zhao, Yunxin
Zhao, Tuo
INTERSPEECH 2022, 2022, : 5468 - 5472
[8] Multi-Channel Speech Enhancement using a Minimum Variance Distortionless Response Beamformer based on Graph Convolutional Network
Nguyen Huu Binh
Duong Van Hai
Bui Tien Dat
Hoang Ngoc Chau
Nguyen Quoc Cuong
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (10) : 739 - 747
[9] Influence of MVDR beamformer on a Speech Enhancement based Smartphone application for Hearing Aids
Shankar, Nikhil
Kucuk, Abdullah
Reddy, Chandan K. A.
Bhat, Gautam S.
Panahi, Issa M. S.
2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 417 - 420
[10] Integration of a Priori and Estimated Constraints Into an MVDR Beamformer for Speech Enhancement
Ali, Randall
Van Waterschoot, Toon
Moonen, Marc
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2288 - 2300

← 1 2 3 4 5 →