DNN-BASED SPEECH MASK ESTIMATION FOR EIGENVECTOR BEAMFORMING

被引：0

作者：

Pfeifenberger, Lukas ^{[1
]}

Zoehrer, Matthias ^{[1
]}

Pernkopf, Franz ^{[1
]}

机构：

[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, Graz, Austria

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年

基金：

奥地利科学基金会;

关键词：

multi-channel speech enhancement; eigenvector beamforming; speech mask estimation;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we present an optimal multi-channel Wiener filter, which consists of an eigenvector beamformer and a single-channel postfilter. We show that both components solely depend on a speech presence probability, which we learn using a deep neural network, consisting of a deep autoencoder and a softmax regression layer. To prevent the DNN from learning specific speaker and noise types, we do not use the signal energy as input feature, but rather the cosine distance between the dominant eigenvectors of consecutive frames of the power spectral density of the noisy speech signal. We compare our system against the BeamformIt toolkit, and state-of-the-art approaches such as the front-end of the best system of the CHiME3 challenge. We show that our system yields superior results, both in terms of perceptual speech quality and classification error.

引用

页码：66 / 70

页数：5

共 50 条

[41] On the Training of DNN-based Average Voice Model for Speech Synthesis
Yang, Shan
Wu, Zhizheng
Xie, Lei
[J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[42] Dual-channel DNN-based Speech Enhancement for Smartphones
Martin-Donas, Juan M.
Gomez, Angel M.
Lopez-Espejo, Ivan
Peinado, Antonio M.
[J]. 2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
[43] Robust Beam forming for Speech Recognition Using DNN-Based Time-Frequency Masks Estimation
Jiang, Wenbin
Wen, Fei
Liu, Peilin
[J]. IEEE ACCESS, 2018, 6 : 52385 - 52392
[44] DNN-Based Estimation for Misalignment State of Automotive Radar Sensor
Kim, Junho
Jeong, Taewon
Lee, Seongwook
[J]. SENSORS, 2023, 23 (14)
[45] DNN-Based Force Estimation in Hyper-Redundant Manipulators
Choi, Sunwoong
Moon, Yonghwan
Kim, Jeongryul
Kim, Keri
[J]. INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2024, 25 (10) : 2111 - 2123
[46] Towards minimum perceptual error training for DNN-based speech synthesis
Valentini-Botinhao, Cassia
Wu, Zhizheng
King, Simon
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 869 - 873
[47] Modeling Long Temporal Contexts for Robust DNN-based Speech Recognition
Li, Bo
Sim, Khe Chai
[J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 353 - 357
[48] DNN-based Bilingual (Telugu-Hindi) Polyglot Speech Synthesis
Reddy, M. Kiran
Rao, K. Sreenivasa
[J]. 2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1808 - 1811
[49] AUTOREGRESSIVE PARAMETER ESTIMATION WITH DNN-BASED PRE-PROCESSING
Cui, Zihao
Bao, Changchun
Nielsen, Jesper Kjoer
Christensen, Mads Groesboll
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6759 - 6763
[50] DNN-Based Fractional Doppler Channel Estimation for OTFS Modulation
Guo, Lin
Gu, Peng
Zou, Jun
Liu, Guangzu
Shu, Feng
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (11) : 15062 - 15067

← 1 2 3 4 5 →