DNN-BASED SPEECH MASK ESTIMATION FOR EIGENVECTOR BEAMFORMING

被引:0
|
作者
Pfeifenberger, Lukas [1 ]
Zoehrer, Matthias [1 ]
Pernkopf, Franz [1 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, Graz, Austria
基金
奥地利科学基金会;
关键词
multi-channel speech enhancement; eigenvector beamforming; speech mask estimation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present an optimal multi-channel Wiener filter, which consists of an eigenvector beamformer and a single-channel postfilter. We show that both components solely depend on a speech presence probability, which we learn using a deep neural network, consisting of a deep autoencoder and a softmax regression layer. To prevent the DNN from learning specific speaker and noise types, we do not use the signal energy as input feature, but rather the cosine distance between the dominant eigenvectors of consecutive frames of the power spectral density of the noisy speech signal. We compare our system against the BeamformIt toolkit, and state-of-the-art approaches such as the front-end of the best system of the CHiME3 challenge. We show that our system yields superior results, both in terms of perceptual speech quality and classification error.
引用
收藏
页码:66 / 70
页数:5
相关论文
共 50 条
  • [41] On the Training of DNN-based Average Voice Model for Speech Synthesis
    Yang, Shan
    Wu, Zhizheng
    Xie, Lei
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [42] Dual-channel DNN-based Speech Enhancement for Smartphones
    Martin-Donas, Juan M.
    Gomez, Angel M.
    Lopez-Espejo, Ivan
    Peinado, Antonio M.
    [J]. 2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
  • [43] Robust Beam forming for Speech Recognition Using DNN-Based Time-Frequency Masks Estimation
    Jiang, Wenbin
    Wen, Fei
    Liu, Peilin
    [J]. IEEE ACCESS, 2018, 6 : 52385 - 52392
  • [44] DNN-Based Estimation for Misalignment State of Automotive Radar Sensor
    Kim, Junho
    Jeong, Taewon
    Lee, Seongwook
    [J]. SENSORS, 2023, 23 (14)
  • [45] DNN-Based Force Estimation in Hyper-Redundant Manipulators
    Choi, Sunwoong
    Moon, Yonghwan
    Kim, Jeongryul
    Kim, Keri
    [J]. INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2024, 25 (10) : 2111 - 2123
  • [46] Towards minimum perceptual error training for DNN-based speech synthesis
    Valentini-Botinhao, Cassia
    Wu, Zhizheng
    King, Simon
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 869 - 873
  • [47] Modeling Long Temporal Contexts for Robust DNN-based Speech Recognition
    Li, Bo
    Sim, Khe Chai
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 353 - 357
  • [48] DNN-based Bilingual (Telugu-Hindi) Polyglot Speech Synthesis
    Reddy, M. Kiran
    Rao, K. Sreenivasa
    [J]. 2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1808 - 1811
  • [49] AUTOREGRESSIVE PARAMETER ESTIMATION WITH DNN-BASED PRE-PROCESSING
    Cui, Zihao
    Bao, Changchun
    Nielsen, Jesper Kjoer
    Christensen, Mads Groesboll
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6759 - 6763
  • [50] DNN-Based Fractional Doppler Channel Estimation for OTFS Modulation
    Guo, Lin
    Gu, Peng
    Zou, Jun
    Liu, Guangzu
    Shu, Feng
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (11) : 15062 - 15067