DNN-BASED SPEECH MASK ESTIMATION FOR EIGENVECTOR BEAMFORMING

被引:0
|
作者
Pfeifenberger, Lukas [1 ]
Zoehrer, Matthias [1 ]
Pernkopf, Franz [1 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, Graz, Austria
基金
奥地利科学基金会;
关键词
multi-channel speech enhancement; eigenvector beamforming; speech mask estimation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present an optimal multi-channel Wiener filter, which consists of an eigenvector beamformer and a single-channel postfilter. We show that both components solely depend on a speech presence probability, which we learn using a deep neural network, consisting of a deep autoencoder and a softmax regression layer. To prevent the DNN from learning specific speaker and noise types, we do not use the signal energy as input feature, but rather the cosine distance between the dominant eigenvectors of consecutive frames of the power spectral density of the noisy speech signal. We compare our system against the BeamformIt toolkit, and state-of-the-art approaches such as the front-end of the best system of the CHiME3 challenge. We show that our system yields superior results, both in terms of perceptual speech quality and classification error.
引用
收藏
页码:66 / 70
页数:5
相关论文
共 50 条
  • [1] DNN-BASED MASK ESTIMATION INTEGRATING SPECTRAL AND SPATIAL FEATURES FOR ROBUST BEAMFORMING
    Deng, Chengyun
    Song, Hui
    Zhang, Yi
    Sha, Yongtao
    Li, Xiangang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4647 - 4651
  • [2] DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS
    Furnon, Nicolas
    Serizel, Romain
    Illina, Irina
    Essid, Slim
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4672 - 4676
  • [3] INTEGRATING DNN-BASED AND SPATIAL CLUSTERING-BASED MASK ESTIMATION FOR ROBUST MVDR BEAMFORMING
    Nakatani, Tomohiro
    To, Nobutaka
    Higuchi, Takuya
    Araki, Shoko
    Kinoshita, Keisuke
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 286 - 290
  • [4] DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays
    Furnon, Nicolas
    Serizel, Romain
    Essid, Slim
    Illina, Irina
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2310 - 2323
  • [5] ONLINE INTEGRATION OF DNN-BASED AND SPATIAL CLUSTERING-BASED MASK ESTIMATION FOR ROBUST MVDR BEAMFORMING
    Matsui, Yutaro
    Nakatani, Tomohiro
    Delcroix, Marc
    Kinoshita, Keisuke
    Ito, Nobutaka
    Araki, Shoko
    Makino, Shoji
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 71 - 75
  • [6] SYNTHETIC DATA FOR DNN-BASED DOA ESTIMATION OF INDOOR SPEECH
    Gelderblom, Femke B.
    Liu, Yi
    Kvam, Johannes
    Myrvoll, Tor Andre
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4390 - 4394
  • [7] Power Exponent Based Weighting Criterion for DNN-Based Mask Approximation in Speech Enhancement
    Cui, Zihao
    Bao, Changchun
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 618 - 622
  • [8] DNN-based Intelligent Beamforming on a Programmable Metasurface
    Li, Shangyang
    Fu, Shilei
    Xu, Feng
    [J]. Journal of Radars, 2021, 10 (02) : 259 - 266
  • [9] DNN-Based Arabic Speech Synthesis
    Amrouche, Aissa
    Bentrcia, Youssouf
    Boubakeur, Khadidja Nesrine
    Abed, Ahcene
    [J]. 2022 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2022), 2022, : 378 - 382
  • [10] Online Multichannel Speech Enhancement Based on Recursive EM and DNN-Based Speech Presence Estimation
    Martin-Donas, Juan Manuel
    Jensen, Jesper
    Tan, Zheng-Hua
    Gomez, Angel M.
    Peinado, Antonio M.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 3080 - 3094