DNN-Based Mask Estimation for Distributed Speech Enhancement in Spatially Unconstrained Microphone Arrays

被引:7
|
作者
Furnon, Nicolas [1 ]
Serizel, Romain [1 ]
Essid, Slim [2 ]
Illina, Irina [1 ]
机构
[1] Univ Lorraine, CNRS, Inria, Loria, F-54000 Nancy, France
[2] Inst Polytech Paris, Telecom Paris, LTCI, F-91764 Palaiseau, France
关键词
Microphone arrays; Speech enhancement; Estimation; Speech processing; Noise measurement; Noise reduction; Distortion; Distributed algorithm; microphone arrays; speech enhancement; MULTICHANNEL WIENER FILTER; LOW-RANK APPROXIMATION; NOISE-REDUCTION; SIGNAL ESTIMATION; SENSOR NETWORKS; SINGLE; SEGREGATION; BEAMFORMER; ALGORITHMS;
D O I
10.1109/TASLP.2021.3092838
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep neural network (DNN)-based speech enhancement algorithms in microphone arrays have now proven to be efficient solutions to speech understanding and speech recognition in noisy environments. However, in the context of ad-hoc microphone arrays, many challenges remain and raise the need for distributed processing. In this paper, we propose to extend a previously introduced distributed DNN-based time-frequency mask estimation scheme that can efficiently use spatial information in form of so-called compressed signals which are pre-filtered target estimations. We study the performance of this algorithm named Tango under realistic acoustic conditions and investigate practical aspects of its optimal application. We show that the nodes in the microphone array cooperate by taking profit of their spatial coverage in the room. We also propose to use the compressed signals not only to convey the target estimation but also the noise estimation in order to exploit the acoustic diversity recorded throughout the microphone array.
引用
收藏
页码:2310 / 2323
页数:14
相关论文
共 50 条
  • [21] Dual-channel DNN-based Speech Enhancement for Smartphones
    Martin-Donas, Juan M.
    Gomez, Angel M.
    Lopez-Espejo, Ivan
    Peinado, Antonio M.
    [J]. 2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
  • [22] SYNTHETIC DATA FOR DNN-BASED DOA ESTIMATION OF INDOOR SPEECH
    Gelderblom, Femke B.
    Liu, Yi
    Kvam, Johannes
    Myrvoll, Tor Andre
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4390 - 4394
  • [23] DNN-based speech enhancement with self-attention on feature dimension
    Cheng, Jiaming
    Liang, Ruiyu
    Zhao, Li
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32449 - 32470
  • [24] DNN-based monaural speech enhancement with temporal and spectral variations equalization
    Kang, Tae Gyoon
    Shin, Jong Won
    Kim, Nam Soo
    [J]. DIGITAL SIGNAL PROCESSING, 2018, 74 : 102 - 110
  • [25] DNN-based speech enhancement with self-attention on feature dimension
    Jiaming Cheng
    Ruiyu Liang
    Li Zhao
    [J]. Multimedia Tools and Applications, 2020, 79 : 32449 - 32470
  • [26] An Adaptation Method in Noise Mismatch Conditions for DNN-based Speech Enhancement
    Xu Si-Ying
    Niu Tong
    Qu Dan
    Long Xing-Yan
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (10): : 4930 - 4951
  • [27] DNN-Based Arabic Speech Synthesis
    Amrouche, Aissa
    Bentrcia, Youssouf
    Boubakeur, Khadidja Nesrine
    Abed, Ahcene
    [J]. 2022 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2022), 2022, : 378 - 382
  • [28] Bottleneck feature-mediated DNN-based feature mapping for throat microphone speech recognition
    Suzuki, Takahito
    Ogata, Jun
    Tsunakawa, Takashi
    Nishida, Masafumi
    Nishimura, Masafumi
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1738 - 1741
  • [29] DNN-Based Feature Extraction for Conflict Intensity Estimation From Speech
    Gosztolya, Gabor
    Toth, Laszlo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (12) : 1837 - 1841
  • [30] Speech Enhancement in Distributed Microphone Arrays Using Polynomial Eigenvalue Decomposition
    d'Olne, Emilie
    Neo, Vincent W.
    Naylor, Patrick A.
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 55 - 59