Dual-channel DNN-based Speech Enhancement for Smartphones

Cited by: 0
Authors
Martin-Donas, Juan M. [1]
Gomez, Angel M. [1]
Lopez-Espejo, Ivan [2]
Peinado, Antonio M. [1]
Affiliations
[1] Univ Granada, Dept Signal Theory Telemat & Commun, Granada, Spain
[2] VeriDas Das Nano, Navarra, Spain
Keywords
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Speech communications in real-world scenarios need high-performance enhancement algorithms to address the distortions that can degrade the intelligibility and quality of the speech signal. Current portable devices usually integrate multiple microphones, which can conveniently be exploited to improve signal quality. In this paper we present a dual-microphone speech enhancement approach suitable for smartphones with primary (front) and reference (back) microphones. Our proposal is based on deep neural networks, which are able to learn a non-linear mapping function between noisy and clean speech signals. We explore two different architectures: a feedforward deep neural network (DNN) with temporal context and a gated recurrent unit (GRU) recurrent neural network (RNN). The proposed system is evaluated under different acoustic conditions in close- and far-talk device positions. A comparison with other single- and dual-channel approaches shows that our proposal obtains the best performance in terms of perceptual quality.
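To make the first architecture more concrete, the following is a minimal sketch of how per-frame features from the primary and reference microphones might be joined, expanded with temporal context, and passed through a small feedforward network. It is an illustration only and not the authors' implementation: the feature type, the context size, the layer sizes, the random weights, and all function names (stack_context, dual_channel_input, dense) are assumptions that do not come from the abstract.

```python
import random

def stack_context(frames, context):
    """Join each frame with `context` neighbours on both sides (edge frames repeated)."""
    padded = [frames[0]] * context + frames + [frames[-1]] * context
    width = 2 * context + 1
    return [
        [value for frame in padded[t:t + width] for value in frame]
        for t in range(len(frames))
    ]

def dual_channel_input(primary, reference, context=2):
    """Concatenate primary and reference features per frame, then add temporal context."""
    joint = [p + r for p, r in zip(primary, reference)]
    return stack_context(joint, context)

def dense(x, weights, bias, relu):
    """One fully connected layer; `weights` holds one weight vector per output unit."""
    out = [sum(xi * wi for xi, wi in zip(x, unit)) + b for unit, b in zip(weights, bias)]
    return [max(v, 0.0) for v in out] if relu else out

# Hypothetical sizes, chosen only to keep the example small.
random.seed(0)
num_frames, num_bins, context, hidden = 10, 8, 2, 32

# Random stand-ins for per-frame spectral features of each microphone.
primary = [[random.gauss(0, 1) for _ in range(num_bins)] for _ in range(num_frames)]
reference = [[random.gauss(0, 1) for _ in range(num_bins)] for _ in range(num_frames)]

inputs = dual_channel_input(primary, reference, context)
in_dim = len(inputs[0])  # 2 channels * num_bins * (2 * context + 1)

# Untrained random weights: this shows the data flow, not a working enhancer.
w1 = [[random.gauss(0, 0.1) for _ in range(in_dim)] for _ in range(hidden)]
b1 = [0.0] * hidden
w2 = [[random.gauss(0, 0.1) for _ in range(hidden)] for _ in range(num_bins)]
b2 = [0.0] * num_bins

estimate = [dense(dense(x, w1, b1, True), w2, b2, False) for x in inputs]
print(len(inputs), in_dim, len(estimate[0]))  # prints 10 80 8
```

A recurrent alternative such as the GRU mentioned in the abstract would typically consume the joint per-frame features one frame at a time and carry context in its hidden state, rather than relying on an explicit stacked window.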
Pages: 6