Dual-channel DNN-based Speech Enhancement for Smartphones

被引:0
|
作者
Martin-Donas, Juan M. [1 ]
Gomez, Angel M. [1 ]
Lopez-Espejo, Ivan [2 ]
Peinado, Antonio M. [1 ]
机构
[1] Univ Granada, Dept Signal Theory Telemat & Commun, Granada, Spain
[2] VeriDas Das Nano, Navarra, Spain
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech communications in real-world scenarios need high performance enhancement algorithms to address the distortions that can degrade the intelligibility and quality of the speech signal. Current portable devices usually integrate multiple microphones that can conveniently be exploited to improve the signal quality. In this paper we present a dual-microphone speech enhancement approach suitable for smartphones with primary (front) and reference (back) microphones. Our proposal is based on the use of deep neural networks which are able to obtain a non-linear mapping function between noisy and clean speech signals. We explore two different architectures: a feedforward deep neural network (DNN) with temporal context and a gated recurrent unit (GRU) recurrent neural network (RNN). The proposed system is evaluated under different acoustic conditions in close-and far-talk device positions. A comparison with other single-and dual-channel approaches shows that our proposal obtains the best performance in terms of perceptual quality.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] DNN-BASED ENHANCEMENT OF NOISY AND REVERBERANT SPEECH
    Zhao, Yan
    Wang, DeLiang
    Merks, Ivo
    Zhang, Tao
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6525 - 6529
  • [2] Unscented Transform-Based Dual-Channel Noise Estimation: Application to Speech Enhancement on Smartphones
    Lopez-Espejo, Ivan
    Martin-Donas, Juan M.
    Gomez, Angel M.
    Peinado, Antonio M.
    [J]. 2018 41ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2018, : 88 - 91
  • [3] Dual-channel speech intelligibility enhancement based on the psychoacoustics
    Lee, Sang-Hoon
    Jeong, Hong
    [J]. LECTURE NOTES IN SIGNAL SCIENCE, INTERNET AND EDUCATION (SSIP'07/MIV'07/DIWEB'07), 2007, : 83 - +
  • [4] DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL
    Huang, Qizheng
    Bao, Changchun
    Wang, Xianyun
    Xiang, Yang
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 196 - 200
  • [5] DNN-Based Cepstral Excitation Manipulation for Speech Enhancement
    Elshamy, Samy
    Fingscheidt, Tim
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1803 - 1814
  • [6] DNN-Based Speech Enhancement via Integrating NMF and CASA
    Yan, Bofang
    Bao, Changchun
    Bai, Zhigang
    [J]. 2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 435 - 439
  • [7] DNN-Based Linear Prediction Residual Enhancement for Speech Dereverberation
    Feng, Xinyang
    Li, Nuo
    He, Zunwen
    Zhang, Yan
    Zhang, Wancheng
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 541 - 545
  • [8] Boosting DNN-Based Speech Enhancement via Explicit Transformations
    Wang, Qing
    Du, Jun
    Dai, Li-Rong
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [9] DNN-Based Calibrated-Filter Models for Speech Enhancement
    Yazid Attabi
    Benoit Champagne
    Wei-Ping Zhu
    [J]. Circuits, Systems, and Signal Processing, 2021, 40 : 2926 - 2949
  • [10] DNN-BASED AR-WIENER FILTERING FOR SPEECH ENHANCEMENT
    Yang, Yan
    Bao, Changchun
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2901 - 2905