Dual-channel DNN-based Speech Enhancement for Smartphones

Cited by: 0

Authors:

Martin-Donas, Juan M. [1]

Gomez, Angel M. [1]

Lopez-Espejo, Ivan [2]

Peinado, Antonio M. [1]

Affiliations:

[1] Univ Granada, Dept Signal Theory Telemat & Commun, Granada, Spain

[2] VeriDas Das Nano, Navarra, Spain

Source:

2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) | 2017

Keywords:

DOI:

Not available

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Speech communications in real-world scenarios need high-performance enhancement algorithms to address the distortions that can degrade the intelligibility and quality of the speech signal. Current portable devices usually integrate multiple microphones that can conveniently be exploited to improve the signal quality. In this paper we present a dual-microphone speech enhancement approach suitable for smartphones with primary (front) and reference (back) microphones. Our proposal is based on deep neural networks, which are able to obtain a non-linear mapping function between noisy and clean speech signals. We explore two different architectures: a feedforward deep neural network (DNN) with temporal context and a gated recurrent unit (GRU) recurrent neural network (RNN). The proposed system is evaluated under different acoustic conditions in close- and far-talk device positions. A comparison with other single- and dual-channel approaches shows that our proposal obtains the best performance in terms of perceptual quality.
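
As an illustration of what "dual-channel input with temporal context" can mean in practice, the sketch below joins the per-frame features of the two microphones and stacks neighbouring frames into one input vector per frame. It is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not specify the features, the context width, or the edge handling, so the function name `stack_dual_channel_context`, the `context` parameter, and the edge-frame repetition are all hypothetical choices.

```python
def stack_dual_channel_context(primary, reference, context=2):
    """Stack both channels and neighbouring frames into one vector per frame.

    primary, reference: lists of per-frame feature lists for the front and
    back microphones (hypothetical features, e.g. log-spectral values).
    context: number of frames taken on each side of the current frame
    (an assumed value; the abstract does not state the window size).
    """
    if len(primary) != len(reference):
        raise ValueError("both channels must have the same number of frames")
    if not primary:
        return []
    # Join the two channels frame by frame along the feature axis.
    joint = [list(p) + list(r) for p, r in zip(primary, reference)]
    last = len(joint) - 1
    stacked = []
    for t in range(len(joint)):
        row = []
        for offset in range(-context, context + 1):
            # Clamp the index so edge frames are repeated (assumed padding).
            idx = min(max(t + offset, 0), last)
            row.extend(joint[idx])
        stacked.append(row)
    return stacked


# Toy example: 3 frames, 2 features per channel, 1 context frame per side.
front = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
back = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
features = stack_dual_channel_context(front, back, context=1)
print(len(features), len(features[0]))
```

Each output row would then serve as the input to a feedforward network; a recurrent model such as a GRU would typically consume the per-frame joint vectors in sequence instead of an explicit context window.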


Pages: 6

