Dual-channel DNN-based Speech Enhancement for Smartphones

Cited by: 0

Authors:

Martin-Donas, Juan M. [1]

Gomez, Angel M. [1]

Lopez-Espejo, Ivan [2]

Peinado, Antonio M. [1]

Affiliations:

[1] Univ Granada, Dept Signal Theory Telemat & Commun, Granada, Spain

[2] VeriDas Das Nano, Navarra, Spain

Source:

2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) | 2017

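Keywords:

DOI:

Not available

Chinese Library Classification (CLC) code:

TP [Automation technology, computer technology]

Discipline classification code:

0812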

Abstract:

Speech communications in real-world scenarios need high-performance enhancement algorithms to address the distortions that can degrade the intelligibility and quality of the speech signal. Current portable devices usually integrate multiple microphones that can conveniently be exploited to improve the signal quality. In this paper, we present a dual-microphone speech enhancement approach suitable for smartphones with primary (front) and reference (back) microphones. Our proposal is based on deep neural networks, which are able to obtain a non-linear mapping function between noisy and clean speech signals. We explore two different architectures: a feedforward deep neural network (DNN) with temporal context and a gated recurrent unit (GRU) recurrent neural network (RNN). The proposed system is evaluated under different acoustic conditions in close- and far-talk device positions. A comparison with other single- and dual-channel approaches shows that our proposal obtains the best performance in terms of perceptual quality.
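The abstract does not give the feature type, context length, or network sizes, so the following is only a minimal sketch of how an input for a feedforward DNN "with temporal context" might be assembled from two microphone channels. The function names (`stack_context`, `dual_channel_input`), the feature dimension, and the context of two frames on each side are all assumptions made for illustration, not the authors' configuration.

```python
import numpy as np


def stack_context(features, context):
    """Stack each frame with `context` neighbours on both sides.

    features: array of shape (T, F), one row per frame.
    Returns an array of shape (T, (2 * context + 1) * F).
    Edge frames are repeated as padding (an assumption of this sketch).
    """
    padded = np.pad(features, ((context, context), (0, 0)), mode="edge")
    num_frames = features.shape[0]
    shifted = [padded[i:i + num_frames] for i in range(2 * context + 1)]
    return np.hstack(shifted)


def dual_channel_input(primary, reference, context=2):
    """Concatenate per-frame features of both microphones, then add context.

    primary, reference: arrays of shape (T, F) holding hypothetical
    per-frame spectral features of the front and back microphones.
    """
    frames = np.concatenate([primary, reference], axis=1)
    return stack_context(frames, context)


# Hypothetical data: 10 frames, 4 features per channel.
rng = np.random.default_rng(0)
primary = rng.standard_normal((10, 4))
reference = rng.standard_normal((10, 4))

x = dual_channel_input(primary, reference, context=2)
print(x.shape)
```

Each row of `x` would then be one input vector for the feedforward network. A recurrent model such as a GRU could instead consume the unstacked per-frame concatenation, since it carries temporal information in its hidden state.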

Pages: 6
