IMPROVEMENT OF SPEECH RESIDUALS FOR SPEECH ENHANCEMENT

被引：0

作者：

Elshamy, Samy ^{[1
]}

Fingscheidt, Tim ^{[1
]}

机构：

[1] Tech Univ Carolo Wilhelmina Braunschweig, Inst Commun Technol, Schleinitzstr 22, D-38106 Braunschweig, Germany

来源：

2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2019年

关键词：

a priori SNR; speech enhancement; deep learning; cepstrum; PRIORI SNR ESTIMATION; EXCITATION;

D O I：

10.1109/waspaa.2019.8937197

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this work we present two novel methods to improve speech residuals for speech enhancement. A deep neural network is used to enhance residual signals in the cepstral domain, thereby exceeding a former cepstral excitation manipulation (CEM) approach in different ways: One variant provides higher speech component quality by 0.1 PESQ points in low-SNR conditions, while another one delivers substantially higher noise attenuation by 1.5 dB, without loss of speech component quality or speech intelligibility. Compared to traditional speech enhancement based on the decision-directed (DD) a priori SNR estimation, a gain of even up to 3.5 dB noise attenuation is obtained. A semi-formal comparative category rating (CCR) subjective listening test confirms the superiority of the proposed approach over DD by 0.25 CMOS points (or even by 0.48 if two outlier subjects are not considered).

引用

页码：219 / 223

页数：5

共 50 条

[1] SNR Improvement with Speech Enhancement Techniques
Gala, D. R.
Misra, V. M.
[J]. 2010 INTERNATIONAL CONFERENCE ON COMMUNICATION AND VEHICULAR TECHNOLOGY (ICCVT 2010), VOL I, 2010, : 26 - 29
[2] Speech enhancement for bandlimited speech
Heide, DA
Kang, GS
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 393 - 396
[3] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
Huan-Yu Dong
Chang-Myung Lee
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2018
[4] On Improvement of Speech Intelligibility and Quality: A Survey of Unsupervised Single Channel Speech Enhancement Algorithms
Saleem, Nasir
Khattak, Muhammad Irfan
Verdu, Elena
[J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2020, 6 (02): : 78 - 89
[5] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
Dong, Huan-Yu
Lee, Chang-Myung
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
[6] Speech enhancement based on perceptual filter bank improvement
Alaya, Sana
Zoghlami, Novlene
Lachiri, Zied
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (03) : 253 - 258
[7] Speech Practice Book: For Speech Improvement and Speech Correction
Frye, Agnes M.
[J]. JOURNAL OF SPEECH AND HEARING DISORDERS, 1954, 19 (01): : 88 - 88
[8] SPEECH PRACTICE BOOK FOR SPEECH IMPROVEMENT AND SPEECH CORRECTION
Manser, Ruth B.
[J]. QUARTERLY JOURNAL OF SPEECH, 1955, 41 (01) : 88 - 89
[9] Speech Improvement
Harding, H. F.
[J]. EDUCATIONAL RESEARCH BULLETIN, 1954, 33 (04): : 109 - 109
[10] SPEECH IMPROVEMENT
Gaylord, J. S.
[J]. QUARTERLY JOURNAL OF SPEECH EDUCATION, 1919, 5 (04): : 358 - 367

← 1 2 3 4 5 →