IMPROVEMENT OF SPEECH RESIDUALS FOR SPEECH ENHANCEMENT

被引:0
|
作者
Elshamy, Samy [1 ]
Fingscheidt, Tim [1 ]
机构
[1] Tech Univ Carolo Wilhelmina Braunschweig, Inst Commun Technol, Schleinitzstr 22, D-38106 Braunschweig, Germany
关键词
a priori SNR; speech enhancement; deep learning; cepstrum; PRIORI SNR ESTIMATION; EXCITATION;
D O I
10.1109/waspaa.2019.8937197
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work we present two novel methods to improve speech residuals for speech enhancement. A deep neural network is used to enhance residual signals in the cepstral domain, thereby exceeding a former cepstral excitation manipulation (CEM) approach in different ways: One variant provides higher speech component quality by 0.1 PESQ points in low-SNR conditions, while another one delivers substantially higher noise attenuation by 1.5 dB, without loss of speech component quality or speech intelligibility. Compared to traditional speech enhancement based on the decision-directed (DD) a priori SNR estimation, a gain of even up to 3.5 dB noise attenuation is obtained. A semi-formal comparative category rating (CCR) subjective listening test confirms the superiority of the proposed approach over DD by 0.25 CMOS points (or even by 0.48 if two outlier subjects are not considered).
引用
收藏
页码:219 / 223
页数:5
相关论文
共 50 条
  • [1] SNR Improvement with Speech Enhancement Techniques
    Gala, D. R.
    Misra, V. M.
    [J]. 2010 INTERNATIONAL CONFERENCE ON COMMUNICATION AND VEHICULAR TECHNOLOGY (ICCVT 2010), VOL I, 2010, : 26 - 29
  • [2] Speech enhancement for bandlimited speech
    Heide, DA
    Kang, GS
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 393 - 396
  • [3] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
    Huan-Yu Dong
    Chang-Myung Lee
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [4] On Improvement of Speech Intelligibility and Quality: A Survey of Unsupervised Single Channel Speech Enhancement Algorithms
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Verdu, Elena
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2020, 6 (02): : 78 - 89
  • [5] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
    Dong, Huan-Yu
    Lee, Chang-Myung
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
  • [6] Speech enhancement based on perceptual filter bank improvement
    Alaya, Sana
    Zoghlami, Novlene
    Lachiri, Zied
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (03) : 253 - 258
  • [7] Speech Practice Book: For Speech Improvement and Speech Correction
    Frye, Agnes M.
    [J]. JOURNAL OF SPEECH AND HEARING DISORDERS, 1954, 19 (01): : 88 - 88
  • [8] SPEECH PRACTICE BOOK FOR SPEECH IMPROVEMENT AND SPEECH CORRECTION
    Manser, Ruth B.
    [J]. QUARTERLY JOURNAL OF SPEECH, 1955, 41 (01) : 88 - 89
  • [9] Speech Improvement
    Harding, H. F.
    [J]. EDUCATIONAL RESEARCH BULLETIN, 1954, 33 (04): : 109 - 109
  • [10] SPEECH IMPROVEMENT
    Gaylord, J. S.
    [J]. QUARTERLY JOURNAL OF SPEECH EDUCATION, 1919, 5 (04): : 358 - 367