Speech enhancement using linear prediction residual

被引:61
|
作者
Yegnanarayana, B [1 ]
Avendano, C
Hermansky, H
Murthy, PS
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Madras 600036, Tamil Nadu, India
[2] Oregon Grad Inst Sci & Technol, Dept Elect Engn, Portland, OR USA
[3] Indian Inst Technol, Dept Elect Engn, Madras 600036, Tamil Nadu, India
关键词
speech enhancement; linear prediction residual signal;
D O I
10.1016/S0167-6393(98)00070-3
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose a method for enhancement of speech in the presence of additive noise. The objective is to selectively enhance the high signal-to-noise ratio (SNR) regions in the noisy speech in the temporal and spectral domains, without causing significant distortion in the resulting enhanced speech. This is proposed to be done at three different levels. (a) At the gross level, by identifying the regions of speech and noise in the temporal domain. (b) At the finer level, by identifying the regions of high and low SNR portions in the noisy speech. (c) At the short-time spectrum level, by enhancing the spectral peaks over spectral valleys. The basis for the proposed approach is to analyze linear prediction (LP) residual signal in short (1-2 ms) segments to determine whether a segment belongs to a noise region or speech region. In the speech regions the inverse spectral flatness factor is significantly higher than in the noisy regions. The LP residual signal enables us to deal with short segments of data due to uncorrelatedness of the samples. Processing of noisy speech for enhancement involves mostly weighting the LP residual signal samples. The weighted residual signal samples are used to excite the time-varying all-pole filter to produce enhanced speech. As the additive noise level in the speech signal is increased, the quality of the resulting enhanced speech decreases progressively due to loss of speech information in the low SNR, high noise regions. Thus the degradation in performance of enhancement is graceful as the overall SNR of the noisy speech is decreased. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:25 / 42
页数:18
相关论文
共 50 条
  • [1] DNN-Based Linear Prediction Residual Enhancement for Speech Dereverberation
    Feng, Xinyang
    Li, Nuo
    He, Zunwen
    Zhang, Yan
    Zhang, Wancheng
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 541 - 545
  • [2] ENHANCEMENT OF SPEECH IN ADDITIVE, LOCALLY STATIONARY AND COLORED NOISE, USING LINEAR PREDICTION
    YARMANVURAL, FT
    [J]. SIGNAL PROCESSING, 1990, 20 (03) : 211 - 217
  • [3] Enhancement of reverberant speech using LP residual
    Yegnanarayana, B
    Murthy, PS
    Avendano, C
    Hermansky, H
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 405 - 408
  • [4] Phase Modeling using Integrated Linear Prediction Residual for Statistical Parametric Speech Synthesis
    Adiga, Nagaraj
    Prasanna, S. R. M.
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3981 - 3985
  • [5] Enhancement of reverberant speech using LP residual signal
    Yegnanarayana, B
    Murthy, PS
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (03): : 267 - 281
  • [6] An intelligibility enhancement for the mixed excitation linear prediction speech coder
    Chong-White, NR
    Cox, RV
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (09) : 263 - 266
  • [7] Speech enhancement using a modified Kalman Filter based on complex linear prediction and supergaussian priors
    Esch, Thomas
    Vary, Peter
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4877 - 4880
  • [8] Pitch synchronous modulated lapped transform of the linear prediction residual of speech
    Tsinghua Univ, Beijing, China
    [J]. Int Conf Signal Process Proc, (591-594):
  • [9] Pitch synchronous modulated lapped transform of the linear prediction residual of speech
    Yang, HM
    Kleijn, WB
    Deprettere, EF
    Chen, HY
    [J]. ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 591 - 594
  • [10] HELIUM SPEECH PROCESSOR USING LINEAR PREDICTION
    BEET, SW
    GOODYEAR, CC
    [J]. ELECTRONICS LETTERS, 1983, 19 (11) : 408 - 410