ON ROBUST LINEAR PREDICTION OF SPEECH

Cited by: 58
Authors
LEE, CH
Affiliations
[1] DIGITAL SOUND CORP,SANTA BARBARA,CA
[2] VERBEX CORP,BEDFORD,MA
Keywords
SIGNAL FILTERING AND PREDICTION - Mathematical Models;
DOI
10.1109/29.1574
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
A robust linear prediction (LP) algorithm is proposed that minimizes the sum of appropriately weighted residuals. The weight is a function of the prediction residual, and the cost function is selected to give more weight to the bulk of small residuals while deemphasizing the small portion of large residuals. In contrast, the conventional LP procedure weights all prediction residuals equally. The robust algorithm takes into account the non-Gaussian nature of the excitation for voiced speech and gives a more efficient (lower-variance) and less biased estimate of the prediction coefficients than conventional methods. The algorithm can be used in the front-end feature extractor of a speech recognition system and as an analyzer in a speech coding system. Testing on synthetic vowel data demonstrates that the robust LP procedure reduces the formant and bandwidth error rate by more than an order of magnitude compared with conventional LP procedures, and that, for a given section of speech signal, it is relatively insensitive to the placement of the LPC (LP coding) analysis window and to the value of the pitch period.
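The idea of residual-dependent weighting described in the abstract can be illustrated with a minimal sketch. The record does not state the paper's actual cost function, scale estimate, or solver, so everything specific here is an assumption: the sketch uses Huber-style weights, a median-absolute-deviation scale, and iteratively reweighted least squares on the covariance-method regression. The names `lp_design`, `huber_weights`, and `robust_lp`, and the defaults `c=1.5` and `n_iter=20`, are hypothetical.

```python
import numpy as np


def lp_design(x, order):
    """Covariance-method regression: x[n] ~ sum_k a[k] * x[n-1-k]."""
    n = len(x)
    # Column k holds the signal delayed by k+1 samples.
    X = np.column_stack([x[order - 1 - k : n - 1 - k] for k in range(order)])
    y = x[order:]
    return X, y


def huber_weights(r, c):
    """Weight 1 for small normalized residuals, c/|r| for large ones (assumed form)."""
    a = np.abs(r)
    return np.where(a <= c, 1.0, c / np.maximum(a, 1e-12))


def robust_lp(x, order, n_iter=20, c=1.5):
    """Iteratively reweighted least-squares estimate of LP coefficients."""
    X, y = lp_design(np.asarray(x, dtype=float), order)
    # Start from the conventional (equally weighted) LP solution.
    a = np.linalg.lstsq(X, y, rcond=None)[0]
    for _ in range(n_iter):
        r = y - X @ a
        # Robust scale estimate from the median absolute deviation.
        scale = np.median(np.abs(r - np.median(r))) / 0.6745 + 1e-12
        sw = np.sqrt(huber_weights(r / scale, c))
        # Weighted least squares: large residuals (e.g. pitch pulses) count less.
        a = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)[0]
    return a
```

Calling `robust_lp(frame, order=10)` on a one-dimensional analysis frame returns the predictor coefficients `a[0..order-1]`. With all weights fixed at 1 the loop reduces to the conventional covariance-method LP solution, which is the equal-weighting baseline the abstract contrasts against.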
Pages: 642-650
Page count: 9