Stabilised Weighted Linear Prediction - A Robust All-Pole Method for Speech Processing

被引:0
|
作者
Magi, Carlo [1 ]
Backstrom, Tom [1 ]
Alku, Paavo [1 ]
机构
[1] Helsinki Univ Technol, Lab Acoust & Audio Signal Proc, FI-02015 Helsinki, Finland
关键词
linear prediction; all-pole modelling; spectral estimation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weighted linear prediction (WLP) is a method to compute all-pole models of speech by applying temporal weighting of the residual energy. By using short-time energy (STE) as a weighting function, the algorithm over-weight those samples that fit the underlying speech production model well. The current work introduces a modified WLP method, stabilised weighted linear prediction (SWLP) leading always to stable all-pole models whose performance can be adjusted by changing the length (denoted by M) of the STE window. With a large M value, the SWLP spectra become similar to conventional LP spectra. A small value of M results in SWLP filters similar to those computed by the minimum variance distortionless response (MVDR) method. ne study compares the performances of SWLP, MVDR, and conventional LP in spectral modelling of speech sounds corrupted by Gaussian additive white noise. Results indicate that SWLP is the most robust method against noise especially with a small M value.
引用
收藏
页码:253 / 256
页数:4
相关论文
共 50 条
  • [42] Weighted Linear Prediction for Speech Analysis in Noisy Conditions
    Pohjalainen, Jouni
    Kallasjoki, Heikki
    Palomaki, Kalle J.
    Kurimo, Mikko
    Alku, Paavo
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1347 - +
  • [43] Base on All-pole Approximation a New Internal Model PID Control Method for the System with Time Delays
    Jin Qibing
    Quan Ling
    Wang Xuewei
    Qi Fei
    2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 268 - 273
  • [44] Robust estimate for linear prediction parameters of speech signal
    Jiang, Taihui
    Yao, Tianren
    1996, (24):
  • [45] IMPROVED NORMALIZING FLOW-BASED SPEECH ENHANCEMENT USING AN ALL-POLE GAMMATONE FILTERBANK FOR CONDITIONAL INPUT REPRESENTATION
    Strauss, Martin
    Torcoli, Matteo
    Edler, Bernd
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 444 - 450
  • [46] LINEAR PREDICTION ANALYSIS OF SPEECH BASED ON A POLE-ZERO REPRESENTATION
    ATAL, BS
    SCHROEDER, MR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 58 : S96 - S96
  • [47] LINEAR PREDICTION ANALYSIS OF SPEECH BASED ON A POLE-ZERO REPRESENTATION
    ATAL, BS
    SCHROEDER, MR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 (05): : 1310 - 1318
  • [48] Sparse Linear Prediction and Its Applications to Speech Processing
    Giacobello, Daniele
    Christensen, Mads Graesboll
    Murthi, Manohar N.
    Jensen, Soren Holdt
    Moonen, Marc
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (05): : 1644 - 1657
  • [49] Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model
    Ali, Zulfiqar
    Elamvazuthi, Irraivan
    Alsulaiman, Mansour
    Muhammad, Ghulam
    JOURNAL OF VOICE, 2016, 30 (06) : 757.e7 - 757.e19
  • [50] Digraphs Structures Corresponding to Minimal Realisation of Fractional Continuous-Time Linear Systems with All-Pole and All-Zero Transfer Function
    Markowski, Konrad Andrzej
    PROCEEDING OF 2016 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR), 2016, : 423 - 428