Noise-robust speech analysis using system identification methods

被引:0
|
作者
Arima, Y [1 ]
Shimamura, T [1 ]
机构
[1] Saitama Univ, Dept Informat & Comp Sci, Urawa, Saitama 3388570, Japan
关键词
linear prediction; all-pole filter; system identification; input estimation;
D O I
10.1002/ecjc.1137
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a modified linear prediction method for speech analysis, using two system identification methods-the least-square method and the instrument variable method-for the estimation of the coefficients of an all-pole filter. Whereas the linear prediction method estimates the coefficients of all-pole filters from speech signals, which are observed output signals, the system identification method estimates coefficients of all-pole filters from observed output signals and the input signals. This paper derives a novel technique that estimates input signals from speech signals that are observed output signals with a high degree of accuracy and robustness with respect to added noise, by generating improved prediction error signals. The paper also shows that when voiced speech is to be analyzed, if input signals, which are an impulse chain, can be accurately estimated, the estimation of filter coefficients can yield a high degree of accuracy provided that the least-square method is used, and that in this manner, the pitch period dependency can be removed. We also show that by applying the instrument variable method using an auxiliary model, the accuracy of estimation of filter coefficients in a noisy environment can be substantially improved while maintaining the properties of the least-square method. The effectiveness of these system identification methods for speech analysis is demonstrated through computer simulations. (C) 2002 Wiley Periodicals, Inc.
引用
收藏
页码:20 / 32
页数:13
相关论文
共 50 条
  • [41] An improved algorithm for noise-robust sparse linear prediction of speech
    Zhou, Bin
    Zou, Xia
    Zhang, Xiongwei
    [J]. Shengxue Xuebao/Acta Acustica, 2014, 39 (05): : 655 - 662
  • [42] NOISE-ROBUST DETECTION OF PEAK-CLIPPING IN DECODED SPEECH
    Eaton, James
    Naylor, Patrick A.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [43] Empirical Mode Decomposition For Noise-Robust Automatic Speech Recognition
    Wu, Kuo-Hao
    Chen, Chia-Ping
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2074 - 2077
  • [44] MULTI-TASK AUTOENCODER FOR NOISE-ROBUST SPEECH RECOGNITION
    Zhang, Haoyi
    Liu, Conggui
    Inoue, Nakamasa
    Shinoda, Koichi
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5599 - 5603
  • [45] Noise-Robust Speech Recognition Based on RBF Neural Network
    Hou, Xuemei
    [J]. HIGH PERFORMANCE STRUCTURES AND MATERIALS ENGINEERING, PTS 1 AND 2, 2011, 217-218 : 413 - 418
  • [46] INCORPORATING MASK MODELLING FOR NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION
    Koekueer, Muenevver
    Jancovic, Peter
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3929 - 3932
  • [47] An improved algorithm for noise-robust sparse linear prediction of speech
    ZHOU Bin
    ZOU Xia
    ZHANG Xiongwei
    [J]. Chinese Journal of Acoustics, 2015, 34 (01) : 84 - 95
  • [48] Unsupervised modulation filter learning for noise-robust speech recognition
    Agrawal, Purvi
    Ganapathy, Sriram
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : 1686 - 1692
  • [49] Noise-robust speech feature processing with empirical mode decomposition
    Wu, Kuo-Hau
    Chen, Chia-Ping
    Yeh, Bing-Feng
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 9
  • [50] A noise-robust speech input interface for information kiosk terminals
    Ida, M
    Mori, H
    Nakamura, S
    Shikano, K
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2004, 87 (12): : 51 - 61