Noise-robust speech analysis using system identification methods

被引：0

作者：

Arima, Y ^{[1
]}

Shimamura, T ^{[1
]}

机构：

[1] Saitama Univ, Dept Informat & Comp Sci, Urawa, Saitama 3388570, Japan

来源：

ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE | 2003年 / 86卷 / 03期

关键词：

linear prediction; all-pole filter; system identification; input estimation;

D O I：

10.1002/ecjc.1137

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper proposes a modified linear prediction method for speech analysis, using two system identification methods-the least-square method and the instrument variable method-for the estimation of the coefficients of an all-pole filter. Whereas the linear prediction method estimates the coefficients of all-pole filters from speech signals, which are observed output signals, the system identification method estimates coefficients of all-pole filters from observed output signals and the input signals. This paper derives a novel technique that estimates input signals from speech signals that are observed output signals with a high degree of accuracy and robustness with respect to added noise, by generating improved prediction error signals. The paper also shows that when voiced speech is to be analyzed, if input signals, which are an impulse chain, can be accurately estimated, the estimation of filter coefficients can yield a high degree of accuracy provided that the least-square method is used, and that in this manner, the pitch period dependency can be removed. We also show that by applying the instrument variable method using an auxiliary model, the accuracy of estimation of filter coefficients in a noisy environment can be substantially improved while maintaining the properties of the least-square method. The effectiveness of these system identification methods for speech analysis is demonstrated through computer simulations. (C) 2002 Wiley Periodicals, Inc.

引用

页码：20 / 32

页数：13

共 50 条

[41] An improved algorithm for noise-robust sparse linear prediction of speech
Zhou, Bin
Zou, Xia
Zhang, Xiongwei
[J]. Shengxue Xuebao/Acta Acustica, 2014, 39 (05): : 655 - 662
[42] NOISE-ROBUST DETECTION OF PEAK-CLIPPING IN DECODED SPEECH
Eaton, James
Naylor, Patrick A.
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[43] Empirical Mode Decomposition For Noise-Robust Automatic Speech Recognition
Wu, Kuo-Hao
Chen, Chia-Ping
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2074 - 2077
[44] MULTI-TASK AUTOENCODER FOR NOISE-ROBUST SPEECH RECOGNITION
Zhang, Haoyi
Liu, Conggui
Inoue, Nakamasa
Shinoda, Koichi
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5599 - 5603
[45] Noise-Robust Speech Recognition Based on RBF Neural Network
Hou, Xuemei
[J]. HIGH PERFORMANCE STRUCTURES AND MATERIALS ENGINEERING, PTS 1 AND 2, 2011, 217-218 : 413 - 418
[46] INCORPORATING MASK MODELLING FOR NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION
Koekueer, Muenevver
Jancovic, Peter
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3929 - 3932
[47] An improved algorithm for noise-robust sparse linear prediction of speech
ZHOU Bin
ZOU Xia
ZHANG Xiongwei
[J]. Chinese Journal of Acoustics, 2015, 34 (01) : 84 - 95
[48] Unsupervised modulation filter learning for noise-robust speech recognition
Agrawal, Purvi
Ganapathy, Sriram
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : 1686 - 1692
[49] Noise-robust speech feature processing with empirical mode decomposition
Wu, Kuo-Hau
Chen, Chia-Ping
Yeh, Bing-Feng
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 9
[50] A noise-robust speech input interface for information kiosk terminals
Ida, M
Mori, H
Nakamura, S
Shikano, K
[J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2004, 87 (12): : 51 - 61

← 1 2 3 4 5 →