Target Speech GMM-based Spectral Compensation for Noise Robust Speech Recognition

Cited by: 0

Authors:

Shinozaki, Takahiro [1]

Furui, Sadaoki [1]

Affiliations:

[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 152, Japan

Source:

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009

Keywords:

noisy speech recognition; spectrum; Gaussian mixture model;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

To improve speech recognition performance in adverse conditions, a noise compensation method is proposed that applies a transformation in the spectral domain, with parameters optimized according to the likelihood of a speech GMM modeled in the feature domain. The idea is that additive and convolutional noise have mathematically simple expressions in the spectral domain, while speech characteristics are better modeled in a feature domain such as MFCC. The proposed method works as a feature extraction front-end that is independent of the decoding engine, and it can compensate for non-stationary additive and convolutional noise with a short time delay. It includes spectral subtraction as a special case when no parameter optimization is performed. Experiments were performed using the AURORA-2J database. The proposed method was shown to achieve significantly higher recognition performance than spectral subtraction.
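The abstract gives no formulas, so the following is only a toy sketch of the general idea it describes: transform in the spectral domain, score in a feature domain with a speech GMM, and keep the transform parameter with the highest likelihood. Everything concrete here is an assumption made for illustration and is not taken from the paper: the single subtraction weight `alpha`, the grid search, the use of a plain log power spectrum in place of MFCC, and all function names.

```python
import numpy as np

def spectral_subtract(power_spec, noise_power, alpha=1.0, floor=0.01):
    """Subtract alpha * noise estimate per bin, with a spectral floor."""
    cleaned = power_spec - alpha * noise_power
    return np.maximum(cleaned, floor * power_spec)

def log_features(power_spec):
    """Hypothetical stand-in for an MFCC front-end: log power spectrum."""
    return np.log(power_spec)

def gmm_loglik(feats, weights, means, variances):
    """Total log-likelihood of frames under a diagonal-covariance GMM.

    feats: (T, D); weights: (K,); means, variances: (K, D).
    """
    diff = feats[:, None, :] - means[None, :, :]
    log_comp = -0.5 * np.sum(
        diff ** 2 / variances + np.log(2 * np.pi * variances), axis=2
    )
    log_comp = log_comp + np.log(weights)
    # Log-sum-exp over mixture components for numerical stability.
    peak = log_comp.max(axis=1, keepdims=True)
    per_frame = peak[:, 0] + np.log(np.sum(np.exp(log_comp - peak), axis=1))
    return float(np.sum(per_frame))

def compensate(power_spec, noise_power, gmm,
               alphas=(0.0, 0.5, 1.0, 1.5, 2.0)):
    """Pick the subtraction weight that maximizes the speech-GMM likelihood.

    Assumed toy optimizer: an exhaustive search over a small grid.
    """
    def score(alpha):
        cleaned = spectral_subtract(power_spec, noise_power, alpha)
        return gmm_loglik(log_features(cleaned), *gmm)

    best = max(alphas, key=score)
    return best, spectral_subtract(power_spec, noise_power, best)
```

In this sketch, calling `spectral_subtract` with a fixed `alpha=1.0` and skipping `compensate` is plain spectral subtraction, which mirrors the abstract's remark that spectral subtraction is the special case with no parameter optimization.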


Pages: 1223-1226

Number of pages: 4

