Target Speech GMM-based Spectral Compensation for Noise Robust Speech Recognition

被引:0
|
作者
Shinozaki, Takahiro [1 ]
Furui, Sadaoki [1 ]
机构
[1] Tokyo Inst Technol, Dept Comp Sci, Tokyo 152, Japan
关键词
noisy speech recognition; spectrum; Gaussian mixture model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve speech recognition performance in adverse conditions, a noise compensation method is proposed that applies a transformation in the spectral domain whose parameters are optimized based on likelihood of speech GMM modeled on the feature domain. The idea is that additive and convolutional noises have mathematically simple expression in the spectral domain while speech characteristics are better modeled in the feature domain such as MFCC. The proposed method works as a feature extraction front-end that is independent from decoding engine, and has ability to compensate for non-stationary additive and convolutional noises with a short time delay. It includes spectral subtraction as a special case when no parameter optimization is performed. Experiments were performed using the AURORA-2J database. It ha; been shown that significantly higher recognition performance is obtained by the proposed method than spectral subtraction.
引用
收藏
页码:1223 / 1226
页数:4
相关论文
共 50 条
  • [21] Compensation of speech enhancement distortion for robust speech recognition
    Ding, P
    Cao, ZG
    [J]. 2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 449 - 452
  • [22] Experiment with GMM-Based Artefact Localization in Czech Synthetic Speech
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 23 - 31
  • [23] Multi-frame GMM-based block quantisation of line spectral frequencies for wideband speech
    So, S
    Paliwal, KK
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 121 - 124
  • [24] Robust telephone speech recognition based on channel compensation
    Han, JQ
    Gao, W
    [J]. PATTERN RECOGNITION, 1999, 32 (06) : 1061 - 1067
  • [25] Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition
    Gonzalez, Jose A.
    Gomez, Angel M.
    Peinado, Antonio M.
    Ma, Ning
    Barker, Jon
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (09) : 3731 - 3760
  • [26] Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition
    Jose A. Gonzalez
    Angel M. Gómez
    Antonio M. Peinado
    Ning Ma
    Jon Barker
    [J]. Circuits, Systems, and Signal Processing, 2017, 36 : 3731 - 3760
  • [27] Non-intrusive GMM-based speech quality measurement
    Falk, TH
    Xu, QF
    Chan, WY
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 125 - 128
  • [28] Adaptive compensation for robust speech recognition
    Lee, CH
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 357 - 364
  • [29] GMM supervector based SVM with spectral features for speech emotion recognition
    Hu, Hao
    Xu, Ming-Xing
    Wu, Wei
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 413 - +
  • [30] Psychoacoustic Model Compensation for Robust Continuous Speech Recognition in Additive Noise
    Das, Biswajit
    Panda, Ashish
    [J]. 2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 511 - 515