Robust Feature Extraction for Speech Recognition Based on Perceptually Motivated MUSIC and CCBC

被引:0
|
作者
Han Zhiyan [1 ]
Wang Jian [1 ]
Wang Xu [2 ]
Lun Shuxian [1 ]
机构
[1] Bohai Univ, Coll Informat Sci & Engn, Jinzhou 121000, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Peoples R China
来源
CHINESE JOURNAL OF ELECTRONICS | 2011年 / 20卷 / 01期
关键词
Speech recognition; Multiple signal classification (MUSIC); Canonical correlation based on compensation (CCBC); Feature extraction; SPECTRUM ESTIMATION; MVDR; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A novel feature extraction algorithm was proposed to improve the robustness of speech recognition. Core technology was incorporating perceptual information into the Multiple signal classification (MUSIC) spectrum, it provided improved robustness and computational efficiency comparing with the Mel frequency cepstral coefficient (MFCC) technique, then the cepstrum coefficients were extracted as the feature parameter. The effectiveness of the parameter was discussed in view of the class separability and speaker variability properties. To improve the robustness, we considered incorporating Canonical correlation based compensation (CCBC) to cope with the mismatch between training and test set. We evaluated the technique using improved Back-propagation neural networks (BPNN) in three different tasks: in different speakers, different recording channels and different noisy environments. The experimental results show that the novel feature has well robustness and effectiveness relative to MFCC and the CCBC algorithm can make speech recognition system robust in all three kinds of mismatch.
引用
收藏
页码:105 / 110
页数:6
相关论文
共 50 条
  • [31] A bio-inspired feature extraction for robust speech recognition
    Zouhir, Youssef
    Ouni, Kais
    SPRINGERPLUS, 2014, 3
  • [32] Temporal modulation normalization for robust speech feature extraction and recognition
    Lu, Xugang
    Matsuda, Shigeki
    Unoki, Masashi
    Nakamura, Satoshi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2011, 52 (01) : 187 - 199
  • [33] Temporal modulation normalization for robust speech feature extraction and recognition
    Xugang Lu
    Shigeki Matsuda
    Masashi Unoki
    Satoshi Nakamura
    Multimedia Tools and Applications, 2011, 52 : 187 - 199
  • [34] A Correlational Discriminant Approach to Feature Extraction for Robust Speech Recognition
    Tomar, Vikrant Singh
    Rose, Richard C.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 554 - 557
  • [35] An auditory neural feature extraction method for robust speech recognition
    Guo, Wei
    Zhang, Liqing
    Xia, Bin
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 793 - +
  • [36] A robust feature extraction for automatic speech recognition in noisy environments
    Lima, C
    Almeida, LB
    Monteiro, JL
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 540 - 543
  • [37] Study of Robust Feature Extraction Techniques for Speech Recognition System
    Sharma, Usha
    Maheshkar, Sushila
    Mishra, A. N.
    2015 1ST INTERNATIONAL CONFERENCE ON FUTURISTIC TRENDS ON COMPUTATIONAL ANALYSIS AND KNOWLEDGE MANAGEMENT (ABLAZE), 2015, : 666 - 670
  • [38] Temporal modulation normalization for robust speech feature extraction and recognition
    Lu, Xugang
    Matsuda, Shigeki
    Unoki, Masashi
    Nakamura, Satoshi
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4354 - 4357
  • [39] Robust Feature Extraction for Speech Recognition by Enhancing Auditory Spectrum
    Alam, Md Jahangir
    Kenny, Patrick
    O'Shaughnessy, Douglas
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1358 - 1361
  • [40] Fusion Feature Extraction Based on Auditory and Energy for Noise-Robust Speech Recognition
    Shi, Yanyan
    Bai, Jing
    Xue, Peiyun
    Shi, Dianxi
    IEEE ACCESS, 2019, 7 : 81911 - 81922