Robust Feature Extraction for Speech Recognition Based on Perceptually Motivated MUSIC and CCBC

被引:0
|
作者
Han Zhiyan [1 ]
Wang Jian [1 ]
Wang Xu [2 ]
Lun Shuxian [1 ]
机构
[1] Bohai Univ, Coll Informat Sci & Engn, Jinzhou 121000, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Peoples R China
来源
CHINESE JOURNAL OF ELECTRONICS | 2011年 / 20卷 / 01期
关键词
Speech recognition; Multiple signal classification (MUSIC); Canonical correlation based on compensation (CCBC); Feature extraction; SPECTRUM ESTIMATION; MVDR; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A novel feature extraction algorithm was proposed to improve the robustness of speech recognition. Core technology was incorporating perceptual information into the Multiple signal classification (MUSIC) spectrum, it provided improved robustness and computational efficiency comparing with the Mel frequency cepstral coefficient (MFCC) technique, then the cepstrum coefficients were extracted as the feature parameter. The effectiveness of the parameter was discussed in view of the class separability and speaker variability properties. To improve the robustness, we considered incorporating Canonical correlation based compensation (CCBC) to cope with the mismatch between training and test set. We evaluated the technique using improved Back-propagation neural networks (BPNN) in three different tasks: in different speakers, different recording channels and different noisy environments. The experimental results show that the novel feature has well robustness and effectiveness relative to MFCC and the CCBC algorithm can make speech recognition system robust in all three kinds of mismatch.
引用
收藏
页码:105 / 110
页数:6
相关论文
共 50 条
  • [21] Combining speech enhancement and auditory feature extraction for robust speech recognition
    Kleinschmidt, M
    Tchorz, J
    Kollmeier, B
    SPEECH COMMUNICATION, 2001, 34 (1-2) : 75 - 91
  • [22] A robust feature extraction method based on CZCPA model for speech recognition system
    Zhang, XY
    Jiao, ZP
    Zhao, SY
    ICEMI 2005: Conference Proceedings of the Seventh International Conference on Electronic Measurement & Instruments, Vol 3, 2005, : 89 - 92
  • [23] A robust feature extraction based on the MTF concept for speech recognition in reverberant environment
    Lu, Xugang
    Unoki, Masashi
    Akagi, Masato
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2546 - 2549
  • [24] Feature Extraction Based on DCT and MVDR Spectral Estimation for Robust Speech Recognition
    Seyedin, Sanaz
    Ahadi, Mohammad
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 605 - 608
  • [25] Robust feature extraction for mobile-based speech emotion recognition system
    Lee, Kang-Kue
    Cho, Youn-Ho
    Park, Kyu-Sik
    INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 470 - 477
  • [26] PSYCHOACOUSTICAL MASKING EFFECT-BASED FEATURE EXTRACTION FOR ROBUST SPEECH RECOGNITION
    Naing, Hay Mar Soe
    Hidayat, Risanuri
    Winduratna, Bondhan
    Miyanaga, Yoshikazu
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2019, 15 (05): : 1641 - 1654
  • [27] Feature Extraction Based on Pitch-Synchronous Averaging for Robust Speech Recognition
    Morales-Cordovilla, Juan A.
    Peinado, Antonio M.
    Sanchez, Victoria
    Gonzalez, Jose A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03): : 640 - 651
  • [28] Robust speech detection in real acoustic backgrounds with perceptually motivated features
    Bach, Joerg-Hendrik
    Anemueller, Joern
    Kollmeier, Birger
    SPEECH COMMUNICATION, 2011, 53 (05) : 690 - 706
  • [29] Perceptually Motivated Linear Prediction Cepstral Features for Network Speech Recognition
    Alatwi, Aadel
    So, Stephen
    Paliwal, Kuldip K.
    2016 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2016,
  • [30] Robust Feature Extraction Methods for Speech Recognition in Noisy Environments
    Mukheolkar, Ajinkya Sunil
    Alex, John Sahaya Rani
    2014 FIRST INTERNATIONAL CONFERENCE ON NETWORKS & SOFT COMPUTING (ICNSC), 2014, : 295 - 299