Robust Feature Extraction for Speech Recognition Based on Perceptually Motivated MUSIC and CCBC

被引:0
|
作者
Han Zhiyan [1 ]
Wang Jian [1 ]
Wang Xu [2 ]
Lun Shuxian [1 ]
机构
[1] Bohai Univ, Coll Informat Sci & Engn, Jinzhou 121000, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Peoples R China
来源
CHINESE JOURNAL OF ELECTRONICS | 2011年 / 20卷 / 01期
关键词
Speech recognition; Multiple signal classification (MUSIC); Canonical correlation based on compensation (CCBC); Feature extraction; SPECTRUM ESTIMATION; MVDR; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A novel feature extraction algorithm was proposed to improve the robustness of speech recognition. Core technology was incorporating perceptual information into the Multiple signal classification (MUSIC) spectrum, it provided improved robustness and computational efficiency comparing with the Mel frequency cepstral coefficient (MFCC) technique, then the cepstrum coefficients were extracted as the feature parameter. The effectiveness of the parameter was discussed in view of the class separability and speaker variability properties. To improve the robustness, we considered incorporating Canonical correlation based compensation (CCBC) to cope with the mismatch between training and test set. We evaluated the technique using improved Back-propagation neural networks (BPNN) in three different tasks: in different speakers, different recording channels and different noisy environments. The experimental results show that the novel feature has well robustness and effectiveness relative to MFCC and the CCBC algorithm can make speech recognition system robust in all three kinds of mismatch.
引用
收藏
页码:105 / 110
页数:6
相关论文
共 50 条
  • [41] Dynamic Feature Extraction for Speech Signal Based on MUSIC
    Han Zhiyan
    Wang Jian
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 3770 - 3773
  • [42] Unsupervised speech denoising via perceptually motivated robust principal component analysis
    Zhang, Xiongwei (xwzhang9898@163.com), 2017, Science Press (42):
  • [43] Speech enhancement based on perceptually motivated guided spectrogram filtering
    Wang, Jie
    Yan, Linhuang
    Yang, Qiaohe
    Yuan, Minmin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 5443 - 5454
  • [44] AN AUDITORY-BASED FEATURE FOR ROBUST SPEECH RECOGNITION
    Shao, Yang
    Jin, Zhaozhang
    Wang, DeLiang
    Srinivasan, Soundararajan
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4625 - +
  • [45] Filterbank Analysis of MFCC Feature Extraction in Robust Children Speech Recognition
    Naing, Hay Mar Soe
    Miyanaga, Yoshikazu
    Hidayat, Risanuri
    Winduratna, Bondhan
    2019 INTERNATIONAL SYMPOSIUM ON MULTIMEDIA AND COMMUNICATION TECHNOLOGY (ISMAC), 2019,
  • [46] Robust speech recognition and feature extraction using HMM2
    Weber, K
    Ikbal, S
    Bengio, S
    Bourlard, H
    COMPUTER SPEECH AND LANGUAGE, 2003, 17 (2-3): : 195 - 211
  • [47] FEATURE EXTRACTION WITH A MULTISCALE MODULATION ANALYSIS FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Mueller, Florian
    Mertins, Alfred
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7427 - 7431
  • [48] Speech recognition with emphasis on wavelet based feature extraction
    Farooq, O
    Datta, S
    IETE JOURNAL OF RESEARCH, 2002, 48 (01) : 3 - 13
  • [49] Acceleration of feature extraction for FPGA based speech recognition
    Arminas, Vytautas
    Tamulevicius, Gintautas
    Navakauskas, Dalius
    Ivanovas, Edgaras
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2010, 2010, 7745
  • [50] FEATURE EXTRACTION BASED ON HEARING SYSTEM SIGNAL PROCESSING FOR ROBUST LARGE VOCABULARY SPEECH RECOGNITION
    Li, Qi
    Sun, Xie
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1262 - 1265