New Features Using Robust MVDR Spectrum of Filtered Autocorrelation Sequence for Robust Speech Recognition

被引:5
|
作者
Seyedin, Sanaz [1 ]
Ahadi, Seyed Mohammad [2 ]
Gazor, Saeed [1 ]
机构
[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON K7L 3N6, Canada
[2] Amirkabir Univ Technol, Dept Elect Engn, Tehran 15914, Iran
来源
关键词
FEATURE-EXTRACTION; FRONT-END; COEFFICIENTS;
D O I
10.1155/2013/634160
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents a novel noise-robust feature extraction method for speech recognition using the robust perceptual minimum variance distortionless response (MVDR) spectrum of temporally filtered autocorrelation sequence. The perceptual MVDR spectrum of the filtered short-time autocorrelation sequence can reduce the effects of residue of the nonstationary additive noise which remains after filtering the autocorrelation. To achieve a more robust front-end, we also modify the robust distortionless constraint of the MVDR spectral estimation method via revised weighting of the subband power spectrum values based on the sub-band signal to noise ratios (SNRs), which adjusts it to the new proposed approach. This new function allows the components of the input signal at the frequencies least affected by noise to pass with larger weights and attenuates more effectively the noisy and undesired components. This modification results in reduction of the noise residuals of the estimated spectrum from the filtered autocorrelation sequence, thereby leading to a more robust algorithm. Our proposed method, when evaluated on Aurora 2 task for recognition purposes, outperformed all Mel frequency cepstral coefficients (MFCC) as the baseline, relative autocorrelation sequence MFCC (RAS-MFCC), and the MVDR-based features in several different noisy conditions.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Robust feature extraction for continuous speech recognition using the MVDR spectrum estimation method
    Dharanipragada, Satya
    Yapanel, Umit H.
    Rao, Bhaskar D.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 224 - 234
  • [2] Robust features for speech recognition using Minimum Variance Distortionless Response (MVDR) spectrum estimation and feature normalization techniques
    Chen, Y
    Lee, LS
    [J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 101 - 104
  • [3] Robust feature extraction using spectral peaks of the filtered higher lag autocorrelation sequence of the speech signal
    Farahani, G.
    Ahadi, S. M.
    Homayounpour, M. M.
    Kashi, A.
    [J]. 2006 IEEE International Symposium on Signal Processing and Information Technology, Vols 1 and 2, 2006, : 896 - 901
  • [4] Enhancing the magnitude spectrum of speech features for robust speech recognition
    Hung, Jeih-weih
    Fan, Hao-teng
    Tu, Wen-hsiang
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
  • [5] Enhancing the magnitude spectrum of speech features for robust speech recognition
    Jeih-weih Hung
    Hao-teng Fan
    Wen-hsiang Tu
    [J]. EURASIP Journal on Advances in Signal Processing, 2012
  • [6] Model compensation using robust features for robust speech recognition
    Zhang, Jun
    Wei, Gang
    [J]. Shuju Caiji Yu Chuli/Journal of Data Acquisition and Processing, 2003, 18 (03):
  • [7] MVDR based feature extraction for robust speech recognition
    Dharanipragada, S
    Rao, BD
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 309 - 312
  • [8] Regularized MVDR Spectrum Estimation-based Robust Feature Extractors for Speech Recognition
    Alam, Md Jahangir
    Kenny, Patrick
    O'Shaughnessy, Douglas
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 891 - 895
  • [9] Robust speech recognition using harmonic features
    Goh, Yeh Huann
    Raveendran, Paramesran
    Jamuar, Sudhanshu Shekhar
    [J]. IET SIGNAL PROCESSING, 2014, 8 (02) : 167 - 175
  • [10] Phase Autocorrelation Bark Wavelet Transform (PACWT) Features for Robust Speech Recognition
    Majeed, Sayf A.
    Husain, Hafizah
    Samad, Salina A.
    [J]. ARCHIVES OF ACOUSTICS, 2015, 40 (01) : 25 - 31