Noisy speech recognition using de-noised multiresolution analysis acoustic features

被引:10
|
作者
Chan, CP [1 ]
Ching, PC [1 ]
Lee, T [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Shatin, Hong Kong, Peoples R China
来源
关键词
Cepstral mean normalization - Feature parameters - High frequency bands - Mel-frequency cepstral coefficients - Noisy speech recognition - Novel applications - Robust speech recognition - Wavelet packet filters;
D O I
10.1121/1.1398054
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes a novel application of multiresolution analysis (MRA) in extracting acoustic features that possess de-noising capability for robust speech recognition. The MRA algorithm is used to construct a mel-scaled wavelet packet filter-bank, from which subband powers are computed as the feature parameters for speech recognition. Wiener filtering is applied to a few selected subbands at some intermediate stages of decomposition. For high-frequency bands, Wiener filters are designed based on a reduced fraction of the estimated noise power, making the consonant features much more prominent and contrastive. The proposed method is evaluated in phone recognition experiments with the MIT database. In the presence of stationary white noise at 10-dB SNR, the de-noised MRA features attain a phone recognition rate of 32%. There is a noticeable improvement compared with the accuracy of 29% and 20% attained by the commonly used mel-frequency cepstral coefficients (MFCC) with and without cepstral mean normalization (CMN), respectively. The effectiveness of the MRA features is also verified by the fact that they exhibit smaller distortion from clean speech. (C) 2001 Acoustical Society of America.
引用
收藏
页码:2567 / 2574
页数:8
相关论文
共 50 条
  • [1] Noisy speech recognition using de-noised multiresolution analysis acoustic features
    Chan, C.P.
    Ching, P.C.
    Lee, T.
    1600, Acoustical Society of America (110):
  • [2] Using wavelet de-noised spectra in NMR screening
    Trbovic, N
    Dancea, F
    Langer, T
    Günther, U
    JOURNAL OF MAGNETIC RESONANCE, 2005, 173 (02) : 280 - 287
  • [3] Classification of mental tasks using de-noised EEG signals
    Daud, SM
    Yunus, J
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 2206 - 2209
  • [4] Regularizing GRAPPA using simultaneous sparsity to recover de-noised images
    Weller, Daniel S.
    Polimeni, Jonathan R.
    Grady, Leo
    Wald, Lawrence L.
    Adalsteinsson, Elfar
    Goyal, Vivek K.
    WAVELETS AND SPARSITY XIV, 2011, 8138
  • [5] Automated protein NMR structure determination using wavelet de-noised NOESY spectra
    Dancea, F
    Günther, U
    JOURNAL OF BIOMOLECULAR NMR, 2005, 33 (03) : 139 - 152
  • [6] Automated Protein NMR Structure Determination Using Wavelet De-noised NOESY Spectra
    Felician Dancea
    Ulrich Günther
    Journal of Biomolecular NMR, 2005, 33 : 139 - 152
  • [7] ROBUST FEATURES FOR NOISY SPEECH RECOGNITION USING JITTER AND SHIMMER
    Rahali, Hajer
    Hajaiej, Zied
    Ellouze, Noureddine
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2015, 11 (03): : 955 - 963
  • [8] Photon-counting Spectral CT with De-noised Principal Component Analysis (PCA)
    Xie, Huiqiao
    Liu, Yufei
    Thuering, Thomas
    Long, Wenting
    Tang, Xiangyang
    15TH INTERNATIONAL MEETING ON FULLY THREE-DIMENSIONAL IMAGE RECONSTRUCTION IN RADIOLOGY AND NUCLEAR MEDICINE, 2019, 11072
  • [9] Alzheimer's Dementia Recognition Using Acoustic, Lexical, Disfluency and Speech Pause Features Robust to Noisy Inputs
    Rohanian, Morteza
    Hough, Julian
    Purver, Matthew
    INTERSPEECH 2021, 2021, : 3820 - 3824
  • [10] Robust Bioinformatic Method for Estimating Tumor Content in cfDNA Using De-noised Target Sequencing
    Conley, A.
    Kotlar, A.
    Aldersley, J.
    Cao, Y.
    Ke, Y.
    Liu, M.
    McFarland, K.
    Modak, T.
    Polvino, S.
    Rishishwar, L.
    Wang, Z.
    Yang, N.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2023, 25 (11): : S86 - S86