Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients

被引:0
|
作者
Sen, Tjong Wan [1 ]
Trilaksono, Bambang Riyanto [1 ]
Arman, Arry Akhmad [1 ]
Mandala, Rila [1 ]
机构
[1] Bandung Inst Technol, Jl Ganesha 10, Bandung 40132, Indonesia
关键词
complex wavelet packet coefficients; feature; noise; phoneme; principal component analysis; robust; speech recognition;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To improve the performance of phoneme based Automatic Speech Recognition (ASR) in noisy environment; we developed a new technique that could add robustness to clean phonemes features. These robust features are obtained from Complex Wavelet Packet Transform (CWPT) coefficients. Since the CWPT coefficients represent all different frequency bands of the input signal, decomposing the input signal into complete CWPT tree would also cover all frequencies involved in recognition process. For time overlapping signals with different frequency contents, e.g. phoneme signal with noises, its CWPT coefficients are the combination of CWPT coefficients of phoneme signal and CWPT coefficients of noises. The CWPT coefficients of phonemes signal would be changed according to frequency components contained in noises. Since the numbers of phonemes in every language are relatively small (limited) and already well known, one could easily derive principal component vectors from clean training dataset using Principal Component Analysis (PCA). These principal component vectors could be used then to add robustness and minimize noises effects in testing phase. Simulation results, using Alpha Numeric 4 (AN4) from Carnegie Mellon University and NOISEX-92 examples from Rice University, showed that this new technique could be used as features extractor that improves the robustness of phoneme based ASR systems in various adverse noisy conditions and still preserves the performance in clean environments.
引用
收藏
页码:123 / 134
页数:12
相关论文
共 50 条
  • [1] Emotion recognition from speech using wavelet packet transform and prosodic features
    Gupta, Manish
    Bharti, Shambhu Shankar
    Agarwal, Suneeta
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (02) : 1541 - 1553
  • [2] Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1242 - 1245
  • [3] Robust Perceptual Wavelet Packet Features for Recognition of Continuous Kannada Speech
    D. J. Mahadevaswamy
    [J]. Wireless Personal Communications, 2021, 121 : 1781 - 1804
  • [4] Robust Perceptual Wavelet Packet Features for Recognition of Continuous Kannada Speech
    Mahadevaswamy
    Ravi, D. J.
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2021, 121 (03) : 1781 - 1804
  • [5] Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition
    Pavez, Eduardo
    Silva, Jorge F.
    [J]. SPEECH COMMUNICATION, 2012, 54 (06) : 814 - 835
  • [6] Robust speech recognition using wavelet coefficient features
    Gupta, M
    Gilbert, A
    [J]. ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 445 - 448
  • [7] Robust speech, recognition using adaptively denoised wavelet coefficients
    Akyol, E
    Erzin, E
    Tekalp, AM
    [J]. PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 407 - 409
  • [8] Phase Autocorrelation Bark Wavelet Transform (PACWT) Features for Robust Speech Recognition
    Majeed, Sayf A.
    Husain, Hafizah
    Samad, Salina A.
    [J]. ARCHIVES OF ACOUSTICS, 2015, 40 (01) : 25 - 31
  • [9] Automatic speech/speaker recognition in noisy environments using wavelet transform
    Alkhaldi, W
    Fakhr, W
    Hamdy, N
    [J]. 2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2002, : 463 - 466
  • [10] Visual speech recognition using wavelet transform and moment based features
    Yau, Wai C.
    Kumar, Dinesh K.
    Arjunan, Sridhar P.
    Kumar, Sanjay
    [J]. ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: ROBOTICS AND AUTOMATION, 2006, : 340 - 345