Robust speech features based on wavelet transform with application to speaker identification

被引:20
|
作者
Hsieh, CT [1 ]
Lai, E [1 ]
Wang, YC [1 ]
机构
[1] Tamkang Univ, Dept Elect Engn, Taipei, Taiwan
来源
关键词
D O I
10.1049/ip-vis:20020121
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An effective and robust speech feature extraction method is presented. Based on the time-frequency multiresolution property of wavelet transform, the input speech signal is decomposed into various frequency channels. For capturing the characteristics of an individual speaker, the linear predictive cepstral coefficients of the approximation channel and entropy value of the detail channel for each decomposition process are calculated. In addition, an adaptive thresholding technique for each lower resolution is also applied to remove the influence of noise interference. Experimental results show that using this mechanism not only effectively reduces the influence of noise interference but also improves the recognition performance. Finally, the proposed method is evaluated on the MAT telephone speech database for text-independent speaker identification using the group vector quantisation identifier. Some popular existing methods are also evaluated for comparison, and the results show that the proposed feature extraction algorithm is more effective and robust than the other existing methods. In addition, the performance of the proposed method is very satisfactory even in a low SNR environment corrupted by Gaussian white noise.
引用
收藏
页码:108 / 114
页数:7
相关论文
共 50 条
  • [1] A robust speaker identification system based on wavelet transform
    Hsieh, CT
    Wang, YC
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (07): : 839 - 846
  • [2] Robust speaker identification system based on wavelet transform and Gaussian mixture model
    Chen, WC
    Hsieh, CT
    Lai, E
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 263 - 271
  • [3] Robust speaker identification system based on wavelet transform and Gaussian mixture model
    Hsieh, CT
    Lai, E
    Wang, YC
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2003, 19 (02) : 267 - 282
  • [4] Speaker Identification Wavelet Transform Based Method
    Daqrouq, Khaled
    Al-Sawalmeh, Wael
    Al-Qawasmi, Abdel-Rahman
    Abu-Isbeib, Ibrahim N.
    [J]. 2008 5TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS AND DEVICES, VOLS 1 AND 2, 2008, : 698 - 702
  • [5] Wavelet Scattering Transform Depth Benefit, An Application for Speaker Identification
    Moufidi, Abderrazzaq
    Rousseau, David
    Rasti, Pejman
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2022, 2023, 13739 : 97 - 106
  • [6] Robust classification of speech based on the dyadic wavelet transform with application to CELP coding
    Stegmann, J
    Schroder, G
    Fischer, KA
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 546 - 549
  • [7] Robust FHPD Features from Speech Harmonic Analysis for Speaker Identification
    Wang, Shuiping
    Tang, Zhenmin
    Jiang, Ye
    Chen, Ying
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (04): : 1591 - 1598
  • [8] Phase Autocorrelation Bark Wavelet Transform (PACWT) Features for Robust Speech Recognition
    Majeed, Sayf A.
    Husain, Hafizah
    Samad, Salina A.
    [J]. ARCHIVES OF ACOUSTICS, 2015, 40 (01) : 25 - 31
  • [9] Spectral Restoration Based Speech Enhancement for Robust Speaker Identification
    Saleem, Nasir
    Tareen, Tayyaba Gul
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2018, 5 (01): : 34 - 39
  • [10] Application of wavelet Karhunen-Loeve transform to text independent speaker identification
    Lung, SY
    [J]. Proceedings of the Sixth IASTED International Conference on Signal and Image Processing, 2004, : 157 - 158