Robust speech features based on wavelet transform with application to speaker identification

被引:20
|
作者
Hsieh, CT [1 ]
Lai, E [1 ]
Wang, YC [1 ]
机构
[1] Tamkang Univ, Dept Elect Engn, Taipei, Taiwan
来源
关键词
D O I
10.1049/ip-vis:20020121
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An effective and robust speech feature extraction method is presented. Based on the time-frequency multiresolution property of wavelet transform, the input speech signal is decomposed into various frequency channels. For capturing the characteristics of an individual speaker, the linear predictive cepstral coefficients of the approximation channel and entropy value of the detail channel for each decomposition process are calculated. In addition, an adaptive thresholding technique for each lower resolution is also applied to remove the influence of noise interference. Experimental results show that using this mechanism not only effectively reduces the influence of noise interference but also improves the recognition performance. Finally, the proposed method is evaluated on the MAT telephone speech database for text-independent speaker identification using the group vector quantisation identifier. Some popular existing methods are also evaluated for comparison, and the results show that the proposed feature extraction algorithm is more effective and robust than the other existing methods. In addition, the performance of the proposed method is very satisfactory even in a low SNR environment corrupted by Gaussian white noise.
引用
收藏
页码:108 / 114
页数:7
相关论文
共 50 条
  • [21] Speaker Identification based on Robust AM-FM Features
    Deshpande, Mangesh S.
    Holambe, Raghunath S.
    [J]. 2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 62 - +
  • [22] Robust Feature Extracting of Speech Signal Based on Wavelet Packet Transform
    Han Zhiyan
    Wang Jian
    Lun Shuxian
    Wang Xu
    [J]. PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 2832 - 2837
  • [23] Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients
    Sen, Tjong Wan
    Trilaksono, Bambang Riyanto
    Arman, Arry Akhmad
    Mandala, Rila
    [J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2009, 3 (02) : 123 - 134
  • [24] Visual speech recognition using wavelet transform and moment based features
    Yau, Wai C.
    Kumar, Dinesh K.
    Arjunan, Sridhar P.
    Kumar, Sanjay
    [J]. ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: ROBOTICS AND AUTOMATION, 2006, : 340 - 345
  • [25] Performance of selective speech features for speaker identification
    Department of Electronics and Communication Engineering, Indian Institute of Technology, Guwahati 781039, India
    [J]. J Inst Eng India Part CP, 2008, MAY (38-46):
  • [26] Speaker identification using speech and lip features
    Ou, GB
    Li, X
    Yao, XC
    Jia, HB
    Murphey, YL
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2565 - 2570
  • [27] A novel robust feature of speech signal based on the mellin transform for speaker-independent speech recognition
    Chen, JD
    Xu, B
    Huang, TY
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 629 - 632
  • [28] Speaker Modeling Using Emotional Speech for More Robust Speaker Identification
    M. Milošević
    Ž. Nedeljković
    U. Glavitsch
    Ž. Đurović
    [J]. Journal of Communications Technology and Electronics, 2019, 64 : 1256 - 1265
  • [29] Speaker Modeling Using Emotional Speech for More Robust Speaker Identification
    Milosevic, M.
    Nedeljkovic, Z.
    Glavitsch, U.
    Durovic, Z.
    [J]. JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2019, 64 (11) : 1256 - 1265
  • [30] Automatic speech/speaker recognition in noisy environments using wavelet transform
    Alkhaldi, W
    Fakhr, W
    Hamdy, N
    [J]. 2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2002, : 463 - 466