Robust speech features based on wavelet transform with application to speaker identification

被引：20

作者：

Hsieh, CT ^{[1
]}

Lai, E ^{[1
]}

Wang, YC ^{[1
]}

机构：

[1] Tamkang Univ, Dept Elect Engn, Taipei, Taiwan

来源：

IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING | 2002年 / 149卷 / 02期

关键词：

D O I：

10.1049/ip-vis:20020121

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

An effective and robust speech feature extraction method is presented. Based on the time-frequency multiresolution property of wavelet transform, the input speech signal is decomposed into various frequency channels. For capturing the characteristics of an individual speaker, the linear predictive cepstral coefficients of the approximation channel and entropy value of the detail channel for each decomposition process are calculated. In addition, an adaptive thresholding technique for each lower resolution is also applied to remove the influence of noise interference. Experimental results show that using this mechanism not only effectively reduces the influence of noise interference but also improves the recognition performance. Finally, the proposed method is evaluated on the MAT telephone speech database for text-independent speaker identification using the group vector quantisation identifier. Some popular existing methods are also evaluated for comparison, and the results show that the proposed feature extraction algorithm is more effective and robust than the other existing methods. In addition, the performance of the proposed method is very satisfactory even in a low SNR environment corrupted by Gaussian white noise.

引用

页码：108 / 114

页数：7

共 50 条

[21] Speaker Identification based on Robust AM-FM Features
Deshpande, Mangesh S.
Holambe, Raghunath S.
[J]. 2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 62 - +
[22] Robust Feature Extracting of Speech Signal Based on Wavelet Packet Transform
Han Zhiyan
Wang Jian
Lun Shuxian
Wang Xu
[J]. PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 2832 - 2837
[23] Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients
Sen, Tjong Wan
Trilaksono, Bambang Riyanto
Arman, Arry Akhmad
Mandala, Rila
[J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2009, 3 (02) : 123 - 134
[24] Visual speech recognition using wavelet transform and moment based features
Yau, Wai C.
Kumar, Dinesh K.
Arjunan, Sridhar P.
Kumar, Sanjay
[J]. ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: ROBOTICS AND AUTOMATION, 2006, : 340 - 345
[25] Performance of selective speech features for speaker identification
Department of Electronics and Communication Engineering, Indian Institute of Technology, Guwahati 781039, India
[J]. J Inst Eng India Part CP, 2008, MAY (38-46):
[26] Speaker identification using speech and lip features
Ou, GB
Li, X
Yao, XC
Jia, HB
Murphey, YL
[J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2565 - 2570
[27] A novel robust feature of speech signal based on the mellin transform for speaker-independent speech recognition
Chen, JD
Xu, B
Huang, TY
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 629 - 632
[28] Speaker Modeling Using Emotional Speech for More Robust Speaker Identification
M. Milošević
Ž. Nedeljković
U. Glavitsch
Ž. Đurović
[J]. Journal of Communications Technology and Electronics, 2019, 64 : 1256 - 1265
[29] Speaker Modeling Using Emotional Speech for More Robust Speaker Identification
Milosevic, M.
Nedeljkovic, Z.
Glavitsch, U.
Durovic, Z.
[J]. JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2019, 64 (11) : 1256 - 1265
[30] Automatic speech/speaker recognition in noisy environments using wavelet transform
Alkhaldi, W
Fakhr, W
Hamdy, N
[J]. 2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2002, : 463 - 466

← 1 2 3 4 5 →