Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification

被引：0

作者：

Srivastava, Smriti ^{[1
]}

Bhardwaj, Saurabh ^{[1
]}

Bhandari, Abhishek ^{[1
]}

Gupta, Krit ^{[1
]}

Bahl, Hitesh ^{[1
]}

Gupta, J. R. P. ^{[1
]}

机构：

[1] Netaji Subhas Inst Technol, New Delhi 110078, India

来源：

INTELLIGENT INFORMATICS | 2013年 / 182卷

关键词：

WPT; MFCC; HMM; GMM; Speaker Identification;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The present research proposes a paradigm which combines the Wavelet Packet Transform (WPT) with the distinguished Mel Frequency Cepstral Coefficients (MFCC) for extraction of speech feature vectors in the task of text independent speaker identification. The proposed technique overcomes the single resolution limitation of MFCC by incorporating the multi resolution analysis offered by WPT. To check the accuracy of the proposed paradigm in the real life scenario, it is tested on the speaker database by using Hidden Markov Model (HMM) and Gaussian Mixture Model (GMM) as classifiers and their relative performance for identification purpose is compared. The identification results of the MFCC features and the Wavelet Packet based Mel Frequency Cepstral (WP-MFC) Features are compared to validate the efficiency of the proposed paradigm. Accuracy as high as 100% was achieved in some cases using WP-MFC Features.

引用

页码：237 / 247

页数：11

共 50 条

[1] A Wavelet Packet and Mel-Frequency Cepstral Coefficients-Based Feature Extraction Method for Speaker Identification
Turner, Claude
Joseph, Anthony
[J]. COMPLEX ADAPTIVE SYSTEMS, 2015, 2015, 61 : 416 - 421
[2] Mel Frequency Cepstral Coefficients Based Text Independent Automatic Speaker Recognition Using Matlab
Singh, Amit Kumar
Singh, Rohit
Dwivedi, Ashutosh
[J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON RELIABILTY, OPTIMIZATION, & INFORMATION TECHNOLOGY (ICROIT 2014), 2014, : 524 - 527
[3] Phase Based Mel Frequency Cepstral Coefficients for Speaker Identification
Srivastava, Sumit
Chandra, Mahesh
Sahoo, G.
[J]. INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 3, INDIA 2016, 2016, 435 : 309 - 316
[4] Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients
Nasr, Marwa A.
Abd-Elnaby, Mohammed
El-Fishawy, Adel S.
El-Rabaie, S.
Abd El-Samie, Fathi E.
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (04) : 941 - 951
[5] Text independent speaker recognition using the Mel frequency cepstral coefficients and a neural network classifier
Seddik, H
Rahmouni, A
Sayadi, M
[J]. ISCCSP : 2004 FIRST INTERNATIONAL SYMPOSIUM ON CONTROL, COMMUNICATIONS AND SIGNAL PROCESSING, 2004, : 631 - 634
[6] Integration of Mel-frequency Cepstral Coefficients with Log Energy and Temporal Derivatives for Text-Independent Speaker Identification
Dhonde, S. B.
Chaudhari, Amol
Jagade, S. M.
[J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 1, 2017, 468 : 791 - 797
[7] The wavelet packet based cepstral features for open set speaker classification in Marathi
Patil, HA
Dutta, PK
Basu, TK
[J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 134 - +
[8] Speaker independent phoneme recognition based on fractal dimension (DF) and the mel-frequency cepstral coefficients features
Fekkai, S
Al-Akaidi, M
Blackledge, JM
[J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4014 - 4014
[9] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
Jokic, Ivan D.
Jokic, Stevan D.
Delic, Vlado D.
Peric, Zoran H.
[J]. 2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424
[10] Modified Mel-frequency Cepstral Coefficients (MMFCC) in Robust Text-dependent Speaker Identification
Islam, Md. Atiqul
[J]. 2017 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2017, : 505 - 509

← 1 2 3 4 5 →