Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification

被引:0
|
作者
Srivastava, Smriti [1 ]
Bhardwaj, Saurabh [1 ]
Bhandari, Abhishek [1 ]
Gupta, Krit [1 ]
Bahl, Hitesh [1 ]
Gupta, J. R. P. [1 ]
机构
[1] Netaji Subhas Inst Technol, New Delhi 110078, India
来源
INTELLIGENT INFORMATICS | 2013年 / 182卷
关键词
WPT; MFCC; HMM; GMM; Speaker Identification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The present research proposes a paradigm which combines the Wavelet Packet Transform (WPT) with the distinguished Mel Frequency Cepstral Coefficients (MFCC) for extraction of speech feature vectors in the task of text independent speaker identification. The proposed technique overcomes the single resolution limitation of MFCC by incorporating the multi resolution analysis offered by WPT. To check the accuracy of the proposed paradigm in the real life scenario, it is tested on the speaker database by using Hidden Markov Model (HMM) and Gaussian Mixture Model (GMM) as classifiers and their relative performance for identification purpose is compared. The identification results of the MFCC features and the Wavelet Packet based Mel Frequency Cepstral (WP-MFC) Features are compared to validate the efficiency of the proposed paradigm. Accuracy as high as 100% was achieved in some cases using WP-MFC Features.
引用
收藏
页码:237 / 247
页数:11
相关论文
共 50 条
  • [1] A Wavelet Packet and Mel-Frequency Cepstral Coefficients-Based Feature Extraction Method for Speaker Identification
    Turner, Claude
    Joseph, Anthony
    [J]. COMPLEX ADAPTIVE SYSTEMS, 2015, 2015, 61 : 416 - 421
  • [2] Mel Frequency Cepstral Coefficients Based Text Independent Automatic Speaker Recognition Using Matlab
    Singh, Amit Kumar
    Singh, Rohit
    Dwivedi, Ashutosh
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON RELIABILTY, OPTIMIZATION, & INFORMATION TECHNOLOGY (ICROIT 2014), 2014, : 524 - 527
  • [3] Phase Based Mel Frequency Cepstral Coefficients for Speaker Identification
    Srivastava, Sumit
    Chandra, Mahesh
    Sahoo, G.
    [J]. INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 3, INDIA 2016, 2016, 435 : 309 - 316
  • [4] Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients
    Nasr, Marwa A.
    Abd-Elnaby, Mohammed
    El-Fishawy, Adel S.
    El-Rabaie, S.
    Abd El-Samie, Fathi E.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (04) : 941 - 951
  • [5] Text independent speaker recognition using the Mel frequency cepstral coefficients and a neural network classifier
    Seddik, H
    Rahmouni, A
    Sayadi, M
    [J]. ISCCSP : 2004 FIRST INTERNATIONAL SYMPOSIUM ON CONTROL, COMMUNICATIONS AND SIGNAL PROCESSING, 2004, : 631 - 634
  • [6] Integration of Mel-frequency Cepstral Coefficients with Log Energy and Temporal Derivatives for Text-Independent Speaker Identification
    Dhonde, S. B.
    Chaudhari, Amol
    Jagade, S. M.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 1, 2017, 468 : 791 - 797
  • [7] The wavelet packet based cepstral features for open set speaker classification in Marathi
    Patil, HA
    Dutta, PK
    Basu, TK
    [J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 134 - +
  • [8] Speaker independent phoneme recognition based on fractal dimension (DF) and the mel-frequency cepstral coefficients features
    Fekkai, S
    Al-Akaidi, M
    Blackledge, JM
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4014 - 4014
  • [9] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    [J]. 2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424
  • [10] Modified Mel-frequency Cepstral Coefficients (MMFCC) in Robust Text-dependent Speaker Identification
    Islam, Md. Atiqul
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2017, : 505 - 509