Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification

被引:0
|
作者
Srivastava, Smriti [1 ]
Bhardwaj, Saurabh [1 ]
Bhandari, Abhishek [1 ]
Gupta, Krit [1 ]
Bahl, Hitesh [1 ]
Gupta, J. R. P. [1 ]
机构
[1] Netaji Subhas Inst Technol, New Delhi 110078, India
来源
INTELLIGENT INFORMATICS | 2013年 / 182卷
关键词
WPT; MFCC; HMM; GMM; Speaker Identification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The present research proposes a paradigm which combines the Wavelet Packet Transform (WPT) with the distinguished Mel Frequency Cepstral Coefficients (MFCC) for extraction of speech feature vectors in the task of text independent speaker identification. The proposed technique overcomes the single resolution limitation of MFCC by incorporating the multi resolution analysis offered by WPT. To check the accuracy of the proposed paradigm in the real life scenario, it is tested on the speaker database by using Hidden Markov Model (HMM) and Gaussian Mixture Model (GMM) as classifiers and their relative performance for identification purpose is compared. The identification results of the MFCC features and the Wavelet Packet based Mel Frequency Cepstral (WP-MFC) Features are compared to validate the efficiency of the proposed paradigm. Accuracy as high as 100% was achieved in some cases using WP-MFC Features.
引用
收藏
页码:237 / 247
页数:11
相关论文
共 50 条
  • [31] Whispered Speech Conversion Based on the Inversion of Mel Frequency Cepstral Coefficient Features
    Zhu, Qiang
    Wang, Zhong
    Dou, Yunfeng
    Zhou, Jian
    [J]. ALGORITHMS, 2022, 15 (02)
  • [32] Variable Length Teager Energy Based Mel Cepstral Features for Identification of Twins
    Patil, Hemant A.
    Parhi, Keshab K.
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 525 - +
  • [33] Speaker identification and verification based on cepstral features and fuzzy nonlinear classifier
    Dustor, A.
    [J]. Proceedings of the International Conference Mixed Design of Integrated Circuits and Systems, 2006, : 692 - 697
  • [34] Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems
    Md. Jahangir Alam
    Patrick Kenny
    Douglas O’Shaughnessy
    [J]. Cognitive Computation, 2013, 5 : 533 - 544
  • [35] Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems
    Alam, Md. Jahangir
    Kenny, Patrick
    O'Shaughnessy, Douglas
    [J]. COGNITIVE COMPUTATION, 2013, 5 (04) : 533 - 544
  • [36] Speaker identification using higher order spectral phase features and their effectiveness vis-a-vis Mel-Cepstral features
    Chandran, V
    Ning, D
    Sridharan, S
    [J]. BIOMETRIC AUTHENTICATION, PROCEEDINGS, 2004, 3072 : 614 - 622
  • [37] Fused Mel Feature sets based Text-Independent Speaker Identification using Gaussian Mixture Model
    Kumari, R. Shantha Selva
    Nidhyananthan, S. Selva
    Anand, G.
    [J]. INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 319 - 326
  • [38] Higher order information set based features for text-independent speaker identification
    Medikonda J.
    Madasu H.
    [J]. International Journal of Speech Technology, 2018, 21 (03) : 451 - 461
  • [39] Speaker Recognition Using Mel Frequency Cepstral Coefficient and Locality Sensitive Hashing
    Awais, Ahmed
    Kun, She
    Yu, Yue
    Hayat, Shaukat
    Ahmed, Aftab
    Tu, Tianyi
    [J]. 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD), 2018, : 271 - 276
  • [40] Speaker Profiling by Extracting Paralinguistic Parameters using Mel Frequency Cepstral Coefficients
    Galgali, Sudeep
    Priyanka, Selva S.
    Shashank, B. R.
    Patil, Annapurna P.
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2015, : 486 - 489