Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition

被引:0
|
作者
Jokic, Ivan D. [1 ]
Jokic, Stevan D. [1 ]
Delic, Vlado D. [1 ]
Peric, Zoran H. [2 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Trg Dositeja Obradovica 6, Novi Sad 21000, Serbia
[2] Univ Nis, Fac Elect Engn, Nish 18000, Serbia
关键词
Automatic speaker recognition; auditory critical bands; covariance matrix; exponential auditory critical bands; mel-frequency cepstral coefficients; multidimensional Gaussian distribution;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Automatic speaker recognizer can be based on the use of mel-frequency cepstral coefficients as speaker features. Mel-frequency cepstral coefficients depend on energy inside considered auditory critical bands. These auditory critical bands model masking phenomena. Application of triangular auditory critical bands results in better recognition accuracy with respect to the case when rectangular auditory critical bands are applied. Recognition accuracy when exponential auditory critical bands are applied outperforms recognition accuracy of automatic speaker recognizer when triangular or rectangular auditory critical bands are applied. Application of transformation on elements of speaker model, which target decreasing of difference between testing and training models of the same speaker, can increase recognition accuracy.
引用
收藏
页码:419 / 424
页数:6
相关论文
共 50 条
  • [41] Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system
    Zhiyou Yang
    Ying Huang
    Evolutionary Intelligence, 2022, 15 : 2485 - 2494
  • [42] Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system
    Yang, Zhiyou
    Huang, Ying
    EVOLUTIONARY INTELLIGENCE, 2022, 15 (04) : 2485 - 2494
  • [43] Text independent speaker recognition using the Mel frequency cepstral coefficients and a neural network classifier
    Seddik, H
    Rahmouni, A
    Sayadi, M
    ISCCSP : 2004 FIRST INTERNATIONAL SYMPOSIUM ON CONTROL, COMMUNICATIONS AND SIGNAL PROCESSING, 2004, : 631 - 634
  • [44] Predicting fundamental frequency from mel-frequency cepstral coefficients to enable speech reconstruction
    Shao, X
    Milner, B
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (02): : 1134 - 1143
  • [45] Extracting Mel-Frequency and Bark-Frequency Cepstral Coefficients from Encrypted Signals
    Thaine, Patricia
    Penn, Gerald
    INTERSPEECH 2019, 2019, : 3715 - 3719
  • [46] Mel, Linear, and Antimel Frequency Cepstral Coefficients in Broad Phonetic Regions for Telephone Speaker Recognition
    Lei, Howard
    Lopez, Eduardo
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2307 - 2310
  • [47] Hidden Markov Model Neurons Classification based on Mel-frequency Cepstral Coefficients
    Haggag, Sherif
    Mohamed, Shady
    Haggag, Hussein
    Nahavandi, Saeid
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING (SOSE 2014), 2014, : 166 - 170
  • [48] Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients
    Nasr, Marwa A.
    Abd-Elnaby, Mohammed
    El-Fishawy, Adel S.
    El-Rabaie, S.
    Abd El-Samie, Fathi E.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (04) : 941 - 951
  • [49] Vector quantization in text dependent automatic speaker recognition using Mel-Frequency Cepstrum Coefficient
    Kabir, Ahsanul
    Ahsan, Sheikh Mohammad Masudul
    PROCEEDINGS OF THE WSEAS INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, ELECTRONICS, CONTROL & SIGNAL PROCESSING: SELECTED TOPICS ON CIRCUITS, SYSTEMS, ELECTRONICS, CONTROL & SIGNAL PROCESSING, 2007, : 352 - 355
  • [50] Prediction of fundamental frequency and voicing from mel-frequency cepstral coefficients for unconstrained speech reconstruction
    Milner, Ben
    Shao, Xu
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 24 - 33