Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition

被引:0
|
作者
Jokic, Ivan D. [1 ]
Jokic, Stevan D. [1 ]
Delic, Vlado D. [1 ]
Peric, Zoran H. [2 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Trg Dositeja Obradovica 6, Novi Sad 21000, Serbia
[2] Univ Nis, Fac Elect Engn, Nish 18000, Serbia
关键词
Automatic speaker recognition; auditory critical bands; covariance matrix; exponential auditory critical bands; mel-frequency cepstral coefficients; multidimensional Gaussian distribution;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Automatic speaker recognizer can be based on the use of mel-frequency cepstral coefficients as speaker features. Mel-frequency cepstral coefficients depend on energy inside considered auditory critical bands. These auditory critical bands model masking phenomena. Application of triangular auditory critical bands results in better recognition accuracy with respect to the case when rectangular auditory critical bands are applied. Recognition accuracy when exponential auditory critical bands are applied outperforms recognition accuracy of automatic speaker recognizer when triangular or rectangular auditory critical bands are applied. Application of transformation on elements of speaker model, which target decreasing of difference between testing and training models of the same speaker, can increase recognition accuracy.
引用
收藏
页码:419 / 424
页数:6
相关论文
共 50 条
  • [1] Automatic Speaker Recognition Using Mel-Frequency Cepstral Coefficients Through Machine Learning
    Ayvaz, Ugur
    Guruler, Huseyin
    Khan, Faheem
    Ahmed, Naveed
    Whangbo, Taegkeun
    Bobomirzaevich, Abdusalomov Akmalbek
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (03): : 5511 - 5521
  • [2] Automatic Speaker Recognition Based on Mel-Frequency Cepstral Coefficients and Gaussian Mixture Models
    Memon, Sheeraz
    Bhatti, Sania
    Abro, Farzana Rauf
    [J]. MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2013, 32 (04) : 543 - 550
  • [3] One Solution of Extension of Mel-Frequency Cepstral Coefficients Feature Vector for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2020, 49 (02): : 224 - 236
  • [4] Speaker independent phoneme recognition based on fractal dimension (DF) and the mel-frequency cepstral coefficients features
    Fekkai, S
    Al-Akaidi, M
    Blackledge, JM
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4014 - 4014
  • [5] Fingerprint Recognition Using Mel-Frequency Cepstral Coefficients
    Hashad F.G.
    Halim T.M.
    Diab S.M.
    Sallam B.M.
    El-Samie F.E.A.
    [J]. Pattern Recognition and Image Analysis, 2010, 20 (3) : 360 - 369
  • [6] Automatic recognition of birdsongs using mel-frequency cepstral coefficients and vector quantization
    Lee, Chang-Hsing
    Lien, Cheng-Chang
    Huang, Ren-Zhuang
    [J]. IMECS 2006: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, 2006, : 331 - +
  • [7] Voice Recognition and Marking Using Mel-frequency Cepstral Coefficients
    Sheu, Jia-Shing
    Chen, Ching-Wen
    [J]. SENSORS AND MATERIALS, 2020, 32 (10) : 3209 - 3220
  • [8] Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems
    Md. Jahangir Alam
    Patrick Kenny
    Douglas O’Shaughnessy
    [J]. Cognitive Computation, 2013, 5 : 533 - 544
  • [9] Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems
    Alam, Md. Jahangir
    Kenny, Patrick
    O'Shaughnessy, Douglas
    [J]. COGNITIVE COMPUTATION, 2013, 5 (04) : 533 - 544
  • [10] Mel Frequency Cepstral Coefficients Based Text Independent Automatic Speaker Recognition Using Matlab
    Singh, Amit Kumar
    Singh, Rohit
    Dwivedi, Ashutosh
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON RELIABILTY, OPTIMIZATION, & INFORMATION TECHNOLOGY (ICROIT 2014), 2014, : 524 - 527