Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition

被引:0
|
作者
Jokic, Ivan D. [1 ]
Jokic, Stevan D. [1 ]
Delic, Vlado D. [1 ]
Peric, Zoran H. [2 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Trg Dositeja Obradovica 6, Novi Sad 21000, Serbia
[2] Univ Nis, Fac Elect Engn, Nish 18000, Serbia
关键词
Automatic speaker recognition; auditory critical bands; covariance matrix; exponential auditory critical bands; mel-frequency cepstral coefficients; multidimensional Gaussian distribution;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Automatic speaker recognizer can be based on the use of mel-frequency cepstral coefficients as speaker features. Mel-frequency cepstral coefficients depend on energy inside considered auditory critical bands. These auditory critical bands model masking phenomena. Application of triangular auditory critical bands results in better recognition accuracy with respect to the case when rectangular auditory critical bands are applied. Recognition accuracy when exponential auditory critical bands are applied outperforms recognition accuracy of automatic speaker recognizer when triangular or rectangular auditory critical bands are applied. Application of transformation on elements of speaker model, which target decreasing of difference between testing and training models of the same speaker, can increase recognition accuracy.
引用
收藏
页码:419 / 424
页数:6
相关论文
共 50 条
  • [21] Mel-Frequency Cepstral Coefficient Analysis in Speech Recognition
    On, Chin Kim
    Pandiyan, Paulraj M.
    Yaacob, Sazali
    Saudi, Azali
    2006 INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS (ICOCI 2006), 2006, : 291 - +
  • [22] Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures
    Darch, Jonathan
    Milner, Ben
    Vaseghi, Saeed
    Journal of the Acoustical Society of America, 2009, 124 (06): : 3989 - 4000
  • [23] Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures
    Darch, Jonathan
    Milner, Ben
    Vaseghi, Saeed
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 124 (06): : 3989 - 4000
  • [24] Identification of Language using Mel-Frequency Cepstral Coefficients (MFCC)
    Koolagudi, Shashidhar G.
    Rastogi, Deepika
    Rao, K. Sreenivasa
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 3391 - 3398
  • [25] Speaker Recognition Using Mel-Frequency Cepstrum Coefficients and Sum Square Error
    Charisma, Atik
    Hidayat, M. Reza
    Zainal, Yuda Bakti
    2017 3RD INTERNATIONAL CONFERENCE ON WIRELESS AND TELEMATICS (ICWT), 2017, : 160 - 163
  • [26] On the Inversion of Mel-Frequency Cepstral Coefficients for Speech Enhancement Applications
    Boucheron, Laura E.
    De Leon, Phillip L.
    ICSES 2008 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS, CONFERENCE PROCEEDINGS, 2008, : 485 - 488
  • [27] Using Mel-Frequency Cepstral Coefficients in Missing Data Technique
    Zhang Jun
    Sam Kwong
    Wei Gang
    Qingyang Hong
    EURASIP Journal on Advances in Signal Processing, 2004
  • [28] Using Mel-Frequency Cepstral Coefficients in Missing Data Technique
    Jun, Z. (zhj_angun@sina.com.cn), 1600, Hindawi Publishing Corporation (2004):
  • [29] Multiple time resolutions for derivatives of mel-frequency cepstral coefficients
    Stemmer, G
    Hacker, C
    Nöth, E
    Niemann, H
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 37 - 40
  • [30] Using Mel-frequency cepstral coefficients in missing data technique
    Jun, Z
    Kwong, S
    Gang, W
    Hong, QY
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (03) : 340 - 346