Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition

被引：0

作者：

Jokic, Ivan D. ^{[1
]}

Jokic, Stevan D. ^{[1
]}

Delic, Vlado D. ^{[1
]}

Peric, Zoran H. ^{[2
]}

机构：

[1] Univ Novi Sad, Fac Tech Sci, Trg Dositeja Obradovica 6, Novi Sad 21000, Serbia

[2] Univ Nis, Fac Elect Engn, Nish 18000, Serbia

来源：

2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR) | 2015年

关键词：

Automatic speaker recognition; auditory critical bands; covariance matrix; exponential auditory critical bands; mel-frequency cepstral coefficients; multidimensional Gaussian distribution;

D O I：

暂无

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

Automatic speaker recognizer can be based on the use of mel-frequency cepstral coefficients as speaker features. Mel-frequency cepstral coefficients depend on energy inside considered auditory critical bands. These auditory critical bands model masking phenomena. Application of triangular auditory critical bands results in better recognition accuracy with respect to the case when rectangular auditory critical bands are applied. Recognition accuracy when exponential auditory critical bands are applied outperforms recognition accuracy of automatic speaker recognizer when triangular or rectangular auditory critical bands are applied. Application of transformation on elements of speaker model, which target decreasing of difference between testing and training models of the same speaker, can increase recognition accuracy.

引用

页码：419 / 424

页数：6

共 50 条

[1] Automatic Speaker Recognition Using Mel-Frequency Cepstral Coefficients Through Machine Learning
Ayvaz, Ugur
Guruler, Huseyin
Khan, Faheem
Ahmed, Naveed
Whangbo, Taegkeun
Bobomirzaevich, Abdusalomov Akmalbek
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (03): : 5511 - 5521
[2] Automatic Speaker Recognition Based on Mel-Frequency Cepstral Coefficients and Gaussian Mixture Models
Memon, Sheeraz
Bhatti, Sania
Abro, Farzana Rauf
MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2013, 32 (04) : 543 - 550
[3] One Solution of Extension of Mel-Frequency Cepstral Coefficients Feature Vector for Automatic Speaker Recognition
Jokic, Ivan D.
Jokic, Stevan D.
Delic, Vlado D.
Peric, Zoran H.
INFORMATION TECHNOLOGY AND CONTROL, 2020, 49 (02): : 224 - 236
[4] Speaker independent phoneme recognition based on fractal dimension (DF) and the mel-frequency cepstral coefficients features
Fekkai, S
Al-Akaidi, M
Blackledge, JM
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4014 - 4014
[5] Fingerprint Recognition Using Mel-Frequency Cepstral Coefficients
Hashad F.G.
Halim T.M.
Diab S.M.
Sallam B.M.
El-Samie F.E.A.
Pattern Recognition and Image Analysis, 2010, 20 (03) : 360 - 369
[6] Automatic recognition of birdsongs using mel-frequency cepstral coefficients and vector quantization
Lee, Chang-Hsing
Lien, Cheng-Chang
Huang, Ren-Zhuang
IMECS 2006: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, 2006, : 331 - +
[7] Voice Recognition and Marking Using Mel-frequency Cepstral Coefficients
Sheu, Jia-Shing
Chen, Ching-Wen
SENSORS AND MATERIALS, 2020, 32 (10) : 3209 - 3220
[8] Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems
Md. Jahangir Alam
Patrick Kenny
Douglas O’Shaughnessy
Cognitive Computation, 2013, 5 : 533 - 544
[9] Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems
Alam, Md. Jahangir
Kenny, Patrick
O'Shaughnessy, Douglas
COGNITIVE COMPUTATION, 2013, 5 (04) : 533 - 544
[10] Mel Frequency Cepstral Coefficients Based Text Independent Automatic Speaker Recognition Using Matlab
Singh, Amit Kumar
Singh, Rohit
Dwivedi, Ashutosh
PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON RELIABILTY, OPTIMIZATION, & INFORMATION TECHNOLOGY (ICROIT 2014), 2014, : 524 - 527

← 1 2 3 4 5 →