Comparative study of several novel acoustic features for speaker recognition

Cited by: 0

Authors:

Pervouchine, Vladimir [1]

Leedham, Graham [1]

Zhong, Haishan [1]

Cho, David [1]

Li, Haizhou [1]

Affiliation:

[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore

Source:

BIOSIGNALS 2008: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON BIO-INSPIRED SYSTEMS AND SIGNAL PROCESSING, VOL 1 | 2008

Keywords:

speaker recognition; feature extraction; feature evaluation

DOI:

Not available

Chinese Library Classification number:

R318 [Biomedical Engineering]

Discipline classification code:

0831

Abstract:

Finding good features that represent speaker identity is an important problem in speaker recognition. Recently, a number of novel acoustic features have been proposed for this task. Researchers use different data sets, and sometimes different classifiers, to evaluate the features and compare them to baselines such as MFCC or LPCC. Because the experimental conditions differ, however, direct comparison of those features with each other is difficult or impossible. This paper presents a study of five recently proposed acoustic features using the same data (NIST 2001 SRE) and the same UBM-GMM classifier. The results are presented as DET curves with equal error rates indicated. In addition, an SVM-based combination of the GMM scores produced on different features has been made to determine whether the new features carry any complementary information. The results for the different features, as well as for their combinations, are directly comparable to each other and to those obtained with the baseline MFCC features.
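As a rough illustration of the equal error rate mentioned in the abstract, the sketch below finds the operating point where the miss rate and the false-alarm rate are closest. It is not the authors' evaluation code. The function name `equal_error_rate`, the convention that a higher score means "more likely the target speaker", and the simple threshold sweep over the observed scores (without interpolation) are all assumptions made for this example.

```python
import numpy as np


def equal_error_rate(target_scores, impostor_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    Assumption: a trial is accepted when its score is >= the threshold.
    Returns (eer, threshold) at the point where the miss rate and the
    false-alarm rate are closest to each other.
    """
    target = np.asarray(target_scores, dtype=float)
    impostor = np.asarray(impostor_scores, dtype=float)

    # Candidate thresholds: every score that actually occurs.
    thresholds = np.sort(np.concatenate([target, impostor]))

    # Miss rate: fraction of target trials rejected at each threshold.
    miss = np.array([(target < t).mean() for t in thresholds])
    # False-alarm rate: fraction of impostor trials accepted at each threshold.
    false_alarm = np.array([(impostor >= t).mean() for t in thresholds])

    # The EER lies where the two error curves cross; take the closest point.
    idx = int(np.argmin(np.abs(miss - false_alarm)))
    eer = (miss[idx] + false_alarm[idx]) / 2.0
    return eer, thresholds[idx]


if __name__ == "__main__":
    # Toy scores, invented for illustration only (not results from the paper).
    eer, thr = equal_error_rate([2.0, 3.0, 4.0, 5.0], [0.0, 1.0, 2.5, 1.5])
    print(f"EER = {eer:.2f} at threshold {thr}")
```

A DET curve plots the same two error rates against each other on normal-deviate axes, so the EER is the point where that curve meets the diagonal.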

Pages: 220-223

Page count: 4
