Comparative study of several novel acoustic features for speaker recognition

Citations: 0
Authors
Pervouchine, Vladimir [1 ]
Leedham, Graham [1 ]
Zhong, Haishan [1 ]
Cho, David [1 ]
Li, Haizhou [1 ]
Affiliation
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
Keywords
speaker recognition; feature extraction; feature evaluation
DOI
Not available
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Finding good features that represent speaker identity is an important problem in speaker recognition. Recently, a number of novel acoustic features have been proposed for this task. Researchers use different data sets, and sometimes different classifiers, to evaluate these features and compare them to baselines such as MFCC or LPCC. However, because the experimental conditions differ, direct comparison of those features to each other is difficult or impossible. This paper presents a study of five recently proposed acoustic features using the same data (NIST 2001 SRE) and the same UBM-GMM classifier. The results are presented as DET curves with equal error rates indicated. In addition, an SVM-based combination of the GMM scores produced on different features is used to determine whether the new features carry any complementary information. The results for the individual features and for their combinations are directly comparable to each other and to those obtained with the baseline MFCC features.
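The equal error rate marked on a DET curve is the operating point at which the miss rate and the false-alarm rate coincide. The following is a minimal sketch of that computation in plain Python, not the authors' code: the function name `equal_error_rate` and the toy scores are hypothetical, and a trial is assumed to be accepted when its score is at or above the threshold.

```python
def equal_error_rate(target_scores, impostor_scores):
    """Return (eer, threshold) at the point where the miss rate and the
    false-alarm rate are closest (hypothetical helper, illustration only)."""
    # Candidate thresholds: every distinct score that was observed.
    thresholds = sorted(set(target_scores) | set(impostor_scores))
    best = None
    for t in thresholds:
        # Assumption: a trial is accepted when its score is >= t.
        miss = sum(s < t for s in target_scores) / len(target_scores)
        false_alarm = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        gap = abs(miss - false_alarm)
        if best is None or gap < best[0]:
            # Report the mean of the two rates at the closest point.
            best = (gap, (miss + false_alarm) / 2, t)
    return best[1], best[2]


# Toy scores, invented for illustration (not from NIST 2001 SRE).
targets = [0.9, 0.8, 0.7, 0.4]
impostors = [0.6, 0.3, 0.2, 0.1]
eer, threshold = equal_error_rate(targets, impostors)
print(eer, threshold)
```

With real trial lists the two rates rarely cross exactly at an observed score, which is why the sketch reports the mean of the two rates at the closest threshold rather than interpolating.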
Pages: 220-223
Page count: 4
Related papers
50 in total
  • [1] Acoustic and facial features for speaker recognition
    Roach, MJ
    Brand, JD
    Mason, JSD
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 258 - 261
  • [2] Fusion of acoustic and tokenization features for speaker recognition
    Tong, Rong
    Ma, Bin
    Lee, Kong-Aik
    You, Changhuai
    Zhu, Donglai
    Kinnunen, Tomi
    Sun, Hanwu
    Dong, Minghui
    Chng, Eng-Siong
    Li, Haizhou
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 566 - +
  • [3] SURVEY AND EVALUATION OF ACOUSTIC FEATURES FOR SPEAKER RECOGNITION
    Lawson, A.
    Vabishchevich, P.
    Huggins, M.
    Ardis, P.
    Battles, B.
    Stauffer, A.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5444 - 5447
  • [4] Integration of complementary acoustic features for speaker recognition
    Zheng, Nengheng
    Lee, Tan
    Ching, P. C.
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (03) : 181 - 184
  • [5] CONTOUR MODELING OF PROSODIC AND ACOUSTIC FEATURES FOR SPEAKER RECOGNITION
    Kockmann, Marcel
    Burget, Lukas
    [J]. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 45 - 48
  • [6] Improvement of speaker recognition by combining residual and prosodic features with acoustic features
    Chen, SH
    Wang, HC
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 93 - 96
  • [7] Study of harmonic features for the speaker recognition
    Univ of Maribor, Maribor, Slovenia
    [J]. Speech Commun, 4 (385-402):
  • [8] A study of harmonic features for the speaker recognition
    Imperl, B
    Kacic, Z
    Horvat, B
    [J]. SPEECH COMMUNICATION, 1997, 22 (04) : 385 - 402
  • [9] Multi-View Learning of Acoustic Features for Speaker Recognition
    Livescu, Karen
    Stoehr, Mark
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 82 - +
  • [10] Using Genetic Algorithms to Weight Acoustic Features for Speaker Recognition
    Zamalloa, Maider
    Bordel, German
    Javier Rodriguez, Luis
    Penagarikano, Mikel
    Uribe, Juan Pedro
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 909 - +