SURVEY AND EVALUATION OF ACOUSTIC FEATURES FOR SPEAKER RECOGNITION

被引:0
|
作者
Lawson, A. [1 ]
Vabishchevich, P. [1 ]
Huggins, M.
Ardis, P. [1 ]
Battles, B. [1 ]
Stauffer, A. [1 ]
机构
[1] RADC Inc, Rome, NY USA
关键词
speaker recognition; acoustic features; feature fusion;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study seeks to quantify the effectiveness of a broad range of acoustic features for speaker identification and their impact in feature fusion. Sixteen different acoustic features are evaluated under nine different acoustic, channel and speaking style conditions. Three major types of features are examined: traditional (MFCC, PLP, LPCC, etc.), innovative (PYKFEC, MVDR, etc.) and extensions of these (frequency-constrained LPCC, LFCC). All features were then fused in binary and three-way fusion to determine the complementarity between features and their impact on accuracy. Results were surprising, with the MVDR feature having the highest performance for any single feature, and LPCC based features having the greatest impact on fusion effectiveness. Commonly used features like PLP and MFCC did not achieve the best results in any category. It was further found that removing the perceptually-motivated warping from MFCC, MVDR and PYKFEC improved the performance of these features significantly.
引用
下载
收藏
页码:5444 / 5447
页数:4
相关论文
共 50 条
  • [41] FM Features for Automatic Forensic Speaker Recognition
    Thiruvaran, Tharmarajah
    Ambikairajah, Eliathamby
    Epps, Julien
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1497 - 1500
  • [42] Speaker recognition using prosodic and lexical features
    Kajarekar, S
    Ferrer, L
    Venkataraman, A
    Sonmez, K
    Shriberg, E
    Stolcke, A
    Bratt, H
    Gadde, RR
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 19 - 24
  • [43] Looking for relevant features for speaker role recognition
    IRIT, Unversité de Toulouse, 118 route de Narbonne, F-31062 Toulouse Cedex 9, France
    Proc. Annu. Conf. Int. Speech Commun. Assoc., INTERSPEECH, (1057-1060):
  • [44] Time frequency features for automatic speaker recognition
    Shahrood University of Technology, Faculty of Electrical Engineering and Robotics, Shahrood, Iran
    WSEAS Trans. Commun., 2006, 12 (2148-2154):
  • [45] STUDY OF MUTUAL INFORMATION FOR SPEAKER RECOGNITION FEATURES
    Garcia, Guillermo
    Eriksson, Thomas
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 601 - 605
  • [46] GRAPH RELATIONAL FEATURES FOR SPEAKER RECOGNITION AND MINING
    Karam, Zahi N.
    Campbell, William M.
    Dehak, Najim
    2011 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2011, : 525 - 528
  • [47] Learning statistically efficient features for speaker recognition
    Jang, GJ
    Lee, TW
    Oh, YH
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 437 - 440
  • [48] Subspace Analysis of Spectral Features for Speaker Recognition
    Chen, Ling
    Man, Hong
    Jia, Huading
    Wang, Zhiyi
    Wang, Lei
    Li, Zili
    2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 98 - 102
  • [49] Connectionist Transformation Network Features for Speaker Recognition
    Abad, Alberto
    Luque, Jordi
    ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 20 - 27
  • [50] Looking for relevant features for speaker role recognition
    Bigot, Benjamin
    Pinquier, Julien
    Ferrane, Isabelle
    Andre-Obrecht, Regine
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1057 - 1060