SURVEY AND EVALUATION OF ACOUSTIC FEATURES FOR SPEAKER RECOGNITION

Authors
Lawson, A. [1]
Vabishchevich, P. [1]
Huggins, M.
Ardis, P. [1]
Battles, B. [1]
Stauffer, A. [1]
Affiliation
[1] RADC Inc, Rome, NY USA
Keywords
speaker recognition; acoustic features; feature fusion
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This study seeks to quantify the effectiveness of a broad range of acoustic features for speaker identification and their impact in feature fusion. Sixteen acoustic features are evaluated under nine acoustic, channel, and speaking-style conditions. Three major types of features are examined: traditional (MFCC, PLP, LPCC, etc.), innovative (PYKFEC, MVDR, etc.), and extensions of these (frequency-constrained LPCC, LFCC). All features were then combined in binary and three-way fusion to determine the complementarity between features and their impact on accuracy. The results were surprising: the MVDR feature had the highest performance of any single feature, and LPCC-based features had the greatest impact on fusion effectiveness. Commonly used features such as PLP and MFCC did not achieve the best results in any category. It was further found that removing the perceptually motivated warping from MFCC, MVDR, and PYKFEC improved the performance of these features significantly.
Pages: 5444-5447
Number of pages: 4