SURVEY AND EVALUATION OF ACOUSTIC FEATURES FOR SPEAKER RECOGNITION

Authors
Lawson, A. [1]
Vabishchevich, P. [1]
Huggins, M.
Ardis, P. [1]
Battles, B. [1]
Stauffer, A. [1]
Affiliations
[1] RADC Inc, Rome, NY USA
Keywords
speaker recognition; acoustic features; feature fusion
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This study seeks to quantify the effectiveness of a broad range of acoustic features for speaker identification and their impact in feature fusion. Sixteen different acoustic features are evaluated under nine different acoustic, channel and speaking-style conditions. Three major types of features are examined: traditional (MFCC, PLP, LPCC, etc.), innovative (PYKFEC, MVDR, etc.) and extensions of these (frequency-constrained LPCC, LFCC). All features were then combined in binary and three-way fusion to determine the complementarity between features and their impact on accuracy. The results were surprising: the MVDR feature achieved the highest performance of any single feature, and LPCC-based features had the greatest impact on fusion effectiveness. Commonly used features such as PLP and MFCC did not achieve the best results in any category. It was further found that removing the perceptually motivated warping from MFCC, MVDR and PYKFEC significantly improved the performance of these features.
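To give a sense of the scale of the fusion experiments the abstract describes, the following is a minimal sketch that enumerates every binary and three-way combination of sixteen features. It assumes the fusion was exhaustive over unordered feature subsets, which the abstract implies but does not state. The feature names are placeholders (the abstract does not list all sixteen), and `fuse_scores` uses equal-weight score averaging purely as an illustrative assumption, not as the authors' fusion rule.

```python
from itertools import combinations

# Placeholder names: the abstract does not list all sixteen features.
FEATURES = [f"feat_{i:02d}" for i in range(1, 17)]


def fusion_sets(features, k):
    """Return every unordered k-feature subset that could be fused."""
    return list(combinations(features, k))


def fuse_scores(scores_by_feature, subset):
    """Equal-weight score averaging (an assumed rule, not the paper's)."""
    return sum(scores_by_feature[name] for name in subset) / len(subset)


pairs = fusion_sets(FEATURES, 2)
triples = fusion_sets(FEATURES, 3)
print(len(pairs), len(triples))  # 120 560
```

Under these assumptions, sixteen features yield 120 binary and 560 three-way fusions, each of which would be scored under all nine conditions.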
Pages: 5444-5447
Page count: 4