Experimental Evaluation of Features for Robust Speaker Identification

被引:145
|
作者
Reynolds, Douglas A. [1 ]
机构
[1] MIT, Lincoln Lab, Lexington, MA 02173 USA
来源
关键词
Communication channels (information theory) - Database systems - Digital filters - Identification (control systems) - Iterative methods - Mathematical models - Matrix algebra - Robustness (control systems) - Speech analysis - Speech processing - Vectors;
D O I
10.1109/89.326623
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This correspondence presents an experimental evaluation of different features and channel compensation techniques for robust speaker identification. The goal is to keep all processing and classification steps constant and to vary only the features and compensations used to allow a controlled comparison. A general, maximum-likelihood classifier based on Gaussian mixture densities is used as the classifier, and experiments are conducted on the King speech database, a conversational, telephone-speech database. The features examined are mel-frequency and linear-frequency filterbank cepstral coefficients, linear prediction cepstral coefficients, and perceptual linear prdiction (PLP) cepstral coefficients. The channel compensation techniques examined are cepstral mean removal, RASTA processing, and a quadratic trend removal technique. It is shown for this database that performance differences between the basic features is small, and the major gains are due to the channel compensation techniques. The best "across-the-divide" recognition accuracy of 92% is obtained for both high-order LPC features and band-limited filterbank features.
引用
收藏
页码:639 / 643
页数:5
相关论文
共 50 条
  • [1] Fusion features for robust speaker identification
    Ben Fredj, Ines
    Zouhir, Youssef
    Ouni, Kais
    [J]. INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2018, 11 (02) : 65 - 72
  • [2] Robust Q Features for Speaker Identification
    Deshpande, Mangesh S.
    Holambe, Raghunath S.
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 209 - 213
  • [3] Bispectrum features for robust speaker identification
    Wenndt, S
    Shamsunder, S
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1095 - 1098
  • [4] Robust prosodic features for speaker identification
    Carey, MJ
    Parris, ES
    LloydThomas, H
    Bennett, S
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1800 - 1803
  • [5] Modulation Features for Noise Robust Speaker Identification
    Mitra, Vikramjit
    McLaren, Mitchel
    Franco, Horacio
    Graciarena, Martin
    Scheffer, Nicolas
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3670 - 3674
  • [6] Robust Speaker Identification Incorporating High Frequency Features
    Latha
    [J]. TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 804 - 811
  • [7] Robust lip-motion features for speaker identification
    Çetingül, HE
    Yemez, Y
    Erzin, E
    Tekalp, AM
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 509 - 512
  • [8] Speaker Identification based on Robust AM-FM Features
    Deshpande, Mangesh S.
    Holambe, Raghunath S.
    [J]. 2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 62 - +
  • [9] Robust speech features based on wavelet transform with application to speaker identification
    Hsieh, CT
    Lai, E
    Wang, YC
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2002, 149 (02): : 108 - 114
  • [10] Robust FHPD Features from Speech Harmonic Analysis for Speaker Identification
    Wang, Shuiping
    Tang, Zhenmin
    Jiang, Ye
    Chen, Ying
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (04): : 1591 - 1598