Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features

被引:34
|
作者
Wang, Ning [1 ]
Ching, P. C. [1 ]
Zheng, Nengheng [2 ]
Lee, Tan [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Shatin, Hong Kong, Peoples R China
[2] Shenzhen Univ, Coll Informat Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Robust parameter estimation; source-tract features; speaker recognition; spectral subtraction; REPRESENTATIONS; NOISE;
D O I
10.1109/TASL.2010.2045800
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
To alleviate the problem of severe degradation of speaker recognition performance under noisy environments because of inadequate and inaccurate speaker-discriminative information, a method of robust feature estimation that can capture both vocal source-and vocal tract-related characteristics from noisy speech utterances is proposed. Spectral subtraction, a simple yet useful speech enhancement technique, is employed to remove the noise-specific components prior to the feature extraction process. It has been shown through analytical derivation, as well as by simulation results, that the proposed feature estimation method leads to robust recognition performance, especially at low signal-to-noise ratios. In the context of Gaussian mixture model-based speaker recognition with the presence of additive white Gaussian noise, the new approach produces consistent reduction of both identification error rate and equal error rate at signal-to-noise ratios ranging from 0 to 15 dB.
引用
收藏
页码:196 / 205
页数:10
相关论文
共 50 条
  • [21] NORMALIZING THE VOCAL-TRACT LENGTH FOR SPEAKER-INDEPENDENT SPEECH RECOGNITION
    LIN, QG
    CHE, CW
    IEEE SIGNAL PROCESSING LETTERS, 1995, 2 (11) : 201 - 203
  • [22] SPEAKER VARIATION AND VOCAL-TRACT SIZE
    MATTINGLY, IG
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1966, 39 (06): : 1219 - +
  • [23] Speaker adaptive modeling by vocal tract normalization
    Welling, L
    Ney, H
    Kanthak, S
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (06): : 415 - 426
  • [25] Experiments on using Vocal Tract Estimates of Nasal Stops for Speaker Verification
    Enzinger, Ewald
    Kasess, Christian H.
    2013 7TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN - COMPUTER DIALOGUE (SPED), 2013,
  • [26] Fast and robust joint estimation of vocal tract and voice source parameters
    Ding, W
    Campbell, N
    Higuchi, N
    Kasuya, H
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1291 - 1294
  • [27] Study of the Input Impedance of the Vocal Tract - Coupling between Source and Vocal Tract.
    Mrayati, M.
    Guerin, B.
    Boe, L.J.
    Acustica, 1976, 35 (05): : 330 - 340
  • [28] Vocal imitation using physical vocal tract model
    Kanda, Hisashi
    Ogata, Tetsuya
    Kornatani, Kazunori
    Okuno, Hiroshi G.
    2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, 2007, : 1852 - 1857
  • [29] JOINT ANALYSIS OF VOCAL TRACT LENGTH AND TEMPORAL INFORMATION FOR ROBUST SPEECH RECOGNITION
    Huang, Chien-Lin
    Hori, Chiori
    Kashioka, Hideki
    Ma, Bin
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7432 - 7436
  • [30] Joint estimation of glottal source and vocal tract for vocal synthesis using Kalman smoothing and EM algorithm
    Jinachitra, P
    Smith, JO
    2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 327 - 330