Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features

被引:34
|
作者
Wang, Ning [1 ]
Ching, P. C. [1 ]
Zheng, Nengheng [2 ]
Lee, Tan [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Shatin, Hong Kong, Peoples R China
[2] Shenzhen Univ, Coll Informat Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Robust parameter estimation; source-tract features; speaker recognition; spectral subtraction; REPRESENTATIONS; NOISE;
D O I
10.1109/TASL.2010.2045800
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
To alleviate the problem of severe degradation of speaker recognition performance under noisy environments because of inadequate and inaccurate speaker-discriminative information, a method of robust feature estimation that can capture both vocal source-and vocal tract-related characteristics from noisy speech utterances is proposed. Spectral subtraction, a simple yet useful speech enhancement technique, is employed to remove the noise-specific components prior to the feature extraction process. It has been shown through analytical derivation, as well as by simulation results, that the proposed feature estimation method leads to robust recognition performance, especially at low signal-to-noise ratios. In the context of Gaussian mixture model-based speaker recognition with the presence of additive white Gaussian noise, the new approach produces consistent reduction of both identification error rate and equal error rate at signal-to-noise ratios ranging from 0 to 15 dB.
引用
收藏
页码:196 / 205
页数:10
相关论文
共 50 条
  • [1] Robust speaker recognition using both vocal source and vocal tract features estimated from noisy input utterances
    Wang, Ning
    Ching, P. C.
    Zheng, N. H.
    Lee, Tan
    2007 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1-3, 2007, : 886 - 891
  • [2] Discrimination power of vocal source and vocal tract related features for speaker segmentation
    Chan, Wai Nang
    Zheng, Nengheng
    Lee, Tan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (06): : 1884 - 1892
  • [3] Speaker recognition using vocal source model
    Sorokin V.N.
    Tananykin A.A.
    Trunov V.G.
    Pattern Recognition and Image Analysis, 2014, 24 (1) : 156 - 173
  • [4] Speaker verification using complementary information from vocal source and vocal tract
    Zheng, Nengheng
    Wang, Ning
    Lee, Tan
    Ching, P. C.
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 518 - +
  • [5] Vocal Source Contribution to Speaker Recognition
    Sorokin V.N.
    Sorokin, V.N. (vns@iitp.ru), 2018, Pleiades journals (28) : 546 - 556
  • [6] Speaker clustering for speech recognition using vocal tract parameters
    Naito, M
    Deng, L
    Sagisaka, Y
    SPEECH COMMUNICATION, 2002, 36 (3-4) : 305 - 315
  • [7] Spectral Characteristics of Vocal Tract for Speaker Recognition
    Sigmund, Milan
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (1A): : 17 - 19
  • [8] Combining vocal source and MFCC features for enhanced speaker recognition performance using GMMs
    Hosseinzadeh, Danoush
    Krishnan, Sridhar
    2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 365 - 368
  • [9] Use of vocal source features in speaker segmentation
    Chan, W. N.
    Lee, Tan
    Zheng, Nengheng
    Hua Ouyang
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 657 - 660
  • [10] Using Haar transformed vocal source information for automatic speaker recognition
    Zheng, NH
    Ching, PC
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 77 - 80