Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features

被引：34

作者：

Wang, Ning ^{[1
]}

Ching, P. C. ^{[1
]}

Zheng, Nengheng ^{[2
]}

Lee, Tan ^{[1
]}

机构：

[1] Chinese Univ Hong Kong, Dept Elect Engn, Shatin, Hong Kong, Peoples R China

[2] Shenzhen Univ, Coll Informat Engn, Shenzhen 518060, Peoples R China

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Robust parameter estimation; source-tract features; speaker recognition; spectral subtraction; REPRESENTATIONS; NOISE;

D O I：

10.1109/TASL.2010.2045800

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

To alleviate the problem of severe degradation of speaker recognition performance under noisy environments because of inadequate and inaccurate speaker-discriminative information, a method of robust feature estimation that can capture both vocal source-and vocal tract-related characteristics from noisy speech utterances is proposed. Spectral subtraction, a simple yet useful speech enhancement technique, is employed to remove the noise-specific components prior to the feature extraction process. It has been shown through analytical derivation, as well as by simulation results, that the proposed feature estimation method leads to robust recognition performance, especially at low signal-to-noise ratios. In the context of Gaussian mixture model-based speaker recognition with the presence of additive white Gaussian noise, the new approach produces consistent reduction of both identification error rate and equal error rate at signal-to-noise ratios ranging from 0 to 15 dB.

引用

页码：196 / 205

页数：10

共 50 条

[1] Robust speaker recognition using both vocal source and vocal tract features estimated from noisy input utterances
Wang, Ning
Ching, P. C.
Zheng, N. H.
Lee, Tan
2007 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1-3, 2007, : 886 - 891
[2] Discrimination power of vocal source and vocal tract related features for speaker segmentation
Chan, Wai Nang
Zheng, Nengheng
Lee, Tan
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (06): : 1884 - 1892
[3] Speaker recognition using vocal source model
Sorokin V.N.
Tananykin A.A.
Trunov V.G.
Pattern Recognition and Image Analysis, 2014, 24 (1) : 156 - 173
[4] Speaker verification using complementary information from vocal source and vocal tract
Zheng, Nengheng
Wang, Ning
Lee, Tan
Ching, P. C.
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 518 - +
[5] Vocal Source Contribution to Speaker Recognition
Sorokin V.N.
Sorokin, V.N. (vns@iitp.ru), 2018, Pleiades journals (28) : 546 - 556
[6] Speaker clustering for speech recognition using vocal tract parameters
Naito, M
Deng, L
Sagisaka, Y
SPEECH COMMUNICATION, 2002, 36 (3-4) : 305 - 315
[7] Spectral Characteristics of Vocal Tract for Speaker Recognition
Sigmund, Milan
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (1A): : 17 - 19
[8] Combining vocal source and MFCC features for enhanced speaker recognition performance using GMMs
Hosseinzadeh, Danoush
Krishnan, Sridhar
2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 365 - 368
[9] Use of vocal source features in speaker segmentation
Chan, W. N.
Lee, Tan
Zheng, Nengheng
Hua Ouyang
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 657 - 660
[10] Using Haar transformed vocal source information for automatic speaker recognition
Zheng, NH
Ching, PC
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 77 - 80

← 1 2 3 4 5 →