Robust Speaker Recognition Using Denoised Vocal Source and Vocal Tract Features

Cited by: 34
Authors
Wang, Ning [1]
Ching, P. C. [1]
Zheng, Nengheng [2]
Lee, Tan [1]
Affiliations
[1] Chinese Univ Hong Kong, Dept Elect Engn, Shatin, Hong Kong, Peoples R China
[2] Shenzhen Univ, Coll Informat Engn, Shenzhen 518060, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Robust parameter estimation; source-tract features; speaker recognition; spectral subtraction; REPRESENTATIONS; NOISE;
DOI
10.1109/TASL.2010.2045800
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206 ; 082403 ;
Abstract
Speaker recognition performance degrades severely in noisy environments because the speaker-discriminative information extracted from noisy speech is inadequate and inaccurate. To alleviate this problem, a method of robust feature estimation that can capture both vocal source- and vocal tract-related characteristics from noisy speech utterances is proposed. Spectral subtraction, a simple yet useful speech enhancement technique, is employed to remove the noise-specific components prior to the feature extraction process. It is shown through analytical derivation, as well as by simulation results, that the proposed feature estimation method leads to robust recognition performance, especially at low signal-to-noise ratios. In the context of Gaussian mixture model-based speaker recognition in the presence of additive white Gaussian noise, the new approach produces a consistent reduction of both identification error rate and equal error rate at signal-to-noise ratios ranging from 0 to 15 dB.
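The abstract names spectral subtraction as the enhancement step applied before feature extraction. The record gives no implementation details, so the following is only a minimal sketch of generic magnitude spectral subtraction, not the authors' method. The function name, frame length, hop size, spectral floor, and the assumption that the first few frames contain noise only are all illustrative choices introduced here.

```python
import numpy as np

def spectral_subtraction(noisy, frame_len=256, hop=128, noise_frames=5, floor=0.01):
    """Generic magnitude spectral subtraction (illustrative sketch).

    Assumption: the first `noise_frames` frames contain noise only and are
    used to estimate the average noise magnitude spectrum.
    """
    noisy = np.asarray(noisy, dtype=float)
    # Periodic Hann window: shifted copies sum to 1 at 50% overlap,
    # so plain overlap-add reconstructs the interior of the signal.
    window = np.hanning(frame_len + 1)[:-1]
    n_frames = 1 + (len(noisy) - frame_len) // hop

    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        noisy[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])

    spectra = np.fft.rfft(frames, axis=1)
    magnitude = np.abs(spectra)
    phase = np.angle(spectra)

    # Average noise magnitude from the assumed noise-only leading frames.
    noise_mag = magnitude[:noise_frames].mean(axis=0)

    # Subtract the noise estimate; clamp to a small floor so that no
    # magnitude becomes negative.
    cleaned = np.maximum(magnitude - noise_mag, floor * noise_mag)

    # Resynthesize with the noisy phase and overlap-add the frames.
    enhanced = np.fft.irfft(cleaned * np.exp(1j * phase), n=frame_len, axis=1)
    out = np.zeros(len(noisy))
    for i in range(n_frames):
        out[i * hop : i * hop + frame_len] += enhanced[i]
    # Samples past the last full frame are left at zero.
    return out
```

In a pipeline like the one the abstract describes, the enhanced signal would then be passed to the feature extractor. A fixed noise estimate like this one suits stationary noise such as the additive white Gaussian noise mentioned in the abstract, and would need updating over time for non-stationary noise.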
Pages: 196-205
Number of pages: 10
Related papers
50 in total
  • [31] Gender recognition from vocal source
    V. N. Sorokin
    I. S. Makarov
    Acoustical Physics, 2008, 54 : 571 - 578
  • [32] Gender recognition from vocal source
    Sorokin, V. N.
    Makarov, I. S.
    ACOUSTICAL PHYSICS, 2008, 54 (04) : 571 - 578
  • [33] ANALYSIS AND MITIGATION OF VOCAL EFFORT VARIATIONS IN SPEAKER RECOGNITION
    Nandwana, Mahesh Kumar
    McLaren, Mitchell
    Ferrer, Luciana
    Castan, Diego
    Lawson, Aaron
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6001 - 6005
  • [34] Model of Acoustic Interaction between the Vocal Tract, Subglottal Region, and Vocal Source
    Gorbunov, K. S.
    Makarov, I. S.
    JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2010, 55 (12) : 1456 - 1465
  • [35] Model of acoustic interaction between the vocal tract, subglottal region, and vocal source
    K. S. Gorbunov
    I. S. Makarov
    Journal of Communications Technology and Electronics, 2010, 55 : 1456 - 1465
  • [36] Speaker Based Vocal Tract Shape Estimation for Kannada Vowels
    Prasad, Shiva K. M.
    Kumar, Anil C.
    Ramaiah, G. N. Kodanda
    Manjunatha, M. B.
    2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, SIGNALS, COMMUNICATION AND OPTIMIZATION (EESCO), 2015,
  • [37] Intelligent Diagnosis Approach for Depression Using Vocal Source Features
    Gao, Yuan
    Xin, Yinan
    Zhang, Li
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2022, 29 (03): : 971 - 975
  • [38] Development of vocal tract and acoustic features in children
    Mugitani, Ryoko
    Hiroya, Sadao
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2012, 33 (04) : 215 - 220
  • [39] SOURCE-SYSTEM INTERACTION IN VOCAL TRACT
    FLANAGAN, JL
    MEINHART, IS
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1964, 36 (10): : 2001 - &
  • [40] GLOTTAL SOURCE VOCAL-TRACT INTERACTION
    KOIZUMI, T
    TANIGUCHI, S
    HIROMITSU, S
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 78 (05): : 1541 - 1547