Noise robust estimate of speech dynamics for speaker recognition

Authors
Openshaw, JP
Mason, JS
Affiliations
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206; 082403;
Abstract
This paper investigates the robustness of cepstral-based features to additive noise, and details two methods of increasing that robustness with minimal need for a priori knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-band filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids the negative values normally inherent in such filtering, which present difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise: for example, speaker identification error rates in SNR-mismatched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches respectively.
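The two ideas in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the offset value, the block length `frame_len`, and the choice to discard the DC bin are all assumptions made for illustration. The first function adds a fixed offset to linear spectral values before the log; the second takes Fourier-transform magnitudes of each sub-band's time trajectory, which are non-negative and can therefore be passed to a log without difficulty.

```python
import numpy as np


def masked_log_spectrum(linear_spec, offset):
    """Noise-masking sketch: add a fixed offset in the linear domain, then take logs.

    `offset` is a hypothetical tuning constant, not a value from the paper.
    """
    linear_spec = np.asarray(linear_spec, dtype=float)
    return np.log(linear_spec + offset)


def modulation_magnitude(band_energies, frame_len=8):
    """Dynamic-content sketch: DFT magnitudes of each sub-band trajectory.

    band_energies: array of shape (n_frames, n_bands), linear-domain energies.
    frame_len: number of consecutive frames per analysis block (assumed value).
    Returns an array of shape (n_blocks, frame_len // 2, n_bands).
    """
    band_energies = np.asarray(band_energies, dtype=float)
    n_blocks = band_energies.shape[0] // frame_len
    # Drop trailing frames that do not fill a whole block.
    trimmed = band_energies[: n_blocks * frame_len]
    blocks = trimmed.reshape(n_blocks, frame_len, -1)
    # Fourier transform along the time axis of each block.
    spectrum = np.fft.rfft(blocks, axis=1)
    # Discard the DC bin (static content); magnitudes are never negative.
    return np.abs(spectrum[:, 1:, :])


if __name__ == "__main__":
    clean = np.array([1.0, 100.0])
    noisy = clean + 1.0  # additive noise in the linear domain
    plain_gap = np.abs(np.log(noisy) - np.log(clean))
    masked_gap = np.abs(masked_log_spectrum(noisy, 10.0)
                        - masked_log_spectrum(clean, 10.0))
    print("log-domain mismatch without offset:", plain_gap)
    print("log-domain mismatch with offset:   ", masked_gap)
```

In this toy case the offset shrinks the clean/noisy mismatch in the low-energy bin, which is the intuition behind masking; how the offset and block length should be set in practice is not stated in the abstract.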
Pages: 925-928
Number of pages: 4