Reducing the environmental sensitivity of cepstral features for speaker recognition

被引:0
|
作者
Openshaw, JP
Mason, JS
机构
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper investigates the robustness of cepstral based features with respect to additive noise, and details two methods of increasing the robustness with minimal need for a-prioro knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-land filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids negative values normally inherent in such filtering and which presents difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise, for example, speaker identification error rates in SNR mis-matched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches respectively, a relative reduction in error of 77% and 60.1%.
引用
收藏
页码:721 / 724
页数:4
相关论文
共 50 条
  • [1] Filter bank Based Cepstral Features for Speaker Recognition
    Chougule, Sharada V.
    Chavan, Mahesh S.
    Gaikwad, M. S.
    [J]. 2014 IEEE GLOBAL CONFERENCE ON WIRELESS COMPUTING AND NETWORKING (GCWCN), 2014, : 102 - 106
  • [2] Comparative Analysis on Different Cepstral Features for Speaker Identification Recognition
    Hanifa, R. M.
    Isa, K.
    Mohamad, S.
    [J]. 2020 18TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2020, : 487 - 492
  • [3] Variant Time-Frequency Cepstral Features for Speaker Recognition
    Zhang, Wei-Qiang
    Deng, Yan
    He, Liang
    Liu, Jia
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2122 - 2125
  • [4] LANGUAGE-INDEPENDENT CONSTRAINED CEPSTRAL FEATURES FOR SPEAKER RECOGNITION
    Shriberg, Elizabeth
    Stolcke, Andreas
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5296 - 5299
  • [5] A method of Automatic Speaker Recognition using cepstral features and vectorial quantization
    de Lara, JRC
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 146 - 153
  • [6] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    [J]. 2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424
  • [7] Combining Evidences from Mel Cepstral and Cochlear Cepstral Features for Speaker Recognition Using Whispered Speech
    Raikar, Aditya
    Gandhi, Ami
    Patil, Hemant A.
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 405 - 413
  • [8] A CEPSTRAL BASED SPEAKER RECOGNITION SYSTEM
    SETHURAMAN, R
    GOWDY, JN
    [J]. PROCEEDINGS : THE TWENTY-FIRST SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY, 1989, : 503 - 507
  • [9] Wavelet packet cepstral analysis for speaker recognition
    Kinney, A
    Stevens, J
    [J]. THIRTY-SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS - CONFERENCE RECORD, VOLS 1 AND 2, CONFERENCE RECORD, 2002, : 206 - 209
  • [10] Contribution of Prosodic and Cepstral Features in Improvment of a Synthesized Arabic Speaker Recognition Task performance
    Zergat, Kawthar Yasmine
    Amrouche, Abderrahmane
    Taher, Montadar Abas
    Zainal, Nasharuddin
    [J]. 2013 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED 2013), 2013, : 70 - 73