Reducing the environmental sensitivity of cepstral features for speaker recognition

Cited by: 0
Authors
Openshaw, JP
Mason, JS
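Affiliations
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code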
0808 ; 0809 ;
Abstract
This paper investigates the robustness of cepstral-based features to additive noise and details two methods of increasing that robustness with minimal need for a priori knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-band filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids the negative values normally inherent in such filtering, which present difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise. For example, speaker identification error rates in SNR-mismatched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches respectively, relative reductions in error of 77% and 60.1%.
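The noise-masking idea in the abstract, adding a fixed offset to the linear spectral estimate before taking logs, can be illustrated with a minimal sketch. This is not the authors' implementation: the helper names (`log_spectrum`, `mean_abs_diff`, `relative_reduction`), the toy filterbank energies, the flat noise level, and the offset value are all assumptions made for illustration. Only the error rates passed to `relative_reduction` come from the abstract.

```python
import math


def log_spectrum(power, offset=0.0):
    """Log of a linear power spectrum after adding a fixed offset (hypothetical helper)."""
    return [math.log(p + offset) for p in power]


def mean_abs_diff(a, b):
    """Mean absolute difference between two equal-length sequences."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def relative_reduction(baseline, improved):
    """Relative reduction in percent when an error rate drops from baseline to improved."""
    return 100.0 * (baseline - improved) / baseline


# Toy linear filterbank energies (made-up values, not taken from the paper).
clean = [100.0, 20.0, 1.0, 0.1]
noise_power = 2.0  # flat additive noise, assumed for illustration
noisy = [p + noise_power for p in clean]

offset = 10.0  # fixed masking offset, chosen arbitrarily here

# Mismatch between clean and noisy log spectra, without and with the offset.
plain_gap = mean_abs_diff(log_spectrum(clean), log_spectrum(noisy))
masked_gap = mean_abs_diff(log_spectrum(clean, offset), log_spectrum(noisy, offset))

print(f"log-spectral mismatch without offset: {plain_gap:.3f}")
print(f"log-spectral mismatch with offset:    {masked_gap:.3f}")

# Relative error reductions for the two error rates quoted in the abstract.
print(f"60.5% -> 13.8%: {relative_reduction(60.5, 13.8):.1f}% relative reduction")
print(f"60.5% -> 24.1%: {relative_reduction(60.5, 24.1):.1f}% relative reduction")
```

In this toy setting the offset shrinks the gap between clean and noisy log spectra, mostly in the low-energy bins where additive noise dominates. The last two lines recompute the relative reductions from the quoted error rates; they come out at roughly 77% and 60%, in line with the figures in the abstract.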
Pages: 721-724
Page count: 4
Related papers
50 in total
  • [31] Constrained Cepstral Speaker Recognition Using Matched UBM and JFA Training
    Sanchez, Michelle Hewlett
    Ferrer, Luciana
    Shriberg, Elizabeth
    Stolcke, Andreas
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 148 - 151
  • [32] Emerging features for speaker recognition
    Ambikairajah, Eliathamby
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1690 - 1696
  • [33] Perceptual MVDR-based cepstral coefficients (PMCCs) for speaker recognition
    LIANG Chunyan ZHANG Xiang YANG Lin ZHANG Jianping YAN Yonghong (Key Laboratory of Speech Acoustics and Content Understanding
    [J]. Chinese Journal of Acoustics, 2012, 31 (04) : 489 - 498
  • [34] Phone-based Cepstral Polynomial SVM System for Speaker Recognition
    Kajarekar, Sachin S.
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 845 - 848
  • [35] Secure Speaker Verification at Web Login Page Using Cepstral Features
    Putra, B.
    Suyanto
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2013, 35 (05): : 92 - 107
  • [36] Speaker identification and verification based on cepstral features and fuzzy nonlinear classifier
    Dustor, A.
    [J]. Proceedings of the International Conference Mixed Design of Integrated Circuits and Systems, 2006, : 692 - 697
  • [37] Real-Time Speaker Identification System using Cepstral Features
    Barik, Monalisha
    Sarangi, Susanta Kumar
    Sahu, Sushanta Kumar
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION CONTROL AND INTELLIGENT SYSTEMS (CCIS), 2016, : 89 - 93
  • [38] Feature Generator for Speaker Recognition Using the Fusion of Cepstral and Melcepstral Parameters
    Majda, Ewelina
    Dobrowolski, Andrzej P.
    [J]. 2012 JOINT CONFERENCE NEW TRENDS IN AUDIO & VIDEO AND SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, & APPLICATIONS (NTAV-SPA 2012), 2012, : 203 - 208
  • [39] Robust speaker identification via fusion of subglottal resonances and cepstral features
    Guo, Jinxi
    Yang, Ruochen
    Arsikere, Harish
    Alwan, Abeer
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (04): : EL420 - EL426
  • [40] A VQ-BASED PREPROCESSOR USING CEPSTRAL DYNAMIC FEATURES FOR SPEAKER-INDEPENDENT LARGE VOCABULARY WORD RECOGNITION
    FURUI, S
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (07): : 980 - 987