Reducing the environmental sensitivity of cepstral features for speaker recognition

被引:0
|
作者
Openshaw, JP
Mason, JS
机构
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper investigates the robustness of cepstral based features with respect to additive noise, and details two methods of increasing the robustness with minimal need for a-prioro knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-land filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids negative values normally inherent in such filtering and which presents difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise, for example, speaker identification error rates in SNR mis-matched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches respectively, a relative reduction in error of 77% and 60.1%.
引用
收藏
页码:721 / 724
页数:4
相关论文
共 50 条
  • [41] Cepstral Features and Text-Dependent Speaker Identification A Comparative Study
    Ouzounov, Atanas
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2010, 10 (01) : 3 - 12
  • [42] SPEAKER RECOGNITION BY STATISTICAL FEATURES AND DYNAMIC FEATURES
    FURUI, S
    [J]. REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1982, 30 (03): : 467 - 482
  • [43] Cepstral and Long-Term Features for Emotion Recognition
    Dumouchel, Pierre
    Dehak, Najim
    Attabi, Yazid
    Dehak, Reda
    Boufaden, Narjes
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 344 - +
  • [44] Evaluation of Lineal Relation between Shifted Delta Cepstral Features and Prosodic Features in Speaker Verification
    Calvo, Jose R.
    Ribas, Dayana
    Fernandez, Rafael
    Hernandez, Gabriel
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2008, 5197 : 112 - 119
  • [45] Robust Speech Recognition Combining Cepstral and Articulatory Features
    Zha, Zhuan-ling
    Hu, Jin
    Zhan, Qing-ran
    Shan, Ya-hui
    Xie, Xiang
    Wang, Jing
    Cheng, Hao-bo
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1401 - 1405
  • [46] SPEAKER RECOGNITION USING SYLLABLE-BASED CONSTRAINTS FOR CEPSTRAL FRAME SELECTION
    Bocklet, Tobias
    Shriberg, Elizabeth
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4525 - +
  • [47] The wavelet packet based cepstral features for open set speaker classification in Marathi
    Patil, HA
    Dutta, PK
    Basu, TK
    [J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 134 - +
  • [48] Speaker Recognition Using Mel Frequency Cepstral Coefficient and Locality Sensitive Hashing
    Awais, Ahmed
    Kun, She
    Yu, Yue
    Hayat, Shaukat
    Ahmed, Aftab
    Tu, Tianyi
    [J]. 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD), 2018, : 271 - 276
  • [49] The use of harmonic features in speaker recognition
    Imperl, B
    Kacic, Z
    Horvat, B
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1131 - 1134
  • [50] A study of harmonic features for the speaker recognition
    Imperl, B
    Kacic, Z
    Horvat, B
    [J]. SPEECH COMMUNICATION, 1997, 22 (04) : 385 - 402