Auditory-model based robust feature selection for speech recognition

Cited by: 11
Authors
Koniaris, Christos [1]
Kuropatwinski, Marcin [1]
Kleijn, W. Bastiaan [1]
Affiliations
[1] Royal Inst Technol, KTH, Sch Elect Engn, Sound & Image Proc Lab, SE-10044 Stockholm, Sweden
Source
Keywords
feature extraction; hearing; speech recognition
DOI
10.1121/1.3284545
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
It is shown that robust dimension-reduction of a feature set for speech recognition can be based on a model of the human auditory system. Whereas conventional methods optimize classification performance, the proposed method exploits knowledge implicit in the auditory periphery, inheriting its robustness. Features are selected to maximize the similarity of the Euclidean geometry of the feature domain and the perceptual domain. Recognition experiments using mel-frequency cepstral coefficients (MFCCs) confirm the effectiveness of the approach, which does not require labeled training data. For noisy data the method outperforms commonly used discriminant-analysis based dimension-reduction methods that rely on labeling. The results indicate that selecting MFCCs in their natural order results in subsets with good performance.
Pages: EL73-EL79
Number of pages: 7
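
The abstract states that features are selected so that the Euclidean geometry of the feature domain resembles that of the perceptual domain, without labels. The following is only a minimal sketch of that idea, not the authors' algorithm. It assumes that similarity is measured as the Pearson correlation between pairwise squared distances and that subsets are grown greedily. The function names (`pairwise_sq_dists`, `geometry_similarity`, `greedy_select`) and the synthetic data are hypothetical, and the "perceptual" vectors are a stand-in for the output of an auditory model.

```python
import itertools
import math
import random


def pairwise_sq_dists(points, dims=None):
    """Squared Euclidean distances over all point pairs, restricted to `dims`."""
    dims = list(range(len(points[0]))) if dims is None else list(dims)
    return [
        sum((a[d] - b[d]) ** 2 for d in dims)
        for a, b in itertools.combinations(points, 2)
    ]


def geometry_similarity(feat_dists, perc_dists):
    """Pearson correlation of two distance lists (an assumed similarity measure)."""
    n = len(feat_dists)
    mean_f = sum(feat_dists) / n
    mean_p = sum(perc_dists) / n
    cov = sum((f - mean_f) * (p - mean_p) for f, p in zip(feat_dists, perc_dists))
    var_f = sum((f - mean_f) ** 2 for f in feat_dists)
    var_p = sum((p - mean_p) ** 2 for p in perc_dists)
    if var_f == 0 or var_p == 0:
        return 0.0
    return cov / math.sqrt(var_f * var_p)


def greedy_select(features, perceptual, k):
    """Greedily pick k feature indices whose distances best track perceptual ones.

    No class labels are used: only the two sets of pairwise distances.
    """
    perc = pairwise_sq_dists(perceptual)
    selected = []
    remaining = list(range(len(features[0])))
    while len(selected) < k:
        best = max(
            remaining,
            key=lambda d: geometry_similarity(
                pairwise_sq_dists(features, selected + [d]), perc
            ),
        )
        selected.append(best)
        remaining.remove(best)
    return selected


# Synthetic stand-in data: the "perceptual" vectors depend only on features
# 0 and 2, while features 1 and 3 are unrelated noise.
rng = random.Random(0)
features = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(60)]
perceptual = [[f[0], f[2]] for f in features]

selected = greedy_select(features, perceptual, k=2)
print(sorted(selected))
```

In a real setting the rows of `features` would be per-frame MFCC vectors and `perceptual` would come from an auditory model applied to the same frames. Both of those steps are outside the scope of this sketch.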