Auditory-model based robust feature selection for speech recognition

Cited by: 11
Authors
Koniaris, Christos [1 ]
Kuropatwinski, Marcin [1 ]
Kleijn, W. Bastiaan [1 ]
Affiliation
[1] Royal Inst Technol, KTH, Sch Elect Engn, Sound & Image Proc Lab, SE-10044 Stockholm, Sweden
Keywords
feature extraction; hearing; speech recognition;
DOI
10.1121/1.3284545
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
It is shown that robust dimension-reduction of a feature set for speech recognition can be based on a model of the human auditory system. Whereas conventional methods optimize classification performance, the proposed method exploits knowledge implicit in the auditory periphery, inheriting its robustness. Features are selected to maximize the similarity of the Euclidean geometry of the feature domain and the perceptual domain. Recognition experiments using mel-frequency cepstral coefficients (MFCCs) confirm the effectiveness of the approach, which does not require labeled training data. For noisy data the method outperforms commonly used discriminant-analysis based dimension-reduction methods that rely on labeling. The results indicate that selecting MFCCs in their natural order results in subsets with good performance.
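The selection idea in the abstract, keeping the feature subset whose Euclidean geometry best matches that of the perceptual domain, can be illustrated with a minimal sketch. This is not the authors' algorithm. It assumes a perceptual representation is already available as a matrix, uses the Pearson correlation between pairwise distances as a stand-in similarity measure, and searches greedily. The names `pairwise_distances`, `geometry_similarity`, and `select_features` are hypothetical.

```python
import numpy as np


def pairwise_distances(X):
    """Euclidean distances between all distinct pairs of rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    upper = np.triu_indices(X.shape[0], k=1)  # each pair counted once
    return dist[upper]


def geometry_similarity(features, perceptual):
    """Assumed proxy: correlation of pairwise distances in the two domains."""
    d_feat = pairwise_distances(features)
    d_perc = pairwise_distances(perceptual)
    return float(np.corrcoef(d_feat, d_perc)[0, 1])


def select_features(features, perceptual, k):
    """Greedy forward selection of k feature indices (no labels needed)."""
    selected = []
    remaining = list(range(features.shape[1]))
    for _ in range(k):
        # Add the feature that most improves agreement with the
        # perceptual-domain geometry, given those already chosen.
        best = max(
            remaining,
            key=lambda j: geometry_similarity(
                features[:, selected + [j]], perceptual
            ),
        )
        selected.append(best)
        remaining.remove(best)
    return selected
```

Here `features` would hold one row per frame (for example MFCC vectors) and `perceptual` the corresponding auditory-model outputs. Neither input requires class labels, which matches the unsupervised nature of the approach described above.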
Pages: EL73-EL79
Number of pages: 7
Related papers
50 in total
  • [41] Robust emotional speech recognition based on binaural model and emotional auditory mask in noisy environments
    Meysam Bashirpour
    Masoud Geravanchizadeh
    EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [43] Discriminative feature selection for speech recognition
    Bocchieri, E.L.
    Wilpon, J.G.
    Computer Speech and Language, 1993, 7 (03): : 229 - 246
  • [44] Speech recognition based on a model of human auditory system
    Koizumi, T
    Mori, M
    Taniguchi, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 937 - 940
  • [45] A COMPARISON OF AUDITORY FEATURES FOR ROBUST SPEECH RECOGNITION
    Kelly, Finnian
    Harte, Naomi
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1968 - 1972
  • [46] Auditory contrast spectrum for robust speech recognition
    Lu, Xugang
    Dang, Jianwu
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 325 - +
  • [47] Discriminative auditory features for robust speech recognition
    Mak, B
    Tam, YC
    Li, Q
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 381 - 384
  • [48] A DFE-based algorithm for feature selection in speech recognition
    delaTorre, A
    Peinado, AM
    Rubio, AJ
    Sanchez, V
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1519 - 1522
  • [49] Robust speech recognition based on joint model and feature space optimization of hidden Markov models
    Moon, S
    Hwang, JN
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (02): : 194 - 204
  • [50] Feature extraction based on wavelet domain hidden Markov tree model for robust speech recognition
    Jung, S
    Son, J
    Bae, K
    AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 1154 - 1159