A new feature normalization scheme based on eigenspace for noisy speech recognition

Cited by: 0
Authors
Lee, Y [1]
Ko, H [1]
Affiliations
[1] Korea Univ, Dept Elect & Comp Engn, Seoul 136701, South Korea
DOI
Not available
CLC number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a new eigenspace-based feature normalization scheme for robust speech recognition. In particular, we apply Mean and Variance Normalization (MVN) in eigenspace, using separate, independent eigenspaces for the cepstra, delta cepstra, and delta-delta cepstra. We also normalize the training data in eigenspace and train the model on the normalized data. In addition, we introduce a feature space rotation procedure to reduce the mismatch between the training and test data distributions in noisy conditions. As a result, we obtain a substantial recognition improvement over basic eigenspace normalization.
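As an illustration of the idea in the abstract, the following is a minimal sketch of MVN applied in eigenspace, one eigenspace per feature stream. It is not the authors' implementation. The function names, the use of a PCA basis estimated from training frames, the 13-coefficient stream width, and the `eps` guard are all assumptions. The feature space rotation step is not sketched, because the abstract does not describe it.

```python
import numpy as np

def fit_eigenspace(train):
    """Estimate a PCA basis (assumed eigenspace) from frames of shape (T, D)."""
    mean = train.mean(axis=0)
    cov = np.cov(train - mean, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]        # reorder to descending variance
    return mean, eigvecs[:, order]

def eigenspace_mvn(frames, mean, basis, eps=1e-8):
    """Project frames into the eigenspace, then normalize mean and variance."""
    projected = (frames - mean) @ basis
    mu = projected.mean(axis=0)
    sigma = projected.std(axis=0)
    return (projected - mu) / (sigma + eps)  # eps (assumed) avoids division by zero

def normalize_streams(frames, train_frames, n_static=13):
    """Normalize cepstra, delta, and delta-delta streams in separate eigenspaces.

    Assumes each row is [static | delta | delta-delta], n_static values each.
    """
    out = []
    for k in range(3):
        cols = slice(k * n_static, (k + 1) * n_static)
        mean, basis = fit_eigenspace(train_frames[:, cols])
        out.append(eigenspace_mvn(frames[:, cols], mean, basis))
    return np.hstack(out)
```

In this sketch, passing the training frames themselves as `frames` gives the normalized training data on which a model would then be trained, which mirrors the abstract's statement that training data is also normalized in eigenspace.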
Pages: 76-78
Number of pages: 3
Related papers
50 in total
  • [1] Robust Feature Normalization Scheme Using Separated Eigenspace in Noisy Environments
    Lee, Yoonjae
    Ko, Hanseok
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2005, 24 (04) : 210 - 216
  • [2] Feature weighting in noisy speech recognition
    Huang, KC
    Juang, YT
    [J]. ELECTRONICS LETTERS, 2003, 39 (12) : 938 - 939
  • [3] Double Gaussian based feature normalization for robust speech recognition
    Liu, B
    Dai, LR
    Li, JY
    Wang, RH
    [J]. 2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 253 - 256
  • [4] Word graph based feature enhancement for noisy speech recognition
    Yan, Zhi-Jie
    Soong, Frank K.
    Wang, Ren-Hua
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 373 - +
  • [5] Model-based feature enhancement for noisy speech recognition
    Couvreur, C
    Van hamme, H
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1719 - 1722
  • [6] Temporal feature selection for noisy speech recognition
    Department of Computer Science and Software Engineering, Université Laval, Quebec
    QC
    G1V 0A6, Canada
    [J]. Lect. Notes Comput. Sci., (155-166):
  • [7] Temporal Feature Selection for Noisy Speech Recognition
    Trottier, Ludovic
    Chaib-draa, Brahim
    Giguere, Philippe
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE (AI 2015), 2015, 9091 : 155 - 166
  • [8] Feature normalization based on non-extensive statistics for speech recognition
    Pardede, Hilman F.
    Iwano, Koji
    Shinoda, Koichi
    [J]. SPEECH COMMUNICATION, 2013, 55 (05) : 587 - 599
  • [9] Temporal structure normalization of speech feature for robust speech recognition
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (07) : 500 - 503
  • [10] Emotional speech feature normalization and recognition based on speaker-sensitive feature clustering
    Huang, Chengwei
    Song, Baolin
    Zhao, Li
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (04) : 805 - 816