Histogram Equalization to Model Adaptation for Robust Speech Recognition

被引:1
|
作者
Suh, Youngjoo [1 ]
Kim, Hoirin [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Taejon 305701, South Korea
关键词
COMPENSATION;
D O I
10.1155/2010/628018
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose a new model adaptation method based on the histogram equalization technique for providing robustness in noisy environments. The trained acoustic mean models of a speech recognizer are adapted into environmentally matched conditions by using the histogram equalization algorithm on a single utterance basis. For more robust speech recognition in the heavily noisy conditions, trained acoustic covariance models are efficiently adapted by the signal-to-noise ratio-dependent linear interpolation between trained covariance models and utterance-level sample covariance models. Speech recognition experiments on both the digit-based Aurora2 task and the large vocabulary-based task showed that the proposed model adaptation approach provides significant performance improvements compared to the baseline speech recognizer trained on the clean speech data. Copyright (C) 2010 Y. Suh and H. Kim.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Histogram Equalization to Model Adaptation for Robust Speech Recognition
    Youngjoo Suh
    Hoirin Kim
    [J]. EURASIP Journal on Advances in Signal Processing, 2010
  • [2] MAXIMUM LIKELIHOOD ADAPTATION OF HISTOGRAM EQUALIZATION WITH CONSTRAINT FOR ROBUST SPEECH RECOGNITION
    Xiao, Xiong
    Li, Jinyu
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5480 - 5483
  • [3] Histogram equalization of speech representation for robust speech recognition
    de la Torre, A
    Peinado, AM
    Segura, JC
    Pérez-Córdoba, JL
    Benítez, MC
    Rubio, AJ
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (03): : 355 - 366
  • [4] Attribute-based histogram equalization (HEQ) and its adaptation for robust speech recognition
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 876 - 880
  • [5] HISTOGRAM EQUALIZATION AND NOISE MASKING FOR ROBUST SPEECH RECOGNITION
    Zhang, Xueru
    Demuynck, Kris
    Van Hamme, Hugo
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4578 - 4581
  • [6] Probabilistic class histogram equalization for robust speech recognition
    Suh, Youngjoo
    Ji, Mikyong
    Kim, Hoirin
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (04) : 287 - 290
  • [7] Histogram equalization of contextual statistics of speech features for robust speech recognition
    Hsieh, Hsin-Ju
    Chen, Berlin
    Hung, Jeih-weih
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (17) : 6769 - 6795
  • [8] Histogram equalization of contextual statistics of speech features for robust speech recognition
    Hsin-Ju Hsieh
    Berlin Chen
    Jeih-weih Hung
    [J]. Multimedia Tools and Applications, 2015, 74 : 6769 - 6795
  • [9] Stereo-based histogram equalization for robust speech recognition
    Al-Wakeel, Randa
    Shoman, Mahmoud
    Aboul-Ela, Magdy
    Abdou, Sherif
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015,
  • [10] Stereo-based histogram equalization for robust speech recognition
    Randa Al-Wakeel
    Mahmoud Shoman
    Magdy Aboul-Ela
    Sherif Abdou
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2015