Histogram Equalization to Model Adaptation for Robust Speech Recognition

Cited by: 1

Authors:

Suh, Youngjoo [1]

Kim, Hoirin [1]

Affiliation:

[1] Korea Adv Inst Sci & Technol, Taejon 305701, South Korea

Source:

EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2010

Keywords:

COMPENSATION;

DOI:

10.1155/2010/628018

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

We propose a new model adaptation method based on the histogram equalization technique for providing robustness in noisy environments. The trained acoustic mean models of a speech recognizer are adapted to environmentally matched conditions by applying the histogram equalization algorithm on a single-utterance basis. For more robust speech recognition in heavily noisy conditions, the trained acoustic covariance models are efficiently adapted by signal-to-noise ratio-dependent linear interpolation between the trained covariance models and utterance-level sample covariance models. Speech recognition experiments on both the digit-based Aurora2 task and a large-vocabulary task showed that the proposed model adaptation approach provides significant performance improvements over the baseline speech recognizer trained on clean speech data. Copyright (C) 2010 Y. Suh and H. Kim.
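
The two operations named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact mapping or weighting, so the per-dimension empirical-CDF matching in `heq_map`, the linear SNR ramp in `interpolate_variance`, and the `snr_low` / `snr_high` thresholds are all assumptions made for illustration, as are the function names themselves.

```python
import numpy as np


def heq_map(values, ref_samples, target_samples):
    """Map `values` from a reference distribution onto a target distribution
    by matching empirical CDFs (generic histogram equalization, one
    feature dimension). Assumed usage: `values` are trained model means,
    `ref_samples` are clean-domain features, `target_samples` are features
    from the noisy test utterance.
    """
    ref_sorted = np.sort(np.asarray(ref_samples, dtype=float))
    tgt = np.asarray(target_samples, dtype=float)
    # Empirical CDF of the reference distribution at each value.
    cdf = np.searchsorted(ref_sorted, values, side="right") / ref_sorted.size
    # Inverse empirical CDF of the target distribution, taken as a quantile.
    return np.quantile(tgt, np.clip(cdf, 0.0, 1.0))


def interpolate_variance(trained_var, sample_var, snr_db,
                         snr_low=0.0, snr_high=20.0):
    """Linearly interpolate between trained and utterance-level variances.

    Assumption: the weight on the trained variance ramps linearly from 0 at
    `snr_low` dB to 1 at `snr_high` dB; both thresholds are hypothetical.
    """
    w = np.clip((snr_db - snr_low) / (snr_high - snr_low), 0.0, 1.0)
    return w * np.asarray(trained_var) + (1.0 - w) * np.asarray(sample_var)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.normal(0.0, 1.0, size=2000)   # stand-in for clean features
    noisy = rng.normal(1.5, 0.6, size=300)    # stand-in for one noisy utterance
    means = np.array([-1.0, 0.0, 1.0])        # stand-in for trained means
    print(heq_map(means, clean, noisy))
    print(interpolate_variance([1.0, 2.0], [3.0, 4.0], snr_db=10.0))
```

Under these assumptions, a high-SNR utterance leaves the trained variances nearly unchanged, while a low-SNR utterance pushes them toward the sample variances measured on that utterance.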


Pages: 8
