A discriminative and robust training algorithm for noisy speech recognition

Author
Hong, WT
Affiliation
Keywords
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
A technique that combines discriminative and robust training, referred to as D-REST (Discriminative and Robust Environment-effects Suppression Training), is proposed for noisy speech recognition. D-REST models environmental characteristics and phonetic information separately, so it can train speech models discriminatively on phonetic variability while suppressing the disturbance caused by environment-specific effects. In experiments on a Taiwan stock name recognition task over a wireless network, the proposed D-REST algorithm shows the potential to improve performance both on diverse training data and when the noise types in training and testing do not match. Furthermore, D-REST achieved a 60% reduction in average word error rate relative to the conventional MCE/GPD-based training approach without environment-effects suppression training.
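For readers unfamiliar with the MCE/GPD baseline named in the abstract, the following is a minimal sketch of the textbook minimum classification error (MCE) loss for a single utterance. It is not the D-REST method and is not taken from the paper; the function name `mce_loss` and the parameters `eta`, `gamma`, and `theta` are illustrative assumptions. In GPD training, model parameters would be updated by gradient descent on this smoothed loss.

```python
import math

def mce_loss(scores, correct, eta=1.0, gamma=1.0, theta=0.0):
    """Smoothed MCE loss for one utterance (illustrative sketch only).

    scores  : discriminant scores g_j(x), one per class (e.g. log-likelihoods)
    correct : index of the true class
    eta     : smoothing of the competitor average (large eta -> best competitor)
    gamma, theta : slope and offset of the sigmoid (assumed hyperparameters)
    """
    competitors = [g for j, g in enumerate(scores) if j != correct]
    # (1/eta) * log of the mean of exp(eta * g_j) over competing classes,
    # computed with the maximum factored out for numerical stability.
    m = max(competitors)
    mean_exp = sum(math.exp(eta * (g - m)) for g in competitors) / len(competitors)
    anti_score = m + math.log(mean_exp) / eta
    # Misclassification measure: positive when competitors outscore the true class.
    d = -scores[correct] + anti_score
    # The sigmoid turns d into a differentiable approximation of the 0/1 error.
    return 1.0 / (1.0 + math.exp(-gamma * d + theta))
```

A loss near 0 means the true class wins by a clear margin, a loss near 1 means a competitor wins, and 0.5 marks the decision boundary when `theta` is 0.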
Pages: 8-11
Page count: 4