A discriminative and robust training algorithm for noisy speech recognition

被引:0
|
作者
Hong, WT
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A combined technique of discriminative and robust training algorithms, referred to as the D-REST (Discriminative and Robust Environment-effects Suppression Training), is proposed for noisy speech recognition. The D-REST technique can separately model the environmental characteristics and phonetic information and thus it can train speech models discriminatively on phonetic variability by eliminating the disturbance of environment-specific effects. According to the experimental results of Taiwan stock name recognition task over wireless network, the proposed D-REST algorithm has the potential to improve performance not only on diverse training data but also on noise-type unmatched environments between training and testing. Furthermore, the usage of the D-REST algorithm amounted to a 60% reduction in average word error rate over the performance by the conventional MCE/GPD-based training approach without environment-effects suppression training technique.
引用
收藏
页码:8 / 11
页数:4
相关论文
共 50 条
  • [21] Techniques for robust speech recognition in noisy and reverberant conditions
    Brown, GJ
    Palomäki, KJ
    SPEECH SEPARATION BY HUMANS AND MACHINES, 2005, : 213 - 220
  • [22] Discriminative training of HMMs for automatic speech recognition: A survey
    Jiang, Hui
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 589 - 608
  • [23] TOWARDS SINGLE PASS DISCRIMINATIVE TRAINING FOR SPEECH RECOGNITION
    Hsiao, Roger
    Schultz, Tanja
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4093 - 4096
  • [24] Discriminative training of stochastic Markov graphs for speech recognition
    Wolfertstetter, F
    Ruske, G
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 581 - 584
  • [25] Adaptive Training with Noisy Constrained Maximum Likelihood Linear Regression for Noise Robust Speech Recognition
    Kim, D. K.
    Gales, M. J. F.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2367 - 2370
  • [26] Noisy training for deep neural networks in speech recognition
    Shi Yin
    Chao Liu
    Zhiyong Zhang
    Yiye Lin
    Dong Wang
    Javier Tejedor
    Thomas Fang Zheng
    Yinguo Li
    EURASIP Journal on Audio, Speech, and Music Processing, 2015
  • [27] Improved Noisy Student Training for Automatic Speech Recognition
    Park, Daniel S.
    Zhang, Yu
    Jia, Ye
    Han, Wei
    Chiu, Chung-Cheng
    Li, Bo
    Wu, Yonghui
    Le, Quoc, V
    INTERSPEECH 2020, 2020, : 2817 - 2821
  • [28] Noisy training for deep neural networks in speech recognition
    Yin, Shi
    Liu, Chao
    Zhang, Zhiyong
    Lin, Yiye
    Wang, Dong
    Tejedor, Javier
    Zheng, Thomas Fang
    Li, Yinguo
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015, : 1 - 14
  • [29] Robust Voice Activity Detection Algorithm for Noisy Speech
    Verteletskaya, Ekaterina
    Simak, Boris
    RTT 2009: 11TH INTERNATIONAL CONFERENCE RTT 2009 RESEARCH IN TELECOMMUNICATION TECHNOLOGY, CONFERENCE PROCEEDINGS, 2009, : 98 - 101
  • [30] A robust algorithm for formant frequency extraction of noisy speech
    Zhao, QF
    Shimamura, T
    Suzuki, J
    ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : D534 - D537